uop's picture
1

uop

parixit8985
Β·

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago
ameerazam08/Diffusion-Eraser
reacted to tegridydev's post with ❀️ about 1 month ago
Open-MalSec v0.1 – Open-Source Cybersecurity Dataset Evening! 🫑 πŸ“‚ Just uploaded an early-stage open-source cybersecurity dataset focused on phishing, scams, and malware-related text samples. This is the base version (v0.1)β€”a few structured sample files. Full dataset builds will come over the next few weeks. πŸ”— Dataset link: https://huggingface.co/datasets/tegridydev/open-malsec πŸ” What’s in v0.1? A few structured scam examples (text-based) Covers DeFi, crypto, phishing, and social engineering Initial labelling format for scam classification ⚠️ This is not a full dataset yet (samples are currently available). Just establishing the structure + getting feedback. πŸ“‚ Current Schema & Labelling Approach "instruction" β†’ Task prompt (e.g., "Evaluate this message for scams") "input" β†’ Source & message details (e.g., Telegram post, Tweet) "output" β†’ Scam classification & risk indicators πŸ—‚οΈ Current v0.1 Sample Categories Crypto Scams β†’ Meme token pump & dumps, fake DeFi projects Phishing β†’ Suspicious finance/social media messages Social Engineering β†’ Manipulative messages exploiting trust πŸ”œ Next Steps - Expanding datasets with more phishing & malware examples - Refining schema & annotation quality - Open to feedback, contributions, and suggestions If this is something you might find useful, bookmark/follow/like the dataset repo <3 πŸ’¬ Thoughts, feedback, and ideas are always welcome! Drop a comment or DMs are open πŸ€™
published a Space about 1 month ago
parixit8985/tesst5
View all activity

Organizations

None yet

parixit8985's activity

reacted to tegridydev's post with ❀️ about 1 month ago
view post
Post
1439
Open-MalSec v0.1 – Open-Source Cybersecurity Dataset

Evening! 🫑

πŸ“‚ Just uploaded an early-stage open-source cybersecurity dataset focused on phishing, scams, and malware-related text samples.

This is the base version (v0.1)β€”a few structured sample files. Full dataset builds will come over the next few weeks.

πŸ”— Dataset link:

tegridydev/open-malsec

πŸ” What’s in v0.1?
A few structured scam examples (text-based)
Covers DeFi, crypto, phishing, and social engineering
Initial labelling format for scam classification

⚠️ This is not a full dataset yet (samples are currently available). Just establishing the structure + getting feedback.

πŸ“‚ Current Schema & Labelling Approach
"instruction" β†’ Task prompt (e.g., "Evaluate this message for scams")
"input" β†’ Source & message details (e.g., Telegram post, Tweet)
"output" β†’ Scam classification & risk indicators

πŸ—‚οΈ Current v0.1 Sample Categories
Crypto Scams β†’ Meme token pump & dumps, fake DeFi projects
Phishing β†’ Suspicious finance/social media messages
Social Engineering β†’ Manipulative messages exploiting trust

πŸ”œ Next Steps
- Expanding datasets with more phishing & malware examples
- Refining schema & annotation quality
- Open to feedback, contributions, and suggestions

If this is something you might find useful, bookmark/follow/like the dataset repo <3

πŸ’¬ Thoughts, feedback, and ideas are always welcome! Drop a comment or DMs are open πŸ€™
published a Space about 1 month ago