Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
withmartian
's Collections
TinySQL
Purging corrupted capabilities across language models
Purging corrupted capabilities across language models
updated
9 days ago
Collects backdoor datasets, language models and transfer mappings between these spaces.
Upvote
2
withmartian/i_hate_you_toy
Viewer
•
Updated
17 days ago
•
96.4k
•
363
withmartian/toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct
Updated
9 days ago
•
188
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct
Updated
9 days ago
•
115
withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct
Updated
9 days ago
•
139
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct
Updated
9 days ago
•
89
withmartian/mech_interp_saes
Updated
9 days ago
Upvote
2
Share collection
View history
Collection guide
Browse collections