arxiv:2501.17433
TianshengHuang
TianshengHuang
AI & ML interests
LLM safety
Recent Activity
authored
a paper
5 days ago
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing
Guardrail Moderation
commented on
a paper
6 days ago
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing
Guardrail Moderation
Organizations
None yet
models
None public yet
datasets
None public yet