Zhexin Zhang's picture

2 2 1

Zhexin Zhang

nonstopfor

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

commented a paper 2 days ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

liked a model about 2 months ago

thu-coai/ShieldLM-7B-internlm2

View all activity

Organizations

nonstopfor's activity

commented a paper 2 days ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published 8 days ago • 8 •

commented a paper 6 months ago

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3 • 10 •