JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Paper • 2404.01318 • Published Mar 28
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition Paper • 2406.07954 • Published Jun 12 • 2
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents Paper • 2406.13352 • Published Jun 19
ARB: Advanced Reasoning Benchmark for Large Language Models Paper • 2307.13692 • Published Jul 25, 2023 • 16
Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems Paper • 2210.03297 • Published Oct 7, 2022