Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In Paper • 2410.16950 • Published Oct 22, 2024
Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations Paper • 2403.06009 • Published Mar 9, 2024