Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System • Paper • 2502.16750 • Published about 1 month ago • 10
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution • Paper • 2502.18449 • Published 29 days ago • 71
Beyond Release: Access Considerations for Generative AI Systems • Paper • 2502.16701 • Published about 1 month ago • 12
Slamming: Training a Speech Language Model on One GPU in a Day • Paper • 2502.15814 • Published Feb 19 • 68
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement • Paper • 2410.04444 • Published Oct 6, 2024 • 2
Tree of Thoughts: Deliberate Problem Solving with Large Language Models • Paper • 2305.10601 • Published May 17, 2023 • 12
Chain of Hindsight Aligns Language Models with Feedback • Paper • 2302.02676 • Published Feb 6, 2023 • 1
ReAct: Synergizing Reasoning and Acting in Language Models • Paper • 2210.03629 • Published Oct 6, 2022 • 24