OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper ā¢ 2407.16741 ā¢ Published Jul 23 ā¢ 68
Advancing LLM Reasoning Generalists with Preference Trees Paper ā¢ 2404.02078 ā¢ Published Apr 2 ā¢ 44
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales Paper ā¢ 2405.20974 ā¢ Published May 31
A Single Transformer for Scalable Vision-Language Modeling Paper ā¢ 2407.06438 ā¢ Published Jul 8 ā¢ 1
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper ā¢ 2407.16741 ā¢ Published Jul 23 ā¢ 68
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper ā¢ 2406.11896 ā¢ Published Jun 14 ā¢ 18
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback Paper ā¢ 2309.10691 ā¢ Published Sep 19, 2023 ā¢ 4
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets Paper ā¢ 2309.17428 ā¢ Published Sep 29, 2023 ā¢ 1
R-Tuning: Teaching Large Language Models to Refuse Unknown Questions Paper ā¢ 2311.09677 ā¢ Published Nov 16, 2023 ā¢ 3
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper ā¢ 2401.00812 ā¢ Published Jan 1 ā¢ 3