Towards General-Purpose Model-Free Reinforcement Learning • Paper • 2501.16142 • Published 1 day ago • 15
Towards Best Practices for Open Datasets for LLM Training • Paper • 2501.08365 • Published 15 days ago • 50
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent • Paper • 2402.09844 • Published Feb 15, 2024 • 21
Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search • Paper • 2408.10635 • Published Aug 20, 2024 • 15
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent • Paper • 2312.10003 • Published Dec 15, 2023 • 38