Hierarchical reinforcement learning with natural language subgoals Paper • 2309.11564 • Published Sep 20, 2023
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent Paper • 2312.10003 • Published Dec 15, 2023