DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published Sep 12, 2024
From Words to Routes: Applying Large Language Models to Vehicle Routing Paper • 2403.10795 • Published Mar 16, 2024
HyperPPO: A scalable method for finding small policies for robotic control Paper • 2309.16663 • Published Sep 28, 2023
Decentralized Control of Quadrotor Swarms with End-to-end Deep Reinforcement Learning Paper • 2109.07735 • Published Sep 16, 2021
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning Paper • 2006.11751 • Published Jun 21, 2020
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control Paper • 2306.09537 • Published Jun 15, 2023
Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning Paper • 2309.13285 • Published Sep 23, 2023
Guaranteed Trust Region Optimization via Two-Phase KL Penalization Paper • 2312.05405 • Published Dec 8, 2023