Agent tuning zai-org/SWE-Dev-train Viewer • Updated about 1 month ago • 20.1k • 207 • 6 SWE-Gym/OpenHands-SFT-Trajectories Viewer • Updated May 10 • 491 • 351 • 11 lmarena-ai/webdev-arena-preference-10k Viewer • Updated Mar 10 • 10.5k • 253 • 11 SWE-bench/SWE-smith-trajectories Viewer • Updated 21 days ago • 76k • 1.33k • 21
Agent Benchmarks xw27/scibench Viewer • Updated May 6, 2024 • 692 • 330 • 19 google/frames-benchmark Viewer • Updated Oct 15, 2024 • 824 • 3.37k • 218 gaia-benchmark/GAIA Updated Feb 13 • 7.66k • 406 HuggingFaceH4/MATH-500 Viewer • Updated Nov 15, 2024 • 500 • 68.5k • 169
Agent tuning zai-org/SWE-Dev-train Viewer • Updated about 1 month ago • 20.1k • 207 • 6 SWE-Gym/OpenHands-SFT-Trajectories Viewer • Updated May 10 • 491 • 351 • 11 lmarena-ai/webdev-arena-preference-10k Viewer • Updated Mar 10 • 10.5k • 253 • 11 SWE-bench/SWE-smith-trajectories Viewer • Updated 21 days ago • 76k • 1.33k • 21
Agent Benchmarks xw27/scibench Viewer • Updated May 6, 2024 • 692 • 330 • 19 google/frames-benchmark Viewer • Updated Oct 15, 2024 • 824 • 3.37k • 218 gaia-benchmark/GAIA Updated Feb 13 • 7.66k • 406 HuggingFaceH4/MATH-500 Viewer • Updated Nov 15, 2024 • 500 • 68.5k • 169