
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

💻 GitHub | 🤗 HuggingFace | Updated: {LAST_UPDATED}