Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
72
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
645fb7b
open-r1-eval-leaderboard
Ctrl+K
Ctrl+K
4 contributors
History:
17078 commits
edbeeching
HF Staff
Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-14-23.207355.json with huggingface_hub
645fb7b
verified
10 months ago
eval_results
Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-14-23.207355.json with huggingface_hub
10 months ago
.gitattributes
Safe
1.6 kB
Upload eval_results/Qwen/Qwen1.5-0.5B/main/gsm8k with huggingface_hub
about 1 year ago
.gitignore
Safe
3.08 kB
Add app
about 1 year ago
README.md
Safe
256 Bytes
Update README.md
about 1 year ago
app.py
Safe
9.68 kB
Add ifeval metrics
10 months ago
debug.ipynb
Safe
20.5 kB
Add agg
11 months ago
requirements.txt
Safe
6 Bytes
Add app
about 1 year ago