Space: yentinglin/zhtw-reasoning-eval-leaderboard
Duplicated from: open-r1/open-r1-eval-leaderboard
Branch: main
Path: zhtw-reasoning-eval-leaderboard/eval_results
3 contributors
History: 21221 commits
Latest commit (b9ec16e, verified, 2 months ago) by yentinglin:
Upload eval_results//fsx/ubuntu/yentinglin/ckpt/run_20250223_0340//True/gpqa:diamond/results_2025-02-23T06-48-22.757338.json with huggingface_hub
Directory | Last commit message | Last updated
GAIR | Upload eval_results/GAIR/LIMO/main/gpqa:diamond/results_2025-02-15T15-14-48.876554.json with huggingface_hub | 2 months ago
data | Upload eval_results/data/Mistral-Small-24B-Instruct-2501-S1-ZHS1-SFT/True/aime24/results_2025-02-14T05-16-30.002267.json with huggingface_hub | 2 months ago
deepseek-ai | Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/main/gpqa:diamond/results_2025-02-16T15-54-30.662634.json with huggingface_hub | 2 months ago
fsx | Upload eval_results//fsx/ubuntu/yentinglin/ckpt/run_20250223_0340//True/gpqa:diamond/results_2025-02-23T06-48-22.757338.json with huggingface_hub | 2 months ago
mistralai | Upload eval_results/mistralai/Mistral-Small-24B-Instruct-2501/main/aime24/results_2025-02-14T04-20-38.981835.json with huggingface_hub | 2 months ago
simplescaling | Upload eval_results/simplescaling/s1.1-32B/4d6d573e59b7fafae141a124cdd3f541e5aa967a/math_500/results_2025-02-15T15-05-40.169064.json with huggingface_hub | 2 months ago