Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Quadyun
's Collections
Reward Model
All Math Benchmark Datasets
MATH-TIR
All Math Benchmark Datasets
updated
7 days ago
Upvote
-
AI-MO/aimo-validation-aime
Viewer
•
Updated
Jul 10
•
90
•
1.71k
•
16
lighteval/MATH
Viewer
•
Updated
Oct 17, 2023
•
25k
•
8.76k
•
61
HuggingFaceH4/MATH-500
Viewer
•
Updated
Nov 15
•
500
•
4.84k
•
26
TIGER-Lab/MMLU-STEM
Viewer
•
Updated
Jun 20
•
3.15k
•
12k
•
9
Upvote
-
Share collection
View history
Collection guide
Browse collections