Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ScalerLab
community
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
kylemontgomery
updated
a Space
about 2 months ago
ScalerLab/JudgeBench
sijuntan
authored
a paper
2 months ago
JudgeBench: A Benchmark for Evaluating LLM-based Judges
kylemontgomery
authored
a paper
2 months ago
Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning
View all activity
Team members
2
spaces
1
Running
17
🏆
JudgeBench Leaderboard
models
None public yet
datasets
1
ScalerLab/JudgeBench
Viewer
•
Updated
Oct 9
•
620
•
154
•
4