Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
WildEval
non-profit
wild_eval
WildEval
Activity Feed
Request to join this org
Follow
12
AI & ML interests
None defined yet.
Recent Activity
valpy
Â
authored
a paper
8 days ago
2 OLMo 2 Furious
valpy
Â
authored
a paper
8 days ago
IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance
valpy
Â
authored
a paper
8 days ago
RewardBench 2: Advancing Reward Model Evaluation
View all activity
Team members
9
WildEval
's Spaces
1
Sort:Â Recently updated
pinned
Runtime error
6
Zebra Logic Bench
🦓
Explore and evaluate Zebra Logic models