ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
Paper
•
2412.06745
•
Published
•
6
Foundation Models, Scaling Laws, Continual Pretraining, Data-centric ML, Neuroscience, Human-Machine comparisons