SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 4 days ago • 91
FoNE: Precise Single-Token Number Embeddings via Fourier Features Paper • 2502.09741 • Published 11 days ago • 11
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning Paper • 2501.12570 • Published Jan 22 • 24