Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated about 23 hours ago • 9
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper • 2502.14922 • Published 23 days ago • 30
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published about 1 month ago • 47
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 263
view article Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76