Running 535 535 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
UNIVA-Bllossom/DeepSeek-llama3.3-Bllossom-70B Text Generation • Updated 28 days ago • 3.48k • 52
UNIVA-Bllossom/DeepSeek-llama3.1-Bllossom-8B Text Generation • Updated 29 days ago • 8.15k • 38
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 346