DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 4 days ago • 185
nexa-collaboration/output_llama3-1_8b_distillation_from_sparse Text Generation • Updated 6 days ago • 7
nexa-collaboration/output_llama3-1_8b_distillation_from_sparse Text Generation • Updated 6 days ago • 7