Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 13 days ago • 124
LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025 By kikikita and 1 other • about 19 hours ago • 7
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 11 days ago • 47
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 193
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 4 days ago • 4
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 19 days ago • 46
Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • 6 days ago • 3
Introducing any-llm: A unified API to access any LLM provider By mozilla-ai and 1 other • 7 days ago • 3
Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 13 days ago • 124
LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025 By kikikita and 1 other • about 19 hours ago • 7
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 11 days ago • 47
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 193
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 4 days ago • 4
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 19 days ago • 46
Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • 6 days ago • 3
Introducing any-llm: A unified API to access any LLM provider By mozilla-ai and 1 other • 7 days ago • 3