🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? By Kseniase • 24 days ago • 145
Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers? By Kseniase and 1 other • 6 days ago • 13
Deepsite:HuggingFace Founder Introduces Free Web-Based 'Cursor' Alternative By LLMhacker • 9 days ago • 13
Optimise AI Models and Make Them Faster, Smaller, Cheaper, Greener By PrunaAI and 2 others • 7 days ago • 12
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 101
Training Large Language Models with Interpreter Feedback using WebAssembly By axolotl-ai-co and 1 other • 7 days ago • 10
Empowering Public Organizations: Preparing Your Data for the AI Era By evijit and 1 other • about 11 hours ago • 8
Porting Pi0-FAST to LeRobot from JAX to PyTorch: Challenges, Fixes, and Open Questions By danaaubakirova and 3 others • 8 days ago • 8