view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 9 days ago • 326
🇩🇪German SFT and DPO datasets Collection Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 33 items • Updated Jan 23 • 11
Evolutionary Optimization of Model Merging Recipes Paper • 2403.13187 • Published Mar 19, 2024 • 53
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 108
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated 21 days ago • 99