Running 2.24k 2.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • Updated about 8 hours ago • 17.4k • 294
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 16 days ago • 106
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated 14 days ago • 98
Running on CPU Upgrade 91 91 Open LLM Leaderboard Model Comparator 🏆 Compare Open LLM Leaderboard results