Running 2.15k 2.15k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation β’ Updated 14 days ago β’ 1.49M β’ β’ 1.24k
cognitivecomputations/dolphin-2.9.2-qwen2-7b Text Generation β’ Updated Jun 18, 2024 β’ 2.5k β’ 67
Running on CPU Upgrade 140 140 Open Arabic LLM Leaderboard π Track, rank and evaluate open Arabic LLMs and chatbots
Running 852 852 FineWeb: decanting the web for the finest text data at scale π· Generate high-quality web text data for LLM training
argilla/distilabel-capybara-dpo-7k-binarized Viewer β’ Updated Jul 16, 2024 β’ 7.56k β’ 2.62k β’ 181