Running 1.69k 1.69k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 Text Generation • Updated Aug 7, 2024 • 4.84k • 22