Running 2.31k 2.31k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF Text Generation β’ Updated Jan 20 β’ 314k β’ 70
Running 535 535 Scaling test-time compute π Enhance math problem solving by scaling test-time compute