Running 2.29k The Ultra-Scale Playbook: The ultimate guide to training LLM on large GPU Clusters
lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF Text Generation • Updated Jan 20 • 324k • 69
Running 536 Scaling test-time compute: Enhance math problem solving by scaling test-time compute