Running 3.07k 3.07k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running on Zero 529 529 Chat with DeepSeek-VL2-small 🌍 Generate responses using images and text input
Running 1.04k 1.04k FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 34