Running on CPU Upgrade 215 215 GPT-OSS-120B on AMD MI300X 💻 gpt-oss-120b model running on AMD MI300 infrastructure.
view article Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • 10 days ago • 45
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 410
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models By loubnabnl and 2 others • Mar 20, 2024 • 100
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 139
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 626
view article Article The Missing Semester of AI for Organizations #1: LLM Security By huseyingulsin • 11 days ago • 8
view post Post 4194 Run OpenAI's new gpt-oss models locally with Unsloth GGUFs! 🔥🦥20b GGUF: unsloth/gpt-oss-20b-GGUF120b GGUF: unsloth/gpt-oss-120b-GGUFModel will run on 14GB RAM for 20b and 66GB for 120b. See translation 2 replies · ❤️ 16 16 🔥 5 5 🚀 4 4 + Reply
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 13 days ago • 459
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 11 days ago • 292
view post Post 3519 Qwen3-30B-A3B-Thinking-2507 🔥 latest step in scaling thinking capabilities from Alibaba Qwen team. Qwen/Qwen3-30B-A3B-Thinking-2507-FP8✨ 30B total / 3B active - Apache 2.0 ✨ Native 256K context✨ SOTA coding, alignment, agentic reasoning See translation 🔥 9 9 + Reply