Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 10 days ago • 56
Article Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • 11 days ago • 64
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 11 days ago • 294
Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 13 days ago • 459
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 20 days ago • 155
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents Paper • 2507.04009 • Published Jul 5 • 39
Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 53
Article Test-Driving the LLMD Inference Engine by ZML 🚀 By erikkaum • about 1 month ago • 22
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 646
Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages By davanstrien and 5 others • Jul 8 • 29
Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • Jul 2 • 72
Article Teaching Data Literacy with Hugging Face's AI Sheets By ParulPandey • Jun 30 • 23
Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 115
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 65
Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 125
Article Featherless AI on Hugging Face Inference Providers 🔥 By sbrandeis and 5 others • Jun 12 • 47
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 46