High-Fidelity Simultaneous Speech-To-Speech Translation Paper • 2502.03382 • Published 4 days ago • 8
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 2 days ago • 38
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 4 days ago • 137
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 5 days ago • 80
view article Article 🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces By ariG23498 • 7 days ago • 5
view article Article **How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents** By Steveeeeeeen • 10 days ago • 15
view article Article 🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! By ariG23498 • 11 days ago • 13
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 7 items • Updated 2 days ago • 41
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 13 days ago • 330
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 14 days ago • 99
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 19 days ago • 60
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 17 days ago • 67
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • 19 days ago • 32
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 17 days ago • 31