Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 37 items • Updated about 16 hours ago • 32
Trained Models 🏋️ Collection They may be small, but they're training like giants! • 9 items • Updated about 16 hours ago • 20
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 241