view article Article Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code By ImranzamanML • Oct 2, 2024 • 42
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14, 2024 • 538
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 18 days ago • 132
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 13 days ago • 74
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 13 days ago • 40
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 13 days ago • 17
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 12 days ago • 107
TimesFM Release Collection TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated 8 days ago • 10
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 181
Centaurus Collection Series of uncensored models based on Llama-3. • 5 items • Updated May 27, 2024 • 4
Recommended small models Collection This is everything recent smaller than ~25B parameters that are high quality/reputable • 19 items • Updated Nov 30, 2024 • 31
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 64 items • Updated about 1 hour ago • 491