MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper • 2411.10438 • Published Nov 15 • 13
Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published Nov 21 • 43
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 3 days ago • 195
Granite 3.0 Language Models Collection A series of language models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 7 days ago • 95
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 126
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 12 days ago • 142
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 12 days ago • 47
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 86
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models • 15 items • Updated 19 days ago • 548
Article Unleash ML Power on iOS: Apple Silicon Optimization Secrets By fguzman82 • Jul 18 • 4
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated 20 days ago • 180