SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 22 days ago • 129
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 23 days ago • 65
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 870
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 2 days ago • 145