Vision LLM Collecting best Vision LLMs - to study and learn from them rhymes-ai/Aria Image-Text-to-Text • 25B • Updated Apr 23 • 28.9k • 632 microsoft/OmniParser Image-Text-to-Text • Updated Dec 2, 2024 • 1.37k • 1.69k jadechoghari/Ferret-UI-Gemma2b Image-Text-to-Text • 3B • Updated Oct 18, 2024 • 385 • 50 jadechoghari/Ferret-UI-Llama8b Image-Text-to-Text • 8B • Updated Jan 8 • 446 • 68
Vision LLM Collecting best Vision LLMs - to study and learn from them rhymes-ai/Aria Image-Text-to-Text • 25B • Updated Apr 23 • 28.9k • 632 microsoft/OmniParser Image-Text-to-Text • Updated Dec 2, 2024 • 1.37k • 1.69k jadechoghari/Ferret-UI-Gemma2b Image-Text-to-Text • 3B • Updated Oct 18, 2024 • 385 • 50 jadechoghari/Ferret-UI-Llama8b Image-Text-to-Text • 8B • Updated Jan 8 • 446 • 68