microsoft/Phi-3-vision-128k-instruct Text Generation β’ 4B β’ Updated Aug 20, 2024 β’ 39.1k β’ 964
Running on Zero 789 789 Florence 2 π Analyze images to generate captions, detect objects, or perform OCR
prometheus-eval/prometheus-8x7b-v2.0 Text Generation β’ 47B β’ Updated Nov 29, 2024 β’ 4.59k β’ 50
Running 554 554 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects