Running
on
Zero
984
π¬
Try PaliGemma on document understanding tasks
GPT 4o like bot.
Microsoft Phi-3 Vision 128k with Multimodal capabilities
A Fully Open Multilingual Multimodal LLM for 39 Languages
Demo for DocLayout-YOLO
Huggingface space for JanusFlow-1.3B
PaliGemma2 LoRA finetuned on VQAv2
Gaze detection using Moondream