Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
carlizor
's Collections
Multi lora spaces
TTS
Utilities
Document retrieval / chat
Flux
Image restoration
3D Generation
LLM
Embedding
LLM - Small
Video vision
To Read
Video
Image Segmentation
Image Generation (Fast)
Image Depth
Image caption
Audio
Image Generation
Image that talks
Image Enhance
Image Vision
Image editing
Image upscaling
Face Recognition
Multimodal
LLM - Medium
Image Vision
updated
3 days ago
Upvote
-
Salesforce/xgen-mm-phi3-mini-instruct-r-v1
Image-Text-to-Text
•
Updated
Feb 3
•
1.07k
•
186
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
Updated
29 days ago
•
5.37k
•
269
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
Jan 14
•
20.6k
•
767
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
1.61k
•
1.65k
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
Jan 27
•
11k
•
583
deepseek-ai/JanusFlow-1.3B
Any-to-Any
•
Updated
Jan 27
•
4.94k
•
143
NexaAIDev/OmniVLM-968M
Updated
Dec 17, 2024
•
1.17k
•
513
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
Jan 9
•
168k
•
1.08k
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
Feb 4
•
95.9k
•
1.44k
jiuhai/florence-vl-8b-sft
Updated
Dec 3, 2024
•
44
•
19
AI-Safeguard/Ivy-VL-llava
Visual Question Answering
•
Updated
Dec 31, 2024
•
836
•
63
OpenGVLab/InternVL2_5-78B
Image-Text-to-Text
•
Updated
3 days ago
•
4.46k
•
188
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
Jan 12
•
140k
•
•
573
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
Updated
Dec 18, 2024
•
12.4k
•
309
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
690k
•
518
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
•
Updated
Jan 11
•
61.2k
•
58
ByteDance/Sa2VA-1B
Image-Text-to-Text
•
Updated
9 days ago
•
1.69k
•
21
HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
Updated
21 days ago
•
23.6k
•
116
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
5 days ago
•
384k
•
•
408
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
5 days ago
•
3.31M
•
•
746
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
•
Updated
Feb 18
•
7.72k
•
52
nvidia/Eagle2-9B
Image-Text-to-Text
•
Updated
Jan 28
•
5.6k
•
45
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text
•
Updated
Jan 31
•
166k
•
178
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text
•
Updated
about 1 month ago
•
564k
•
583
microsoft/Magma-8B
Image-Text-to-Text
•
Updated
22 days ago
•
14.9k
•
340
marco/mcdse-2b-v1
Updated
Oct 29, 2024
•
6.28k
•
54
CohereForAI/aya-vision-8b
Image-Text-to-Text
•
Updated
23 days ago
•
151k
•
270
Skywork/Skywork-R1V-38B
Image-Text-to-Text
•
Updated
about 17 hours ago
•
3.09k
•
106
ds4sd/SmolDocling-256M-preview
Image-Text-to-Text
•
Updated
4 days ago
•
40.8k
•
986
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text
•
Updated
1 day ago
•
52.9k
•
239
Upvote
-
Share collection
View history
Collection guide
Browse collections