Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Audio-Text-to-Text
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Visual Document Retrieval
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Text Ranking
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
8,980
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
mlx-community/deepseek-vl2-8bit
Image-Text-to-Text
•
Updated
Jan 2
•
59
•
5
OS-Copilot/OS-Genesis-7B-AC
Image-Text-to-Text
•
Updated
Jan 8
•
10
•
7
OpenGVLab/InternVL2_5-78B-MPO-AWQ
Image-Text-to-Text
•
Updated
Jan 6
•
630
•
9
OpenGVLab/InternVL2_5-26B-MPO-AWQ
Image-Text-to-Text
•
Updated
Mar 25
•
164
•
4
OS-Copilot/OS-Genesis-4B-AC
Image-Text-to-Text
•
Updated
Jan 8
•
13
•
7
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
Jan 12
•
50.8k
•
•
587
lmstudio-community/QVQ-72B-Preview-GGUF
Image-Text-to-Text
•
Updated
Jan 6
•
217
•
2
mjschock/SmolVLM-Instruct
Image-Text-to-Text
•
Updated
Dec 25, 2024
•
18
•
2
unsloth/QVQ-72B-Preview-bnb-4bit
Image-Text-to-Text
•
Updated
Dec 25, 2024
•
199
•
6
nintwentydo/pixtral-12b-FP8-dynamic-FP8-KV-cache
Image-Text-to-Text
•
Updated
Jan 6
•
12
•
1
huihui-ai/QVQ-72B-Preview-abliterated-GPTQ-Int4
Image-Text-to-Text
•
Updated
Dec 31, 2024
•
7
•
1
Michael4933/Migician
Image-Text-to-Text
•
Updated
Mar 23
•
97
•
1
jzhang533/manga-ocr-base-2025
Image-Text-to-Text
•
Updated
Jan 3
•
670
•
4
nasselt48/lora_radio_vesion_model_finetune
Image-Text-to-Text
•
Updated
Jan 3
•
6
•
1
nintwentydo/pixtral-12b-2409-2of4-sparse
Image-Text-to-Text
•
Updated
Jan 5
•
5
•
1
nintwentydo/pixtral-12b-2409-W4A16-G128
Image-Text-to-Text
•
Updated
Jan 5
•
7
•
2
matatonic/QVQ-72B-Preview-abliterated-6.5bpw-h8-exl2
Image-Text-to-Text
•
Updated
Jan 6
•
8
•
1
zhouyik/colva_internvl2_4b
Image-Text-to-Text
•
Updated
Jan 9
•
27
•
1
osunlp/UGround-V1-72B-Preview
Image-Text-to-Text
•
Updated
Jan 12
•
21
•
2
TianHuiLab/Falcon-Single-Instruction-Base
Image-Text-to-Text
•
Updated
Mar 21
•
2
TianHuiLab/Falcon-Single-Instruction-Large
Image-Text-to-Text
•
Updated
Mar 21
•
4
ByteDance/Sa2VA-8B
Image-Text-to-Text
•
Updated
Mar 19
•
597
•
56
MBZUAI/AIN
Image-Text-to-Text
•
Updated
Mar 13
•
1.41k
•
7
ByteDance/Sa2VA-1B
Image-Text-to-Text
•
Updated
Mar 19
•
2.45k
•
24
MIL-UT/Asagi-14B
Image-Text-to-Text
•
Updated
Feb 24
•
118
•
9
nintwentydo/Razorback-12B-v0.1
Image-Text-to-Text
•
Updated
Jan 10
•
8
•
3
llamaindex/vdr-2b-v1
Image-Text-to-Text
•
Updated
Jan 10
•
536
•
13
nintwentydo/Razorback-12B-v0.2
Image-Text-to-Text
•
Updated
Jan 10
•
10
•
3
Qingyun/Florence-2-large-DOTA-v1.0-lmmrotate
Image-Text-to-Text
•
Updated
Jan 29
•
17
•
1
TucanoBR/ViTucano-1b5-v1
Image-Text-to-Text
•
Updated
Feb 17
•
37
•
2
Previous
1
...
14
15
16
17
18
...
100
Next