Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Audio-Text-to-Text
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Visual Document Retrieval
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Text Ranking
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
9,595
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
bartowski/google_gemma-3-12b-it-qat-GGUF
Image-Text-to-Text
•
Updated
Apr 18
•
17.1k
•
21
NAMAA-Space/Adasah-QA-0.1-3B-Instruct-merged-4bits
Image-Text-to-Text
•
Updated
1 day ago
•
104
•
2
unsloth/gemma-3-4b-it-qat-GGUF
Image-Text-to-Text
•
Updated
12 days ago
•
10.6k
•
8
Skywork/Skywork-R1V2-38B
Image-Text-to-Text
•
Updated
27 days ago
•
69.6k
•
119
ginipick/Gemma-3-R1984-4B
Image-Text-to-Text
•
Updated
about 1 month ago
•
511
•
33
meta-llama/Llama-Guard-4-12B
Image-Text-to-Text
•
Updated
24 days ago
•
22.4k
•
32
unsloth/gemma-3-27b-it-qat-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
14.1k
•
8
leon-se/gemma-3-27b-it-qat-W4A16-G128
Image-Text-to-Text
•
Updated
26 days ago
•
2.27k
•
9
xlangai/Jedi-7B-1080p
Image-Text-to-Text
•
Updated
3 days ago
•
347
•
4
henryhe0123/PC-Agent-E
Image-Text-to-Text
•
Updated
3 days ago
•
35
•
2
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text
•
Updated
12 days ago
•
5.25k
•
4
turing-motors/Heron-NVILA-Lite-33B
Image-Text-to-Text
•
Updated
11 days ago
•
180
•
4
tonyli8623/Himedical-R1-Sft-Gemma-27b-Q8.GGUF
Image-Text-to-Text
•
Updated
7 days ago
•
91
•
2
shreydan/SmolVLM-256M-Detection
Image-Text-to-Text
•
Updated
6 days ago
•
16
•
2
John6666/llama-joycaption-beta-one-hf-llava-nf4
Image-Text-to-Text
•
Updated
6 days ago
•
65
•
2
unsloth/InternVL3-38B-GGUF
Image-Text-to-Text
•
Updated
5 days ago
•
1.05k
•
2
unsloth/medgemma-27b-text-it
Image-Text-to-Text
•
Updated
3 days ago
•
182
•
2
NikolayKozloff/medgemma-4b-it-Q8_0-GGUF
Image-Text-to-Text
•
Updated
3 days ago
•
71
•
2
mlx-community/medgemma-27b-text-it-4bit
Image-Text-to-Text
•
Updated
2 days ago
•
63
•
2
Salesforce/blip2-opt-6.7b
Image-Text-to-Text
•
Updated
Feb 3
•
6.93k
•
77
liuhaotian/llava-v1.5-7b-lora
Image-Text-to-Text
•
Updated
May 9, 2024
•
979
•
24
llava-hf/llava-v1.6-vicuna-13b-hf
Image-Text-to-Text
•
Updated
Jan 27
•
14.4k
•
20
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
Updated
Mar 25
•
2.98k
•
412
xtuner/llava-llama-3-8b-v1_1-transformers
Image-Text-to-Text
•
Updated
Apr 28, 2024
•
537k
•
79
google/paligemma-3b-ft-stvqa-448-jax
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
1
microsoft/llava-med-v1.5-mistral-7b
Image-Text-to-Text
•
Updated
May 14, 2024
•
101k
•
88
microsoft/Florence-2-large-ft
Image-Text-to-Text
•
Updated
Jul 20, 2024
•
144k
•
352
microsoft/Florence-2-base-ft
Image-Text-to-Text
•
Updated
Jul 20, 2024
•
88k
•
114
onnx-community/Florence-2-base-ft
Image-Text-to-Text
•
Updated
15 days ago
•
30.8k
•
31
qresearch/doubutsu-2b-pt-756
Image-Text-to-Text
•
Updated
Feb 9
•
51
•
4
Previous
1
...
3
4
5
6
7
...
100
Next