Hugging Face
Models (367 results)
Active filters: multimodal
Sort: Trending
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated 18 days ago • 2.49M • 971
GoodiesHere/Apollo-LMMs-Apollo-7B-t32 • Video-Text-to-Text • Updated 6 days ago • 203 • 28
Qwen/Qwen2-VL-72B-Instruct • Image-Text-to-Text • Updated 18 days ago • 91.2k • 228
jinaai/jina-clip-v2 • Feature Extraction • Updated 11 days ago • 20.4k • 136
GoodiesHere/Apollo-LMMs-Apollo-3B-t32 • Text Generation • Updated 6 days ago • 106 • 13
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • Updated 18 days ago • 818k • 330
AI-Safeguard/Ivy-VL-llava • Visual Question Answering • Updated 4 days ago • 1.53k • 53
prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct • Image-Text-to-Text • Updated 3 days ago • 156 • 9
bartowski/Qwen2-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated 7 days ago • 5.15k • 15
NexaAIDev/OmniVLM-968M • Updated 7 days ago • 5.03k • 482
GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32 • Video-Text-to-Text • Updated 6 days ago • 158 • 6
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated Oct 10 • 233k • 462
robotics-diffusion-transformer/rdt-1b • Robotics • Updated Oct 17 • 3.5k • 54
HuggingFaceM4/idefics2-8b • Image-Text-to-Text • Updated Oct 14 • 21k • 597
openvla/openvla-7b • Image-Text-to-Text • Updated Sep 16 • 94.7k • 86
bartowski/Qwen2-VL-2B-Instruct-GGUF • Image-Text-to-Text • Updated 7 days ago • 2.03k • 7
qnguyen3/nanoLLaVA • Text Generation • Updated Oct 27 • 23.8k • 150
chenjoya/videollm-online-8b-v1plus • Video-Text-to-Text • Updated Jul 13 • 5.54k • 17
qnguyen3/nanoLLaVA-1.5 • Image-Text-to-Text • Updated Sep 21 • 486 • 104
HuggingFaceM4/Idefics3-8B-Llama3 • Image-Text-to-Text • Updated 22 days ago • 18k • 257
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8 • Image-Text-to-Text • Updated Sep 21 • 24.4k • 19
Qwen/Qwen2-VL-72B-Instruct-AWQ • Image-Text-to-Text • Updated Sep 25 • 47.5k • 40
allenai/Molmo-72B-0924 • Image-Text-to-Text • Updated Oct 10 • 3.41k • 272
rhymes-ai/Aria • Image-Text-to-Text • Updated 6 days ago • 18.1k • 599
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Oct 18 • 10.1k • 757
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101 • Updated Nov 1 • 37 • 2
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-SIDER-101 • Updated Nov 1 • 32 • 2
NCSOFT/VARCO-VISION-14B-HF • Image-Text-to-Text • Updated 14 days ago • 1.1k • 20
nvidia/NVLM-D-72B-mcore • Image-Text-to-Text • Updated 3 days ago • 2
imageomics/bioclip • Zero-Shot Image Classification • Updated May 17 • 5.53k • 42