Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
Replicate
Novita
Hyperbolic
Fireworks
SambaNova
Cerebras
fal
Nebius AI Studio
HF Inference API
Misc
Reset Misc
multi-modal
custom_code
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Eval Results
Misc with no match
Merge
4-bit precision
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
30
Full-text search
Edit filters
Sort: Trending
Active filters:
multi-modal
Clear all
jinaai/jina-clip-v2
Zero-Shot Image Classification
•
Updated
21 days ago
•
95k
•
210
geolocal/StreetCLIP
Zero-Shot Image Classification
•
Updated
Sep 13, 2023
•
14.6k
•
88
kakaobrain/align-base
Zero-Shot Image Classification
•
Updated
Mar 8, 2023
•
144k
•
25
skit-ai/speechllm-2B
Feature Extraction
•
Updated
Jun 25, 2024
•
244
•
16
skit-ai/speechllm-1.5B
Feature Extraction
•
Updated
Jun 25, 2024
•
130
•
7
PKU-Alignment/Beaver-Vision-11B
Image-Text-to-Text
•
Updated
Nov 10, 2024
•
69
•
2
mbreuss/MoDE_CALVIN_ABCD
Robotics
•
Updated
Dec 19, 2024
•
5
•
2
mbreuss/MoDE_Pretrained
Robotics
•
Updated
Dec 19, 2024
•
10
•
4
mbreuss/MoDE_CALVIN_ABC_3
Robotics
•
Updated
Dec 19, 2024
•
8
•
1
mbreuss/MoDE_CALVIN_ABC_1
Robotics
•
Updated
Dec 19, 2024
•
14
•
1
DAMO-NLP-SG/VideoLLaMA3-7B
Visual Question Answering
•
Updated
8 days ago
•
24.7k
•
40
MMR1/MMR1-Math-v0-7B
Image-Text-to-Text
•
Updated
15 days ago
•
198
•
5
LinWeizheDragon/PreFLMR_ViT-L
Feature Extraction
•
Updated
Feb 27, 2024
•
5.01k
•
1
LinWeizheDragon/PreFLMR_ViT-B
Feature Extraction
•
Updated
Feb 27, 2024
•
233
•
2
LinWeizheDragon/PreFLMR_ViT-G
Feature Extraction
•
Updated
Feb 27, 2024
•
2.62k
•
8
LinWeizheDragon/FLMR
Feature Extraction
•
Updated
Feb 27, 2024
•
223
•
1
frizai/Pulse-v1
Updated
Mar 2, 2024
•
1
shangeth/SpeechLLM
Feature Extraction
•
Updated
Jun 4, 2024
•
14
jrheiner/thesis-clip-geoloc-continent
Zero-Shot Image Classification
•
Updated
Aug 27, 2024
•
80
jrheiner/thesis-clip-geoloc-country
Zero-Shot Image Classification
•
Updated
Aug 27, 2024
•
21
mbreuss/MoDE_CALVIN_D
Robotics
•
Updated
Dec 19, 2024
•
8
mbreuss/MoDE_LIBERO_90
Robotics
•
Updated
Dec 19, 2024
•
5
mbreuss/MoDE_CALVIN_ABC_2
Robotics
•
Updated
Dec 19, 2024
•
7
LinWeizheDragon/PreFLMR_ViT-L_ENCN
Feature Extraction
•
Updated
Dec 23, 2024
•
14
mbreuss/MoDE_LIBERO_10
Robotics
•
Updated
Dec 19, 2024
•
8
DAMO-NLP-SG/VideoLLaMA3-2B
Visual Question Answering
•
Updated
8 days ago
•
4.92k
•
10
DAMO-NLP-SG/VideoLLaMA3-2B-Image
Visual Question Answering
•
Updated
8 days ago
•
934
•
7
DAMO-NLP-SG/VideoLLaMA3-7B-Image
Visual Question Answering
•
Updated
8 days ago
•
6.85k
•
10
fneurociencias/neurodiagnoses-agnostic-ml
Updated
23 days ago
Allen8/TVC-7B
Image-Text-to-Text
•
Updated
7 days ago
•
9