2stacks's Collections
MultiModal
Updated Mar 27
nvidia/NVLM-D-72B • Image-Text-to-Text • 79B • Updated Jan 14 • 51.1k • 772
mistralai/Mistral-Small-3.1-24B-Instruct-2503 • 24B • Updated 21 days ago • 237k • 1.3k
Qwen/Qwen2.5-Omni-7B • Any-to-Any • 11B • Updated Apr 30 • 132k • 1.75k