Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
OmniParser
like
1.53k
Follow
Microsoft
7.46k
Image-Text-to-Text
Transformers
Safetensors
blip-2
visual-question-answering
Inference Endpoints
arxiv:
2408.00203
License:
mit
Model card
Files
Files and versions
Community
44
Train
Deploy
Use this model
87a3f61
OmniParser
/
icon_caption_blip2
3 contributors
History:
3 commits
adamlu1
add florence model
87a3f61
3 months ago
icon_caption_blip2
add florence model
3 months ago
icon_detect
add florence model
3 months ago