ziffir/PASDV1
Image-to-Text
4 datasets
English
image-captioning
visual-question-answering
arxiv: 2308.14469
License: apache-2.0
arxiv.org/abs/2308.14469
Datasets used to train
ziffir/PASDV1
HuggingFaceM4/VQAv2 • Updated Jun 30, 2022 • 1.13k • 41
ranjaykrishna/visual_genome • Updated Jun 29, 2023 • 419 • 76
vicenteor/sbu_captions • Updated Jan 18, 2024 • 223 • 18