Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AnyModal
/
Image-Captioning-Llama-3.2-1B
like
1
Follow
AnyModal
4
Image-to-Text
Safetensors
AnyModal/flickr30k
English
AnyModal
vlm
vision
multimodal
License:
mit
Model card
Files
Files and versions
Community
1
ritabratamaiti
commited on
Dec 1, 2024
Commit
0cb15c7
·
verified
·
1 Parent(s):
41b8fea
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+8
-3
README.md
CHANGED
Viewed
@@ -1,3 +1,8 @@
1
-
---
2
-
license: mit
3
-
---
1
+
---
2
+
license: mit
3
+
datasets:
4
+
- AnyModal/flickr30k
5
+
base_model:
6
+
- meta-llama/Llama-3.2-1B
7
+
- google/vit-base-patch16-224
8
+
---