Update README.md
README.md
CHANGED
@@ -13,6 +13,12 @@ tags:
 - image-captioning
 ---
 
+widget:
+- text: "picture of a futuristic tiger, artstation"
+output:
+url:
+
+
 # vit-gpt2-image-captioning_COCO_FineTuned
 This repository contains the fine-tuned ViT-GPT2 model for image captioning, trained on the COCO dataset. The model combines a Vision Transformer (ViT) for image feature extraction and GPT-2 for text generation to create descriptive captions from images.
 