Thirawarit committed
Commit d675ad7
1 Parent(s): f1b4e16
Update README.md
README.md CHANGED
@@ -9,7 +9,7 @@ base_model:
 pipeline_tag: visual-question-answering
 ---
 
-# Pathumma-llm-vision-
+# Pathumma-llm-vision-1.0.0
 
 ## Model Overview
 Pathumma-llm-vision-1.0.0 is a multi-modal language model fine-tuned for Visual Question Answering (VQA) and Image Captioning tasks. It contains 8 billion parameters and leverages both image and text processing to understand and generate multi-modal content.