OpenGVLab
/

InternVL-Chat-V1-1

Image-Text-to-Text

feature-extraction

Model card Files Files and versions Community

czczup commited on Jan 26, 2024

Commit

62f8dfb

·

verified ·

1 Parent(s): c144b6c

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -27,6 +27,7 @@ It is _**the largest open-source vision/vision-language foundation model (14B)**
   - Architecture: InternViT-6B + MLP + LLaMA2-13B
   - Params (M): 19B
   - Image size: 448 x 448
 - **Training Strategy:**
   - Pretraining Stage

   - Architecture: InternViT-6B + MLP + LLaMA2-13B
   - Params (M): 19B
   - Image size: 448 x 448
+  - Number of visual tokens: 256
 - **Training Strategy:**
   - Pretraining Stage