Update README.md
README.md CHANGED
```diff
@@ -40,7 +40,7 @@ https://github.com/NVLabs/VILA
 - [Dataset Licenses](https://github.com/Efficient-Large-Model/VILA/blob/main/data_prepare/LICENSE) for each one used during training.
 
 **Where to send questions or comments about the model:**
-https://github.com/
+https://github.com/NVLabs/VILA/issues
 
 ## Intended use
 **Primary intended uses:**
@@ -49,6 +49,40 @@ The primary use of VILA is research on large multimodal models and chatbots.
 **Primary intended users:**
 The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
 
+## Model Architecture:
+**Architecture Type:** Transformer
+**Network Architecture:** InternViT, Yi
+
+## Input:
+**Input Type:** Image, Video, Text
+**Input Format:** Red, Green, Blue; MP4; String
+**Input Parameters:** 2D, 3D
+
+## Output:
+**Output Type:** Text
+**Output Format:** String
+
+**Supported Hardware Microarchitecture Compatibility:**
+* Ampere
+* Jetson
+* Hopper
+* Lovelace
+
+**[Preferred/Supported] Operating System(s):** <br>
+Linux
+
+## Model Version(s):
+VILA1.5-3B
+VILA1.5-3B-s2
+Llama-3-VILA1.5-8B
+VILA1.5-13B
+VILA1.5-40B
+VILA1.5-3B-AWQ
+VILA1.5-3B-s2-AWQ
+Llama-3-VILA1.5-8B-AWQ
+VILA1.5-13B-AWQ
+VILA1.5-40B-AWQ
+
 ## Training dataset
 See [Dataset Preparation](https://github.com/Efficient-Large-Model/VILA/blob/main/data_prepare/README.md) for more details.
```
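The new Model Architecture block pairs a vision transformer (InternViT) with a large language model (Yi). In VILA-style models the two are connected by a small projector that maps image features into the LLM's token embedding space, and the LLM then generates text auto-regressively. The sketch below only illustrates that composition; every class name, dimension, and layer choice is a placeholder for exposition, not the repository's actual code.

```python
# Illustrative sketch of the composition described in the diff above:
# vision encoder -> projector -> LLM. All modules here are lightweight
# placeholders, not the classes used in the VILA repository.
import torch
import torch.nn as nn


class ToyVisionLanguageModel(nn.Module):
    def __init__(self, vision_dim=256, llm_dim=512, vocab_size=32000):
        super().__init__()
        # Stand-ins for the listed InternViT vision tower and Yi language model.
        self.vision_tower = nn.Linear(3 * 224 * 224, vision_dim)      # placeholder encoder
        self.projector = nn.Sequential(                               # maps image features into LLM token space
            nn.Linear(vision_dim, llm_dim), nn.GELU(), nn.Linear(llm_dim, llm_dim)
        )
        self.embed_tokens = nn.Embedding(vocab_size, llm_dim)
        self.llm = nn.TransformerEncoder(                             # placeholder stack standing in for the LLM
            nn.TransformerEncoderLayer(llm_dim, nhead=8, batch_first=True), num_layers=2
        )
        self.lm_head = nn.Linear(llm_dim, vocab_size)

    def forward(self, image, input_ids):
        # Encode the image, project it into token space, and prepend it to
        # the text embeddings before running the language model.
        img_feat = self.vision_tower(image.flatten(1))                # (B, vision_dim)
        img_tok = self.projector(img_feat).unsqueeze(1)               # (B, 1, llm_dim)
        txt_tok = self.embed_tokens(input_ids)                        # (B, T, llm_dim)
        hidden = self.llm(torch.cat([img_tok, txt_tok], dim=1))
        return self.lm_head(hidden)                                   # next-token logits -> text output


if __name__ == "__main__":
    model = ToyVisionLanguageModel()
    image = torch.rand(1, 3, 224, 224)                                # RGB image input
    input_ids = torch.randint(0, 32000, (1, 8))                       # tokenized prompt
    print(model(image, input_ids).shape)                              # torch.Size([1, 9, 32000])
```

In the released models the vision tower is a pretrained ViT, the projector is trained to align the two modalities, and the language backbone is the listed Yi (or, for the 8B variants, Llama 3) model.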
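The Input/Output block lists RGB images, MP4 video, and text strings as inputs, with text strings as the only output. A minimal sketch of getting data into those formats follows; it is generic preprocessing, not the repository's own loader, and the file name and frame-sampling count are arbitrary placeholders.

```python
# Minimal sketch of preparing the three input formats listed above
# (RGB image, MP4 video, text string). Generic preprocessing only;
# sampling 8 frames is an arbitrary choice for illustration.
from PIL import Image
import cv2
import numpy as np


def load_rgb_image(path: str) -> np.ndarray:
    # 2D input: a single RGB image as an H x W x 3 uint8 array.
    return np.asarray(Image.open(path).convert("RGB"))


def sample_mp4_frames(path: str, num_frames: int = 8) -> list[np.ndarray]:
    # 3D input: a video becomes a short sequence of RGB frames.
    cap = cv2.VideoCapture(path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    frames = []
    for idx in np.linspace(0, max(total - 1, 0), num_frames, dtype=int):
        cap.set(cv2.CAP_PROP_POS_FRAMES, int(idx))
        ok, frame = cap.read()
        if ok:
            frames.append(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))  # OpenCV decodes BGR
    cap.release()
    return frames


if __name__ == "__main__":
    prompt = "Describe what is happening in this video."  # String input
    frames = sample_mp4_frames("example.mp4")              # hypothetical file path
    print(f"{len(frames)} frames sampled; prompt: {prompt!r}")
```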
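The supported microarchitectures (Ampere, Jetson, Hopper, Lovelace) can be checked informally at runtime from the GPU's CUDA compute capability: Ampere parts report 8.0/8.6, Jetson Orin reports 8.7, Ada Lovelace 8.9, and Hopper 9.0. The snippet below is a convenience sketch, not an official support matrix.

```python
# Rough sketch: map the local GPU's CUDA compute capability to the
# microarchitectures listed above. Informal mapping for convenience only.
import torch


def gpu_microarchitecture() -> str:
    if not torch.cuda.is_available():
        return "no CUDA device visible"
    major, minor = torch.cuda.get_device_capability(0)
    if major == 8 and minor in (0, 6):
        return "Ampere"
    if major == 8 and minor == 7:
        return "Ampere (Jetson Orin)"
    if major == 8 and minor == 9:
        return "Ada Lovelace"
    if major == 9:
        return "Hopper"
    return f"other (sm_{major}{minor})"


if __name__ == "__main__":
    print(torch.cuda.get_device_name(0) if torch.cuda.is_available() else "CPU only")
    print(gpu_microarchitecture())
```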