---
license: llama2
pipeline_tag: text-generation
tags:
- cortex.cpp
- multimodal
- vicuna
- vision-language
---

## Overview

**LLaVA** (Large Language and Vision Assistant) is an open-source chatbot trained for multimodal instruction-following. It is fine-tuned from **Vicuna-7B** to process both **text and image** inputs. As an auto-regressive, transformer-based language model, it is useful for research in **computer vision, natural language processing, machine learning, and artificial intelligence**.

LLaVA-v1.6-Vicuna-7B is the latest iteration, trained in **December 2023** and optimized for improved instruction-following performance in multimodal settings.

## Variants

| No | Variant | Cortex CLI command |
| --- | --- | --- |
| 1 | [llava-v1.6-vicuna-7b-f16](https://huggingface.co/cortexso/llava-v1.6/tree/gguf-f16) | `cortex run llava-v1.6:gguf-f16` |
| 2 | [llava-v1.6-vicuna-7b-q4_km](https://huggingface.co/cortexso/llava-v1.6/tree/gguf-q4-km) | `cortex run llava-v1.6:gguf-q4-km` |
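
On machines with limited VRAM/RAM, the 4-bit build is the lighter choice. A minimal sketch of fetching it ahead of time and then starting it; `cortex pull` is assumed to be available in your Cortex version:

```bash
# Download the q4_km variant without starting it
# (assumes your Cortex version provides `cortex pull`)
cortex pull llava-v1.6:gguf-q4-km

# Launch an interactive session with the downloaded variant
cortex run llava-v1.6:gguf-q4-km
```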

## Use it with Jan (UI)

1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
2. Use it in the Jan model Hub:
```bash
cortexso/llava-v1.6
```

## Use it with Cortex (CLI)

1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
2. Run the model with the command:
```bash
cortex run llava-v1.6
```
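
Once the model is running, recent cortex.cpp builds also expose an OpenAI-compatible HTTP API. A hedged sketch, assuming the default port `39281` (the port and exact route may differ on your install, so check your configuration):

```bash
# Query the local OpenAI-compatible endpoint
# (port 39281 is an assumption; adjust host/port to your Cortex setup)
curl http://127.0.0.1:39281/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llava-v1.6",
    "messages": [{"role": "user", "content": "Hello, what can you do?"}]
  }'
```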

## Credits

- **Author:** LLaVA Research Team
- **Converter:** [Homebrew](https://www.homebrew.ltd/)
- **Original License:** [LLAMA 2 Community License](https://github.com/facebookresearch/llama/blob/main/LICENSE)
- **Paper:** [LLaVA-v1.6: Enhancing Large Multimodal Models](https://llava-vl.github.io/)