update readme
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ datasets:
|
|
12 |
> ——*George R.R. Martin, A Clash of Kings*
|
13 |
|
14 |
|
15 |
-
\[
|
16 |
|
17 |
## Introduction
|
18 |
|
@@ -28,7 +28,7 @@ We release our model weights and provide an example below to run our model . Det
|
|
28 |
|
29 |
**Install dependencies**
|
30 |
```bash
|
31 |
-
pip install transformers # latest version is ok, but we recommend v4.
|
32 |
pip install -q pillow accelerate einops
|
33 |
```
|
34 |
|
@@ -71,10 +71,11 @@ We conduct evaluation on 9 commonly-used benchmarks, including 5 academic VQA be
|
|
71 |
| Models | Size | VQAv2 | GQA | SQA(IMG) | TextVQA | POPE | MME(P) | MMB |MMBCN |MM-Vet|
|
72 |
|:--------:|:-----:|:----:|:-------------:|:--------:|:-----:|:----:|:-------:|:-------:|:-------:|:-------:|
|
73 |
| [LLaVA-v1.5-lora](https://huggingface.co/liuhaotian/llava-v1.5-7b) | 7B |79.10 | 63.00| 68.40 |58.20| 86.40 | 1476.9 | 66.10 |- |30.2|
|
74 |
-
| [TinyGPT-V](https://huggingface.co/Tyrannosaurus/TinyGPT-V) | 3B | - |
|
75 |
-
| [
|
76 |
-
| [MobileVLM](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - | 59.00 | 61.00 | 47.50 | 84.90 | 1288.9 | 59.60 |- |-|
|
77 |
-
| [
|
|
|
78 |
| **Imp-v1.5-3B-Phi2** | 3B | **81.18** | **63.54** | **72.78**| **59.84** | **88.87**| **1446.4** | **72.94**| 46.65 |**43.3**|
|
79 |
|
80 |
## License
|
|
|
12 |
> ——*George R.R. Martin, A Clash of Kings*
|
13 |
|
14 |
|
15 |
+
\[[Paper](https://arxiv.org/abs/2405.12107)\] [[Demo](https://xmbot.net/imp/)\] [[Github](https://github.com/MILVLG/imp)\]
|
16 |
|
17 |
## Introduction
|
18 |
|
|
|
28 |
|
29 |
**Install dependencies**
|
30 |
```bash
|
31 |
+
pip install transformers # latest version is ok, but we recommend v4.36.0
|
32 |
pip install -q pillow accelerate einops
|
33 |
```
|
34 |
|
|
|
71 |
| Models | Size | VQAv2 | GQA | SQA(IMG) | TextVQA | POPE | MME(P) | MMB |MMBCN |MM-Vet|
|
72 |
|:--------:|:-----:|:----:|:-------------:|:--------:|:-----:|:----:|:-------:|:-------:|:-------:|:-------:|
|
73 |
| [LLaVA-v1.5-lora](https://huggingface.co/liuhaotian/llava-v1.5-7b) | 7B |79.10 | 63.00| 68.40 |58.20| 86.40 | 1476.9 | 66.10 |- |30.2|
|
74 |
+
| [TinyGPT-V-3B](https://huggingface.co/Tyrannosaurus/TinyGPT-V) | 3B | - | 38.9 | - | - | -| - | - |- |-|
|
75 |
+
| [LaVA-Phi-3B](https://github.com/zhuyiche/llava-phi) | 3B | 71.40 | - | 68.40 | 48.60 | 85.00 | 1335.1 | 59.80 |-|28.9|
|
76 |
+
| [MobileVLM-3B](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - | 59.00 | 61.00 | 47.50 | 84.90 | 1288.9 | 59.60 |- |-|
|
77 |
+
| [MiniCPM-V-3B](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - |- | - | - | - | 1452.0 | 67.9 | 65.3 |-|
|
78 |
+
| [Bunny-3B](https://huggingface.co/visheratin/MC-LLaVA-3b) | 3B | 79.8 | 62.5 | 70.9 | - | 86.8| 1488.8 | 68.6 |- |-|
|
79 |
| **Imp-v1.5-3B-Phi2** | 3B | **81.18** | **63.54** | **72.78**| **59.84** | **88.87**| **1446.4** | **72.94**| 46.65 |**43.3**|
|
80 |
|
81 |
## License
|