Text Generation
Transformers
Safetensors
imp
custom_code
Oyoy1235 commited on
Commit
827c02d
•
1 Parent(s): 5a92362

update readme

Browse files
Files changed (1) hide show
  1. README.md +7 -6
README.md CHANGED
@@ -12,7 +12,7 @@ datasets:
12
  >           â€”—*George R.R. Martin, A Clash of Kings*
13
 
14
 
15
- \[Technical report (coming soon)\]  [[Demo](https://xmbot.net/imp/)\]  [[Github](https://github.com/MILVLG/imp)\]
16
 
17
  ## Introduction
18
 
@@ -28,7 +28,7 @@ We release our model weights and provide an example below to run our model . Det
28
 
29
  **Install dependencies**
30
  ```bash
31
- pip install transformers # latest version is ok, but we recommend v4.31.0
32
  pip install -q pillow accelerate einops
33
  ```
34
 
@@ -71,10 +71,11 @@ We conduct evaluation on 9 commonly-used benchmarks, including 5 academic VQA be
71
  | Models | Size | VQAv2 | GQA | SQA(IMG) | TextVQA | POPE | MME(P) | MMB |MMBCN |MM-Vet|
72
  |:--------:|:-----:|:----:|:-------------:|:--------:|:-----:|:----:|:-------:|:-------:|:-------:|:-------:|
73
  | [LLaVA-v1.5-lora](https://huggingface.co/liuhaotian/llava-v1.5-7b) | 7B |79.10 | 63.00| 68.40 |58.20| 86.40 | 1476.9 | 66.10 |- |30.2|
74
- | [TinyGPT-V](https://huggingface.co/Tyrannosaurus/TinyGPT-V) | 3B | - | 33.60 | - | - | -| - | - |- |-|
75
- | [LLaVA-Phi](https://github.com/zhuyiche/llava-phi) | 3B | 71.40 | - | 68.40 | 48.60 | 85.00 | 1335.1 | 59.80 |-|28.9|
76
- | [MobileVLM](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - | 59.00 | 61.00 | 47.50 | 84.90 | 1288.9 | 59.60 |- |-|
77
- | [MC-LLaVA-3b](https://huggingface.co/visheratin/MC-LLaVA-3b) | 3B | 64.24 | 49.60 | - | 38.59 | 80.59 | - | - |- |-|
 
78
  | **Imp-v1.5-3B-Phi2** | 3B | **81.18** | **63.54** | **72.78**| **59.84** | **88.87**| **1446.4** | **72.94**| 46.65 |**43.3**|
79
 
80
  ## License
 
12
  >           â€”—*George R.R. Martin, A Clash of Kings*
13
 
14
 
15
+ \[[Paper](https://arxiv.org/abs/2405.12107)\]  [[Demo](https://xmbot.net/imp/)\]  [[Github](https://github.com/MILVLG/imp)\]
16
 
17
  ## Introduction
18
 
 
28
 
29
  **Install dependencies**
30
  ```bash
31
+ pip install transformers # latest version is ok, but we recommend v4.36.0
32
  pip install -q pillow accelerate einops
33
  ```
34
 
 
71
  | Models | Size | VQAv2 | GQA | SQA(IMG) | TextVQA | POPE | MME(P) | MMB |MMBCN |MM-Vet|
72
  |:--------:|:-----:|:----:|:-------------:|:--------:|:-----:|:----:|:-------:|:-------:|:-------:|:-------:|
73
  | [LLaVA-v1.5-lora](https://huggingface.co/liuhaotian/llava-v1.5-7b) | 7B |79.10 | 63.00| 68.40 |58.20| 86.40 | 1476.9 | 66.10 |- |30.2|
74
+ | [TinyGPT-V-3B](https://huggingface.co/Tyrannosaurus/TinyGPT-V) | 3B | - | 38.9 | - | - | -| - | - |- |-|
75
+ | [LaVA-Phi-3B](https://github.com/zhuyiche/llava-phi) | 3B | 71.40 | - | 68.40 | 48.60 | 85.00 | 1335.1 | 59.80 |-|28.9|
76
+ | [MobileVLM-3B](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - | 59.00 | 61.00 | 47.50 | 84.90 | 1288.9 | 59.60 |- |-|
77
+ | [MiniCPM-V-3B](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - |- | - | - | - | 1452.0 | 67.9 | 65.3 |-|
78
+ | [Bunny-3B](https://huggingface.co/visheratin/MC-LLaVA-3b) | 3B | 79.8 | 62.5 | 70.9 | - | 86.8| 1488.8 | 68.6 |- |-|
79
  | **Imp-v1.5-3B-Phi2** | 3B | **81.18** | **63.54** | **72.78**| **59.84** | **88.87**| **1446.4** | **72.94**| 46.65 |**43.3**|
80
 
81
  ## License