MILVLG/Imp-v1.5-3B-Phi2


Oyoy1235 committed on May 21

Commit 827c02d • 1 Parent(s): 5a92362

update readme

Files changed (1)

README.md +7 -6


@@ -12,7 +12,7 @@ datasets:
 > &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;——*George R.R. Martin, A Clash of Kings*
-\[Technical report (coming soon)\]&nbsp;&nbsp;[[Demo](https://xmbot.net/imp/)\]&nbsp;&nbsp;[[Github](https://github.com/MILVLG/imp)\]
+\[[Paper](https://arxiv.org/abs/2405.12107)\]&nbsp;&nbsp;[[Demo](https://xmbot.net/imp/)\]&nbsp;&nbsp;[[Github](https://github.com/MILVLG/imp)\]
 ## Introduction
@@ -28,7 +28,7 @@ We release our model weights and provide an example below to run our model . Det
 **Install dependencies**
 ```bash
-pip install transformers # latest version is ok, but we recommend v4.31.0
+pip install transformers # latest version is ok, but we recommend v4.36.0
 pip install -q pillow accelerate einops
 ```
@@ -71,10 +71,11 @@ We conduct evaluation on 9 commonly-used benchmarks, including 5 academic VQA be
 | Models | Size | VQAv2 | GQA | SQA(IMG) | TextVQA | POPE |  MME(P) | MMB  |MMBCN  |MM-Vet|
 |:--------:|:-----:|:----:|:-------------:|:--------:|:-----:|:----:|:-------:|:-------:|:-------:|:-------:|
 | [LLaVA-v1.5-lora](https://huggingface.co/liuhaotian/llava-v1.5-7b) | 7B |79.10 | 63.00|  68.40 |58.20| 86.40 | 1476.9 | 66.10 |- |30.2|
-| [TinyGPT-V](https://huggingface.co/Tyrannosaurus/TinyGPT-V) | 3B | - | 33.60  |    -   |    -  | -| - | - |- |-|
-| [LLaVA-Phi](https://github.com/zhuyiche/llava-phi) | 3B | 71.40  | - |    68.40   |    48.60  | 85.00 | 1335.1 | 59.80 |-|28.9|
-| [MobileVLM](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - | 59.00  |    61.00   |    47.50   | 84.90 | 1288.9 | 59.60 |- |-|
-| [MC-LLaVA-3b](https://huggingface.co/visheratin/MC-LLaVA-3b) | 3B | 64.24 | 49.60  |   -   |    38.59   | 80.59 | - | - |- |-|
+| [TinyGPT-V-3B](https://huggingface.co/Tyrannosaurus/TinyGPT-V) | 3B | - | 38.9  |    -   |    -  | -| - | - |- |-|
+| [LaVA-Phi-3B](https://github.com/zhuyiche/llava-phi) | 3B | 71.40  | - |    68.40   |    48.60  | 85.00 | 1335.1 | 59.80 |-|28.9|
+| [MobileVLM-3B](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - | 59.00  |    61.00   |    47.50   | 84.90 | 1288.9 | 59.60 |- |-|
+| [MiniCPM-V-3B](https://huggingface.co/mtgv/MobileVLM-3B) | 3B | - |-  | - | - | - |  1452.0 |  67.9 | 65.3 |-|
+| [Bunny-3B](https://huggingface.co/visheratin/MC-LLaVA-3b) | 3B |  79.8 |  62.5  |   70.9  |    -  | 86.8| 1488.8 | 68.6 |- |-|
 | **Imp-v1.5-3B-Phi2** | 3B | **81.18**  | **63.54** | **72.78**| **59.84** | **88.87**| **1446.4** | **72.94**| 46.65 |**43.3**|
 ## License