Update README.md
Browse files
README.md
CHANGED
@@ -10,12 +10,30 @@ language:
|
|
10 |
|
11 |
[GitHub](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
-
RLAIF-V is a novel framework that aligns MLLMs in a fully open-source paradigm for super GPT-4V trustworthiness
|
14 |
|
15 |
|
16 |
## Model Details
|
17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
### Model Description
|
19 |
- **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
|
20 |
- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
|
21 |
|
|
|
|
|
|
10 |
|
11 |
[GitHub](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
+
**RLAIF-V-7B** is a variant of LLaVA-v1.5-7B with greatly enhanced trustworthiness. The model is trained by a novel framework, **RLAIF-V**, that aligns MLLMs in a **fully open-source paradigm for super GPT-4V trustworthiness**. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
|
14 |
|
15 |
|
16 |
## Model Details
|
17 |
|
18 |
+
### Key Features
|
19 |
+
|
20 |
+
* 📈**Best trustworthiness enhancement on LLaVA-v1.5**: By learning from open-source AI feedback, specifically, the feedback from LLaVA-NeXT-34B, RLAIF-V-7B achieves the best trustworthiness improvement on LLaVA-v1.5 compared to other hallucination reduction methods.
|
21 |
+
* 💪**Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g. LLaVABench, MMStar), RLAIF-V-7B also exhibits good performance.
|
22 |
+
|
23 |
+
|
24 |
+
<p align="center">
|
25 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/ypXZxb4HE-jDPJU9115bi.png" alt="fig1" width="90%"/>
|
26 |
+
</p>
|
27 |
+
|
28 |
+
### Examples
|
29 |
+
<p align="center">
|
30 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/Hyu2Et5CQtDFmxaYHKdu-.png" alt="fig2-1" width="80%"/>
|
31 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/16mJpyH_-vnRfl8Ywfa6k.png" alt="fig2-1" width="80%"/>
|
32 |
+
</p>
|
33 |
+
|
34 |
### Model Description
|
35 |
- **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
|
36 |
- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
|
37 |
|
38 |
+
## Usage
|
39 |
+
Please look at [GitHub](https://github.com/RLHF-V/RLAIF-V) for more details about usage.
|