Update README.md
Browse files
README.md
CHANGED
@@ -10,15 +10,15 @@ language:
|
|
10 |
|
11 |
[GitHub](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
-
**RLAIF-V-7B** is a variant of LLaVA
|
14 |
|
15 |
|
16 |
## Model Details
|
17 |
|
18 |
### Key Features
|
19 |
|
20 |
-
*
|
21 |
-
*
|
22 |
|
23 |
|
24 |
<p align="center">
|
|
|
10 |
|
11 |
[GitHub](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
+
**RLAIF-V-7B** is a variant of LLaVA 1.5 7B with greatly enhanced trustworthiness. The model is trained by a novel framework, **RLAIF-V**, that aligns MLLMs in a **fully open-source paradigm for super GPT-4V trustworthiness**. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
|
14 |
|
15 |
|
16 |
## Model Details
|
17 |
|
18 |
### Key Features
|
19 |
|
20 |
+
* 📈 **Most trustworthy LLaVA 1.5**: By learning from open-source AI feedback, specifically, the feedback from LLaVA-NeXT-34B, RLAIF-V-7B achieves the best trustworthiness improvement on LLaVA-v1.5 compared to other hallucination reduction methods.
|
21 |
+
* 💪 **Maintaining Well Performance on General Abilities**: On benchmarks evaluating general capabilities (e.g. LLaVA Bench, MMStar), RLAIF-V-7B also exhibits good performance.
|
22 |
|
23 |
|
24 |
<p align="center">
|