openbmb
/

RLAIF-V-7B

Text Generation

Model card Files Files and versions Community

HaoyeZhang commited on May 25, 2024

Commit

81c848e

·

verified ·

1 Parent(s): 860fccc

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -10,15 +10,15 @@ language:
 [GitHub](https://github.com/RLHF-V/RLAIF-V)
-**RLAIF-V-7B** is a variant of LLaVA-v1.5-7B with greatly enhanced trustworthiness. The model is trained by a novel framework, **RLAIF-V**, that aligns MLLMs in a **fully open-source paradigm for super GPT-4V trustworthiness**. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
 ## Model Details
 ### Key Features
-* 📈**Best trustworthiness enhancement on LLaVA-v1.5**: By learning from open-source AI feedback, specifically, the feedback from LLaVA-NeXT-34B, RLAIF-V-7B achieves the best trustworthiness improvement on LLaVA-v1.5 compared to other hallucination reduction methods.
-* 💪**Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g. LLaVABench, MMStar), RLAIF-V-7B also exhibits good performance.
 <p align="center">

 [GitHub](https://github.com/RLHF-V/RLAIF-V)
+**RLAIF-V-7B** is a variant of LLaVA 1.5 7B with greatly enhanced trustworthiness. The model is trained by a novel framework, **RLAIF-V**, that aligns MLLMs in a **fully open-source paradigm for super GPT-4V trustworthiness**. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
 ## Model Details
 ### Key Features
+* 📈 **Most trustworthy LLaVA 1.5**: By learning from open-source AI feedback, specifically, the feedback from LLaVA-NeXT-34B, RLAIF-V-7B achieves the best trustworthiness improvement on LLaVA-v1.5 compared to other hallucination reduction methods.
+* 💪 **Maintaining Well Performance on General Abilities**: On benchmarks evaluating general capabilities (e.g. LLaVA Bench, MMStar), RLAIF-V-7B also exhibits good performance.
 <p align="center">