Text Generation
Transformers
Safetensors
English
llava_llama
Inference Endpoints
HaoyeZhang commited on
Commit
860fccc
·
verified ·
1 Parent(s): e9a840d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -1
README.md CHANGED
@@ -10,12 +10,30 @@ language:
10
 
11
  [GitHub](https://github.com/RLHF-V/RLAIF-V)
12
 
13
- RLAIF-V is a novel framework that aligns MLLMs in a fully open-source paradigm for super GPT-4V trustworthiness. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and online feedback learning algorithm.
14
 
15
 
16
  ## Model Details
17
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  ### Model Description
19
  - **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
20
  - **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
21
 
 
 
 
10
 
11
  [GitHub](https://github.com/RLHF-V/RLAIF-V)
12
 
13
+ **RLAIF-V-7B** is a variant of LLaVA-v1.5-7B with greatly enhanced trustworthiness. The model is trained by a novel framework, **RLAIF-V**, that aligns MLLMs in a **fully open-source paradigm for super GPT-4V trustworthiness**. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
14
 
15
 
16
  ## Model Details
17
 
18
+ ### Key Features
19
+
20
+ * 📈**Best trustworthiness enhancement on LLaVA-v1.5**: By learning from open-source AI feedback, specifically, the feedback from LLaVA-NeXT-34B, RLAIF-V-7B achieves the best trustworthiness improvement on LLaVA-v1.5 compared to other hallucination reduction methods.
21
+ * 💪**Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g. LLaVABench, MMStar), RLAIF-V-7B also exhibits good performance.
22
+
23
+
24
+ <p align="center">
25
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/ypXZxb4HE-jDPJU9115bi.png" alt="fig1" width="90%"/>
26
+ </p>
27
+
28
+ ### Examples
29
+ <p align="center">
30
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/Hyu2Et5CQtDFmxaYHKdu-.png" alt="fig2-1" width="80%"/>
31
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/16mJpyH_-vnRfl8Ywfa6k.png" alt="fig2-1" width="80%"/>
32
+ </p>
33
+
34
  ### Model Description
35
  - **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
36
  - **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
37
 
38
+ ## Usage
39
+ Please look at [GitHub](https://github.com/RLHF-V/RLAIF-V) for more details about usage.