Text Generation
Transformers
Safetensors
English
llava_llama
Yirany commited on
Commit
e31668a
·
verified ·
1 Parent(s): 5273e5c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -10,7 +10,9 @@ language:
10
 
11
  [GitHub](https://github.com/RLHF-V/RLAIF-V)
12
 
13
- **RLAIF-V-7B** is a variant of LLaVA 1.5 7B with greatly enhanced trustworthiness. The model is trained by a novel framework, **RLAIF-V**, that aligns MLLMs in a **fully open-source paradigm for super GPT-4V trustworthiness**. RLAIF-V maximally exploits the [open-source feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset) from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
 
 
14
 
15
 
16
  ## Model Details
 
10
 
11
  [GitHub](https://github.com/RLHF-V/RLAIF-V)
12
 
13
+ **RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
14
+ By aligning with human preference via large scale [open-source feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset), the model achieves **super GPT-4V trustworthiness**.
15
+ RLAIF-V maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
16
 
17
 
18
  ## Model Details