Text Generation
Transformers
Safetensors
English
llava_llama
Inference Endpoints
Yirany commited on
Commit
12c5206
·
verified ·
1 Parent(s): 9ef6abe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ language:
11
  [GitHub](https://github.com/RLHF-V/RLAIF-V)
12
 
13
  **RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
14
- By aligning with human preference via large scale [open-source feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset), the model achieves **super GPT-4V trustworthiness**.
15
  RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and an online feedback learning algorithm.
16
 
17
 
 
11
  [GitHub](https://github.com/RLHF-V/RLAIF-V)
12
 
13
  **RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
14
+ By aligning with human preference via large scale [AI feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset), the model achieves **super GPT-4V trustworthiness**.
15
  RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and an online feedback learning algorithm.
16
 
17