openbmb
/

RLAIF-V-7B

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Yirany commited on May 27, 2024

Commit

12c5206

·

verified ·

1 Parent(s): 9ef6abe

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ language:
 [GitHub](https://github.com/RLHF-V/RLAIF-V)
 **RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
-By aligning with human preference via large scale [open-source feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset), the model achieves **super GPT-4V trustworthiness**.
 RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and an online feedback learning algorithm.

 [GitHub](https://github.com/RLHF-V/RLAIF-V)
 **RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
+By aligning with human preference via large scale [AI feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset), the model achieves **super GPT-4V trustworthiness**.
 RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and an online feedback learning algorithm.