Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ language:
|
|
11 |
[GitHub](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
**RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
|
14 |
-
By aligning with human preference via large scale [
|
15 |
RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and an online feedback learning algorithm.
|
16 |
|
17 |
|
|
|
11 |
[GitHub](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
**RLAIF-V-7B** is trained based on LLaVA 1.5 7B with the novel [RLAIF-V](https://github.com/RLHF-V/RLAIF-V) framework.
|
14 |
+
By aligning with human preference via large scale [AI feedback](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset), the model achieves **super GPT-4V trustworthiness**.
|
15 |
RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and an online feedback learning algorithm.
|
16 |
|
17 |
|