openbmb/RLAIF-V-7B

XiaomanLu committed on May 20, 2024

Commit d2127c2 · verified · 1 Parent(s): f1d3c3a

Update README.md

Files changed (1)

README.md +21 -3

@@ -1,3 +1,21 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+datasets:
+- HaoyeZhang/RLAIF-V-Dataset
+language:
+- en
+---
+# Model Card for RLAIF-V
+[GitHub ](https://github.com/RLHF-V/RLAIF-V)  | [Paper]()
+RLAIF-V is a novel framework that aligns MLLMs in a fully open-source paradigm for super GPT-4V trustworthiness. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and online feedback learning algorithm.
+## Model Details
+### Model Description
+- **Trained from model:** LLaVA1.5 7B
+- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)