Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ base_model:
|
|
11 |
This model is trained on sdxl-turbo based on DPO preference data constructed by our [UnifiedReward-7B](https://huggingface.co/CodeGoat24/UnifiedReward-7b) for enhanced image generation quality.
|
12 |
|
13 |
For further details, please refer to the following resources:
|
14 |
-
- π° Paper:
|
15 |
- πͺ Project Page: https://codegoat24.github.io/UnifiedReward/
|
16 |
- π€ Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
|
17 |
- π€ Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
|
@@ -39,5 +39,10 @@ image = pipe(prompt=prompt, num_inference_steps=1, guidance_scale=0.0).images[0]
|
|
39 |
## Citation
|
40 |
|
41 |
```
|
42 |
-
|
|
|
|
|
|
|
|
|
|
|
43 |
```
|
|
|
11 |
This model is trained on sdxl-turbo based on DPO preference data constructed by our [UnifiedReward-7B](https://huggingface.co/CodeGoat24/UnifiedReward-7b) for enhanced image generation quality.
|
12 |
|
13 |
For further details, please refer to the following resources:
|
14 |
+
- π° Paper: https://arxiv.org/pdf/2503.05236
|
15 |
- πͺ Project Page: https://codegoat24.github.io/UnifiedReward/
|
16 |
- π€ Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
|
17 |
- π€ Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
|
|
|
39 |
## Citation
|
40 |
|
41 |
```
|
42 |
+
@article{UnifiedReward,
|
43 |
+
title={Unified Reward Model for Multimodal Understanding and Generation.},
|
44 |
+
author={Wang, Yibin and Zang, Yuhang, and Li, Hao and Jin, Cheng and Wang Jiaqi},
|
45 |
+
journal={arXiv preprint arXiv:2503.05236},
|
46 |
+
year={2025}
|
47 |
+
}
|
48 |
```
|