Update README.md
README.md
CHANGED
@@ -3,6 +3,14 @@ license: mit
 ---
 **[ALMA-R](https://arxiv.org/abs/2401.08417)** builds upon [ALMA models](https://arxiv.org/abs/2309.11674), applying further LoRA fine-tuning with our proposed **Contrastive Preference Optimization (CPO)** instead of the supervised fine-tuning used in ALMA. CPO fine-tuning requires our [triplet preference data](https://huggingface.co/datasets/haoranxu/ALMA-R-Preference) for preference learning. ALMA-R can now match or even exceed GPT-4 and the WMT winners!
 
+@misc{xu2024contrastive,
+      title={Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation},
+      author={Haoran Xu and Amr Sharaf and Yunmo Chen and Weiting Tan and Lingfeng Shen and Benjamin Van Durme and Kenton Murray and Young Jin Kim},
+      year={2024},
+      eprint={2401.08417},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
 
 # Download ALMA(-R) Models and Dataset 🚀
 
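The README section above points to the model checkpoints and the triplet preference data on Hugging Face. As a minimal sketch of what the download section covers, the snippet below loads an ALMA-R checkpoint for translation and pulls the CPO preference triplets with `transformers` and `datasets`. The checkpoint id `haoranxu/ALMA-13B-R`, the prompt template, and the `zh-en` config name are assumptions for illustration, not taken from this diff.

```python
# Sketch: load an ALMA-R checkpoint and the CPO triplet preference data.
# Checkpoint id, prompt format, and dataset config are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from datasets import load_dataset

# Assumed ALMA-R checkpoint id on the Hugging Face Hub.
model = AutoModelForCausalLM.from_pretrained(
    "haoranxu/ALMA-13B-R",
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("haoranxu/ALMA-13B-R", padding_side="left")

# ALMA-style translation prompt (assumed format).
prompt = "Translate this from Chinese to English:\nChinese: 我爱机器翻译。\nEnglish:"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(model.device)

with torch.no_grad():
    generated = model.generate(input_ids, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(generated[0], skip_special_tokens=True))

# Triplet preference data used for CPO fine-tuning (dataset id from the README;
# the "zh-en" config name is an assumption).
preference_data = load_dataset("haoranxu/ALMA-R-Preference", "zh-en")
print(preference_data["train"][0])
```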