Update README.md
README.md
CHANGED
@@ -2,7 +2,7 @@
 license: mit
 ---
 **[ALMA-R](https://arxiv.org/abs/2401.08417)** builds upon [ALMA models](https://arxiv.org/abs/2309.11674), with further LoRA fine-tuning using our proposed **Contrastive Preference Optimization (CPO)** instead of the supervised fine-tuning used in ALMA. CPO fine-tuning requires our [triplet preference data](https://huggingface.co/datasets/haoranxu/ALMA-R-Preference) for preference learning. ALMA-R now matches or even exceeds GPT-4 and the WMT winners!
-
+```
 @misc{xu2024contrastive,
       title={Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation},
       author={Haoran Xu and Amr Sharaf and Yunmo Chen and Weiting Tan and Lingfeng Shen and Benjamin Van Durme and Kenton Murray and Young Jin Kim},
@@ -11,7 +11,7 @@ license: mit
       archivePrefix={arXiv},
       primaryClass={cs.CL}
 }
-
+```
 # Download ALMA(-R) Models and Dataset 🚀
 
 We release six translation models presented in the paper:
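For quick reference, here is a minimal sketch of loading one of the released ALMA-R checkpoints for translation with 🤗 Transformers. The model ID `haoranxu/ALMA-13B-R` and the prompt format are assumptions, not part of this commit; check them against the model list in the README.

```python
# Minimal sketch (not part of this commit): load an ALMA-R checkpoint and run a
# single translation. Model ID and prompt format are assumptions; verify them
# against the checkpoints listed in the README.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "haoranxu/ALMA-13B-R"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id, padding_side="left")
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# ALMA-style translation prompt (assumed format).
prompt = "Translate this from Chinese to English:\nChinese: 我爱机器翻译。\nEnglish:"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(model.device)

with torch.no_grad():
    output_ids = model.generate(
        input_ids, num_beams=5, max_new_tokens=64, do_sample=False
    )
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```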