halfrot
/

sft-mt5-base

Text2Text Generation

Inference Endpoints

Model card Files Files and versions Community

halfrot commited on Mar 18, 2024

Commit

13c576d

·

verified ·

1 Parent(s): 1fc2718

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -1,3 +1,8 @@
 ---
 license: apache-2.0
 ---

 ---
 license: apache-2.0
+datasets:
+- Helsinki-NLP/europarl
 ---
+Trained SFT policy for MT task in the paper "[ALaRM: Align Language Models via Hierarchical Rewards Modeling](https://arxiv.org/abs/2403.06754)".
+Check out our [project page](https://alarm-fdu.github.io/) for more information.