|
# WMT 20 |
|
|
|
This page provides pointers to the models of Facebook-FAIR's WMT'20 news translation task submission [(Chen et al., 2020)](https://arxiv.org/abs/2011.08298). |
|
|
|
## Single best MT models (after finetuning on part of WMT20 news dev set) |
|
|
|
Model | Description | Download |
|
---|---|--- |
|
`transformer.wmt20.ta-en` | Ta->En | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.ta-en.single.tar.gz) |
|
`transformer.wmt20.en-ta` | En->Ta | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.en-ta.single.tar.gz) |
|
`transformer.wmt20.iu-en.news` | Iu->En (News domain) | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.iu-en.news.single.tar.gz) |
|
`transformer.wmt20.en-iu.news` | En->Iu (News domain) | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.en-iu.news.single.tar.gz) |
|
`transformer.wmt20.iu-en.nh` | Iu->En (Nunavut Hansard domain) | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.iu-en.nh.single.tar.gz) |
|
`transformer.wmt20.en-iu.nh` | En->Iu (Nunavut Hansard domain) | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.en-iu.nh.single.tar.gz) |
|
|
|
## Language models |
|
Model | Description | Download |
|
---|---|--- |
|
`transformer_lm.wmt20.en` | En Language Model | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.en.tar.gz) |
|
`transformer_lm.wmt20.ta` | Ta Language Model | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.ta.tar.gz) |
|
`transformer_lm.wmt20.iu.news` | Iu Language Model (News domain) | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.iu.news.tar.gz) |
|
`transformer_lm.wmt20.iu.nh` | Iu Language Model (Nunavut Hansard domain) | [download (.tar.gz)](https://dl.fbaipublicfiles.com/fairseq/models/wmt20.iu.nh.tar.gz) |
|
|
|
## Example usage (torch.hub) |
|
|
|
#### Translation |
|
|
|
```python |
|
import torch |
|
|
|
# English to Tamil translation |
|
en2ta = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.en-ta') |
|
en2ta.translate("Machine learning is great!") # 'இயந்திரக் கற்றல் அருமை!' |
|
|
|
# Tamil to English translation |
|
ta2en = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.ta-en') |
|
ta2en.translate("இயந்திரக் கற்றல் அருமை!") # 'Machine learning is great!' |
|
|
|
# English to Inuktitut translation |
|
en2iu = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.en-iu.news') |
|
en2iu.translate("machine learning is great!") # 'ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ ᐱᐅᔪᒻᒪᕆᒃ!' |
|
|
|
# Inuktitut to English translation |
|
iu2en = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.iu-en.news') |
|
iu2en.translate("ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ ᐱᐅᔪᒻᒪᕆᒃ!") # 'Machine learning excellence!' |
|
``` |
|
|
|
#### Language Modeling |
|
|
|
```python |
|
# Sample from the English LM |
|
en_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.en') |
|
en_lm.sample("Machine learning is") # 'Machine learning is a type of artificial intelligence that uses machine learning to learn from data and make predictions.' |
|
|
|
# Sample from the Tamil LM |
|
ta_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.ta') |
|
ta_lm.sample("இயந்திரக் கற்றல் என்பது செயற்கை நுண்ணறிவின்") # 'இயந்திரக் கற்றல் என்பது செயற்கை நுண்ணறிவின் ஒரு பகுதியாகும்.' |
|
|
|
# Sample from the Inuktitut LM |
|
iu_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.iu.news') |
|
iu_lm.sample("ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ") # 'ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ, ᐊᒻᒪᓗ ᓯᓚᐅᑉ ᐊᓯᙳᖅᐸᓪᓕᐊᓂᖓᓄᑦ ᖃᓄᐃᓕᐅᕈᑎᒃᓴᑦ, ᐃᓚᖃᖅᖢᑎᒃ ᐅᑯᓂᖓ:' |
|
``` |
|
|
|
## Citation |
|
```bibtex |
|
@inproceedings{chen2020facebook |
|
title={Facebook AI's WMT20 News Translation Task Submission}, |
|
author={Peng-Jen Chen and Ann Lee and Changhan Wang and Naman Goyal and Angela Fan and Mary Williamson and Jiatao Gu}, |
|
booktitle={Proc. of WMT}, |
|
year={2020}, |
|
} |
|
``` |
|
|