|
--- |
|
language: |
|
- nl |
|
- en |
|
- lb |
|
- af |
|
- de |
|
- fy |
|
- yi |
|
- gmw |
|
|
|
tags: |
|
- translation |
|
|
|
license: apache-2.0 |
|
--- |
|
|
|
### gmw-gmw |
|
|
|
* source group: West Germanic languages |
|
* target group: West Germanic languages |
|
* OPUS readme: [gmw-gmw](https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/gmw-gmw/README.md) |
|
|
|
* model: transformer |
|
* source language(s): afr ang_Latn deu eng enm_Latn frr fry gos gsw ksh ltz nds nld pdc sco stq swg yid |
|
* target language(s): afr ang_Latn deu eng enm_Latn frr fry gos gsw ksh ltz nds nld pdc sco stq swg yid |
|
* model: transformer |
|
* pre-processing: normalization + SentencePiece (spm32k,spm32k) |
|
* a sentence initial language token is required in the form of `>>id<<` (id = valid target language ID) |
|
* download original weights: [opus-2020-07-27.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/gmw-gmw/opus-2020-07-27.zip) |
|
* test set translations: [opus-2020-07-27.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/gmw-gmw/opus-2020-07-27.test.txt) |
|
* test set scores: [opus-2020-07-27.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/gmw-gmw/opus-2020-07-27.eval.txt) |
|
|
|
## Benchmarks |
|
|
|
| testset | BLEU | chr-F | |
|
|-----------------------|-------|-------| |
|
| newssyscomb2009-deueng.deu.eng | 25.3 | 0.527 | |
|
| newssyscomb2009-engdeu.eng.deu | 19.0 | 0.502 | |
|
| news-test2008-deueng.deu.eng | 23.7 | 0.515 | |
|
| news-test2008-engdeu.eng.deu | 19.2 | 0.491 | |
|
| newstest2009-deueng.deu.eng | 23.1 | 0.514 | |
|
| newstest2009-engdeu.eng.deu | 18.6 | 0.495 | |
|
| newstest2010-deueng.deu.eng | 25.8 | 0.545 | |
|
| newstest2010-engdeu.eng.deu | 20.3 | 0.505 | |
|
| newstest2011-deueng.deu.eng | 23.7 | 0.523 | |
|
| newstest2011-engdeu.eng.deu | 18.9 | 0.490 | |
|
| newstest2012-deueng.deu.eng | 24.4 | 0.529 | |
|
| newstest2012-engdeu.eng.deu | 19.2 | 0.489 | |
|
| newstest2013-deueng.deu.eng | 27.2 | 0.545 | |
|
| newstest2013-engdeu.eng.deu | 22.4 | 0.514 | |
|
| newstest2014-deen-deueng.deu.eng | 27.0 | 0.546 | |
|
| newstest2015-ende-deueng.deu.eng | 28.4 | 0.552 | |
|
| newstest2015-ende-engdeu.eng.deu | 25.3 | 0.541 | |
|
| newstest2016-ende-deueng.deu.eng | 33.2 | 0.595 | |
|
| newstest2016-ende-engdeu.eng.deu | 29.8 | 0.578 | |
|
| newstest2017-ende-deueng.deu.eng | 29.0 | 0.557 | |
|
| newstest2017-ende-engdeu.eng.deu | 23.9 | 0.534 | |
|
| newstest2018-ende-deueng.deu.eng | 35.9 | 0.607 | |
|
| newstest2018-ende-engdeu.eng.deu | 34.8 | 0.609 | |
|
| newstest2019-deen-deueng.deu.eng | 32.1 | 0.579 | |
|
| newstest2019-ende-engdeu.eng.deu | 31.0 | 0.579 | |
|
| Tatoeba-test.afr-ang.afr.ang | 0.0 | 0.065 | |
|
| Tatoeba-test.afr-deu.afr.deu | 46.8 | 0.668 | |
|
| Tatoeba-test.afr-eng.afr.eng | 58.5 | 0.728 | |
|
| Tatoeba-test.afr-enm.afr.enm | 13.4 | 0.357 | |
|
| Tatoeba-test.afr-fry.afr.fry | 5.3 | 0.026 | |
|
| Tatoeba-test.afr-gos.afr.gos | 3.5 | 0.228 | |
|
| Tatoeba-test.afr-ltz.afr.ltz | 1.6 | 0.131 | |
|
| Tatoeba-test.afr-nld.afr.nld | 55.4 | 0.715 | |
|
| Tatoeba-test.afr-yid.afr.yid | 3.4 | 0.008 | |
|
| Tatoeba-test.ang-afr.ang.afr | 3.1 | 0.096 | |
|
| Tatoeba-test.ang-deu.ang.deu | 2.6 | 0.188 | |
|
| Tatoeba-test.ang-eng.ang.eng | 5.4 | 0.211 | |
|
| Tatoeba-test.ang-enm.ang.enm | 1.7 | 0.197 | |
|
| Tatoeba-test.ang-gos.ang.gos | 6.6 | 0.186 | |
|
| Tatoeba-test.ang-ltz.ang.ltz | 5.3 | 0.072 | |
|
| Tatoeba-test.ang-yid.ang.yid | 0.9 | 0.131 | |
|
| Tatoeba-test.deu-afr.deu.afr | 52.7 | 0.699 | |
|
| Tatoeba-test.deu-ang.deu.ang | 0.8 | 0.133 | |
|
| Tatoeba-test.deu-eng.deu.eng | 43.5 | 0.621 | |
|
| Tatoeba-test.deu-enm.deu.enm | 6.9 | 0.245 | |
|
| Tatoeba-test.deu-frr.deu.frr | 0.8 | 0.200 | |
|
| Tatoeba-test.deu-fry.deu.fry | 15.1 | 0.367 | |
|
| Tatoeba-test.deu-gos.deu.gos | 2.2 | 0.279 | |
|
| Tatoeba-test.deu-gsw.deu.gsw | 1.0 | 0.176 | |
|
| Tatoeba-test.deu-ksh.deu.ksh | 0.6 | 0.208 | |
|
| Tatoeba-test.deu-ltz.deu.ltz | 12.1 | 0.274 | |
|
| Tatoeba-test.deu-nds.deu.nds | 18.8 | 0.446 | |
|
| Tatoeba-test.deu-nld.deu.nld | 48.6 | 0.669 | |
|
| Tatoeba-test.deu-pdc.deu.pdc | 4.6 | 0.198 | |
|
| Tatoeba-test.deu-sco.deu.sco | 12.0 | 0.340 | |
|
| Tatoeba-test.deu-stq.deu.stq | 3.2 | 0.240 | |
|
| Tatoeba-test.deu-swg.deu.swg | 0.5 | 0.179 | |
|
| Tatoeba-test.deu-yid.deu.yid | 1.7 | 0.160 | |
|
| Tatoeba-test.eng-afr.eng.afr | 55.8 | 0.730 | |
|
| Tatoeba-test.eng-ang.eng.ang | 5.7 | 0.157 | |
|
| Tatoeba-test.eng-deu.eng.deu | 36.7 | 0.584 | |
|
| Tatoeba-test.eng-enm.eng.enm | 2.0 | 0.272 | |
|
| Tatoeba-test.eng-frr.eng.frr | 6.1 | 0.246 | |
|
| Tatoeba-test.eng-fry.eng.fry | 15.3 | 0.378 | |
|
| Tatoeba-test.eng-gos.eng.gos | 1.2 | 0.242 | |
|
| Tatoeba-test.eng-gsw.eng.gsw | 0.9 | 0.164 | |
|
| Tatoeba-test.eng-ksh.eng.ksh | 0.9 | 0.170 | |
|
| Tatoeba-test.eng-ltz.eng.ltz | 13.7 | 0.263 | |
|
| Tatoeba-test.eng-nds.eng.nds | 17.1 | 0.410 | |
|
| Tatoeba-test.eng-nld.eng.nld | 49.6 | 0.673 | |
|
| Tatoeba-test.eng-pdc.eng.pdc | 5.1 | 0.218 | |
|
| Tatoeba-test.eng-sco.eng.sco | 34.8 | 0.587 | |
|
| Tatoeba-test.eng-stq.eng.stq | 2.1 | 0.322 | |
|
| Tatoeba-test.eng-swg.eng.swg | 1.7 | 0.192 | |
|
| Tatoeba-test.eng-yid.eng.yid | 1.7 | 0.173 | |
|
| Tatoeba-test.enm-afr.enm.afr | 13.4 | 0.397 | |
|
| Tatoeba-test.enm-ang.enm.ang | 0.7 | 0.063 | |
|
| Tatoeba-test.enm-deu.enm.deu | 41.5 | 0.514 | |
|
| Tatoeba-test.enm-eng.enm.eng | 21.3 | 0.483 | |
|
| Tatoeba-test.enm-fry.enm.fry | 0.0 | 0.058 | |
|
| Tatoeba-test.enm-gos.enm.gos | 10.7 | 0.354 | |
|
| Tatoeba-test.enm-ksh.enm.ksh | 7.0 | 0.161 | |
|
| Tatoeba-test.enm-nds.enm.nds | 18.6 | 0.316 | |
|
| Tatoeba-test.enm-nld.enm.nld | 38.3 | 0.524 | |
|
| Tatoeba-test.enm-yid.enm.yid | 0.7 | 0.128 | |
|
| Tatoeba-test.frr-deu.frr.deu | 4.1 | 0.219 | |
|
| Tatoeba-test.frr-eng.frr.eng | 14.1 | 0.186 | |
|
| Tatoeba-test.frr-fry.frr.fry | 3.1 | 0.129 | |
|
| Tatoeba-test.frr-gos.frr.gos | 3.6 | 0.226 | |
|
| Tatoeba-test.frr-nds.frr.nds | 12.4 | 0.145 | |
|
| Tatoeba-test.frr-nld.frr.nld | 9.8 | 0.209 | |
|
| Tatoeba-test.frr-stq.frr.stq | 2.8 | 0.142 | |
|
| Tatoeba-test.fry-afr.fry.afr | 0.0 | 1.000 | |
|
| Tatoeba-test.fry-deu.fry.deu | 30.1 | 0.535 | |
|
| Tatoeba-test.fry-eng.fry.eng | 28.0 | 0.486 | |
|
| Tatoeba-test.fry-enm.fry.enm | 16.0 | 0.262 | |
|
| Tatoeba-test.fry-frr.fry.frr | 5.5 | 0.160 | |
|
| Tatoeba-test.fry-gos.fry.gos | 1.6 | 0.307 | |
|
| Tatoeba-test.fry-ltz.fry.ltz | 30.4 | 0.438 | |
|
| Tatoeba-test.fry-nds.fry.nds | 8.1 | 0.083 | |
|
| Tatoeba-test.fry-nld.fry.nld | 41.4 | 0.616 | |
|
| Tatoeba-test.fry-stq.fry.stq | 1.6 | 0.217 | |
|
| Tatoeba-test.fry-yid.fry.yid | 1.6 | 0.159 | |
|
| Tatoeba-test.gos-afr.gos.afr | 6.3 | 0.318 | |
|
| Tatoeba-test.gos-ang.gos.ang | 6.2 | 0.058 | |
|
| Tatoeba-test.gos-deu.gos.deu | 11.7 | 0.363 | |
|
| Tatoeba-test.gos-eng.gos.eng | 14.9 | 0.322 | |
|
| Tatoeba-test.gos-enm.gos.enm | 9.1 | 0.398 | |
|
| Tatoeba-test.gos-frr.gos.frr | 3.3 | 0.117 | |
|
| Tatoeba-test.gos-fry.gos.fry | 13.1 | 0.387 | |
|
| Tatoeba-test.gos-ltz.gos.ltz | 3.1 | 0.154 | |
|
| Tatoeba-test.gos-nds.gos.nds | 2.4 | 0.206 | |
|
| Tatoeba-test.gos-nld.gos.nld | 13.9 | 0.395 | |
|
| Tatoeba-test.gos-stq.gos.stq | 2.1 | 0.209 | |
|
| Tatoeba-test.gos-yid.gos.yid | 1.7 | 0.147 | |
|
| Tatoeba-test.gsw-deu.gsw.deu | 10.5 | 0.350 | |
|
| Tatoeba-test.gsw-eng.gsw.eng | 10.7 | 0.299 | |
|
| Tatoeba-test.ksh-deu.ksh.deu | 12.0 | 0.373 | |
|
| Tatoeba-test.ksh-eng.ksh.eng | 3.2 | 0.225 | |
|
| Tatoeba-test.ksh-enm.ksh.enm | 13.4 | 0.308 | |
|
| Tatoeba-test.ltz-afr.ltz.afr | 37.4 | 0.525 | |
|
| Tatoeba-test.ltz-ang.ltz.ang | 2.8 | 0.036 | |
|
| Tatoeba-test.ltz-deu.ltz.deu | 40.3 | 0.596 | |
|
| Tatoeba-test.ltz-eng.ltz.eng | 31.7 | 0.490 | |
|
| Tatoeba-test.ltz-fry.ltz.fry | 36.3 | 0.658 | |
|
| Tatoeba-test.ltz-gos.ltz.gos | 2.9 | 0.209 | |
|
| Tatoeba-test.ltz-nld.ltz.nld | 38.8 | 0.530 | |
|
| Tatoeba-test.ltz-stq.ltz.stq | 5.8 | 0.165 | |
|
| Tatoeba-test.ltz-yid.ltz.yid | 1.0 | 0.159 | |
|
| Tatoeba-test.multi.multi | 36.4 | 0.568 | |
|
| Tatoeba-test.nds-deu.nds.deu | 35.0 | 0.573 | |
|
| Tatoeba-test.nds-eng.nds.eng | 29.6 | 0.495 | |
|
| Tatoeba-test.nds-enm.nds.enm | 3.7 | 0.194 | |
|
| Tatoeba-test.nds-frr.nds.frr | 6.6 | 0.133 | |
|
| Tatoeba-test.nds-fry.nds.fry | 4.2 | 0.087 | |
|
| Tatoeba-test.nds-gos.nds.gos | 2.0 | 0.243 | |
|
| Tatoeba-test.nds-nld.nds.nld | 41.4 | 0.618 | |
|
| Tatoeba-test.nds-swg.nds.swg | 0.6 | 0.178 | |
|
| Tatoeba-test.nds-yid.nds.yid | 8.3 | 0.238 | |
|
| Tatoeba-test.nld-afr.nld.afr | 59.4 | 0.759 | |
|
| Tatoeba-test.nld-deu.nld.deu | 49.9 | 0.685 | |
|
| Tatoeba-test.nld-eng.nld.eng | 54.1 | 0.699 | |
|
| Tatoeba-test.nld-enm.nld.enm | 5.0 | 0.250 | |
|
| Tatoeba-test.nld-frr.nld.frr | 2.4 | 0.224 | |
|
| Tatoeba-test.nld-fry.nld.fry | 19.4 | 0.446 | |
|
| Tatoeba-test.nld-gos.nld.gos | 2.5 | 0.273 | |
|
| Tatoeba-test.nld-ltz.nld.ltz | 13.8 | 0.292 | |
|
| Tatoeba-test.nld-nds.nld.nds | 21.3 | 0.457 | |
|
| Tatoeba-test.nld-sco.nld.sco | 14.7 | 0.423 | |
|
| Tatoeba-test.nld-stq.nld.stq | 1.9 | 0.257 | |
|
| Tatoeba-test.nld-swg.nld.swg | 4.2 | 0.162 | |
|
| Tatoeba-test.nld-yid.nld.yid | 2.6 | 0.186 | |
|
| Tatoeba-test.pdc-deu.pdc.deu | 39.7 | 0.529 | |
|
| Tatoeba-test.pdc-eng.pdc.eng | 25.0 | 0.427 | |
|
| Tatoeba-test.sco-deu.sco.deu | 28.4 | 0.428 | |
|
| Tatoeba-test.sco-eng.sco.eng | 41.8 | 0.595 | |
|
| Tatoeba-test.sco-nld.sco.nld | 36.4 | 0.565 | |
|
| Tatoeba-test.stq-deu.stq.deu | 7.7 | 0.328 | |
|
| Tatoeba-test.stq-eng.stq.eng | 21.1 | 0.428 | |
|
| Tatoeba-test.stq-frr.stq.frr | 2.0 | 0.118 | |
|
| Tatoeba-test.stq-fry.stq.fry | 6.3 | 0.255 | |
|
| Tatoeba-test.stq-gos.stq.gos | 1.4 | 0.244 | |
|
| Tatoeba-test.stq-ltz.stq.ltz | 4.4 | 0.204 | |
|
| Tatoeba-test.stq-nld.stq.nld | 10.7 | 0.371 | |
|
| Tatoeba-test.stq-yid.stq.yid | 1.4 | 0.105 | |
|
| Tatoeba-test.swg-deu.swg.deu | 9.5 | 0.343 | |
|
| Tatoeba-test.swg-eng.swg.eng | 15.1 | 0.306 | |
|
| Tatoeba-test.swg-nds.swg.nds | 0.7 | 0.196 | |
|
| Tatoeba-test.swg-nld.swg.nld | 11.6 | 0.308 | |
|
| Tatoeba-test.swg-yid.swg.yid | 0.9 | 0.186 | |
|
| Tatoeba-test.yid-afr.yid.afr | 100.0 | 1.000 | |
|
| Tatoeba-test.yid-ang.yid.ang | 0.6 | 0.079 | |
|
| Tatoeba-test.yid-deu.yid.deu | 16.7 | 0.372 | |
|
| Tatoeba-test.yid-eng.yid.eng | 15.8 | 0.344 | |
|
| Tatoeba-test.yid-enm.yid.enm | 1.3 | 0.166 | |
|
| Tatoeba-test.yid-fry.yid.fry | 5.6 | 0.157 | |
|
| Tatoeba-test.yid-gos.yid.gos | 2.2 | 0.160 | |
|
| Tatoeba-test.yid-ltz.yid.ltz | 2.1 | 0.238 | |
|
| Tatoeba-test.yid-nds.yid.nds | 14.4 | 0.365 | |
|
| Tatoeba-test.yid-nld.yid.nld | 20.9 | 0.397 | |
|
| Tatoeba-test.yid-stq.yid.stq | 3.7 | 0.165 | |
|
| Tatoeba-test.yid-swg.yid.swg | 1.8 | 0.156 | |
|
|
|
|
|
### System Info: |
|
- hf_name: gmw-gmw |
|
|
|
- source_languages: gmw |
|
|
|
- target_languages: gmw |
|
|
|
- opus_readme_url: https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/gmw-gmw/README.md |
|
|
|
- original_repo: Tatoeba-Challenge |
|
|
|
- tags: ['translation'] |
|
|
|
- languages: ['nl', 'en', 'lb', 'af', 'de', 'fy', 'yi', 'gmw'] |
|
|
|
- src_constituents: {'ksh', 'nld', 'eng', 'enm_Latn', 'ltz', 'stq', 'afr', 'pdc', 'deu', 'gos', 'ang_Latn', 'fry', 'gsw', 'frr', 'nds', 'yid', 'swg', 'sco'} |
|
|
|
- tgt_constituents: {'ksh', 'nld', 'eng', 'enm_Latn', 'ltz', 'stq', 'afr', 'pdc', 'deu', 'gos', 'ang_Latn', 'fry', 'gsw', 'frr', 'nds', 'yid', 'swg', 'sco'} |
|
|
|
- src_multilingual: True |
|
|
|
- tgt_multilingual: True |
|
|
|
- prepro: normalization + SentencePiece (spm32k,spm32k) |
|
|
|
- url_model: https://object.pouta.csc.fi/Tatoeba-MT-models/gmw-gmw/opus-2020-07-27.zip |
|
|
|
- url_test_set: https://object.pouta.csc.fi/Tatoeba-MT-models/gmw-gmw/opus-2020-07-27.test.txt |
|
|
|
- src_alpha3: gmw |
|
|
|
- tgt_alpha3: gmw |
|
|
|
- short_pair: gmw-gmw |
|
|
|
- chrF2_score: 0.568 |
|
|
|
- bleu: 36.4 |
|
|
|
- brevity_penalty: 1.0 |
|
|
|
- ref_len: 72534.0 |
|
|
|
- src_name: West Germanic languages |
|
|
|
- tgt_name: West Germanic languages |
|
|
|
- train_date: 2020-07-27 |
|
|
|
- src_alpha2: gmw |
|
|
|
- tgt_alpha2: gmw |
|
|
|
- prefer_old: False |
|
|
|
- long_pair: gmw-gmw |
|
|
|
- helsinki_git_sha: 480fcbe0ee1bf4774bcbe6226ad9f58e63f6c535 |
|
|
|
- transformers_git_sha: 2207e5d8cb224e954a7cba69fa4ac2309e9ff30b |
|
|
|
- port_machine: brutasse |
|
|
|
- port_time: 2020-08-21-14:41 |