Fairseq
Catalan
German
AudreyVM commited on
Commit
e600000
1 Parent(s): c81ad3f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -3
README.md CHANGED
@@ -60,15 +60,12 @@ The Catalan-German data collected from the web was a combination of the followin
60
  | WikiMatrix | 180.322 | 125.811 |
61
  | GNOME | 12.333| 1.241|
62
  | KDE4 | 165.439 | 105.098 |
63
- | QED | 63.041 | 49.181 |
64
- | TED2020 v1 | 46.680 | 38.428 |
65
  | OpenSubtitles | 303.329 | 171.376 |
66
  | GlobalVoices| 4.636 | 3.578|
67
  | Tatoeba | 732 | 655 |
68
  | Books | 4.445 | 2049 |
69
  | Europarl | 1.734.643 | 1.734.643 |
70
  | Tilde | 3.434.091 | 3.434.091 |
71
- | **Total** | **7.427.843** | **6.258.272** |
72
 
73
  All corpora except Europarl and Tilde were collected from [Opus](https://opus.nlpl.eu/).
74
  The Europarl and Tilde corpora are synthetic parallel corpora created from the original Spanish-German corpora by [SoftCatalà](https://github.com/Softcatala).
 
60
  | WikiMatrix | 180.322 | 125.811 |
61
  | GNOME | 12.333| 1.241|
62
  | KDE4 | 165.439 | 105.098 |
 
 
63
  | OpenSubtitles | 303.329 | 171.376 |
64
  | GlobalVoices| 4.636 | 3.578|
65
  | Tatoeba | 732 | 655 |
66
  | Books | 4.445 | 2049 |
67
  | Europarl | 1.734.643 | 1.734.643 |
68
  | Tilde | 3.434.091 | 3.434.091 |
 
69
 
70
  All corpora except Europarl and Tilde were collected from [Opus](https://opus.nlpl.eu/).
71
  The Europarl and Tilde corpora are synthetic parallel corpora created from the original Spanish-German corpora by [SoftCatalà](https://github.com/Softcatala).