Update README.md
Browse files
README.md
CHANGED
@@ -60,15 +60,12 @@ The Catalan-German data collected from the web was a combination of the followin
|
|
60 |
| WikiMatrix | 180.322 | 125.811 |
|
61 |
| GNOME | 12.333| 1.241|
|
62 |
| KDE4 | 165.439 | 105.098 |
|
63 |
-
| QED | 63.041 | 49.181 |
|
64 |
-
| TED2020 v1 | 46.680 | 38.428 |
|
65 |
| OpenSubtitles | 303.329 | 171.376 |
|
66 |
| GlobalVoices| 4.636 | 3.578|
|
67 |
| Tatoeba | 732 | 655 |
|
68 |
| Books | 4.445 | 2049 |
|
69 |
| Europarl | 1.734.643 | 1.734.643 |
|
70 |
| Tilde | 3.434.091 | 3.434.091 |
|
71 |
-
| **Total** | **7.427.843** | **6.258.272** |
|
72 |
|
73 |
All corpora except Europarl and Tilde were collected from [Opus](https://opus.nlpl.eu/).
|
74 |
The Europarl and Tilde corpora are synthetic parallel corpora created from the original Spanish-German corpora by [SoftCatalà](https://github.com/Softcatala).
|
|
|
60 |
| WikiMatrix | 180.322 | 125.811 |
|
61 |
| GNOME | 12.333| 1.241|
|
62 |
| KDE4 | 165.439 | 105.098 |
|
|
|
|
|
63 |
| OpenSubtitles | 303.329 | 171.376 |
|
64 |
| GlobalVoices| 4.636 | 3.578|
|
65 |
| Tatoeba | 732 | 655 |
|
66 |
| Books | 4.445 | 2049 |
|
67 |
| Europarl | 1.734.643 | 1.734.643 |
|
68 |
| Tilde | 3.434.091 | 3.434.091 |
|
|
|
69 |
|
70 |
All corpora except Europarl and Tilde were collected from [Opus](https://opus.nlpl.eu/).
|
71 |
The Europarl and Tilde corpora are synthetic parallel corpora created from the original Spanish-German corpora by [SoftCatalà](https://github.com/Softcatala).
|