Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -148,7 +148,7 @@ The adaptation procedure is explained in [this blog post](https://medium.com/@mp
 The training corpus consists of 26B tokens of several corpora gathered from web crawlings and public domain data.
-| Dataset             | Language | Tokens (per-epoch) | Epochs       |
 |---------------------|----------|--------------------|--------------|
 | Wikipedia           | en       |           2169.97M |  1.428144485 |
 | C4_es               | es       |          53709.80M | 0.1049686196 |

 The training corpus consists of 26B tokens of several corpora gathered from web crawlings and public domain data.
+| Dataset             | Language | Words (per-epoch) | Epochs       |
 |---------------------|----------|--------------------|--------------|
 | Wikipedia           | en       |           2169.97M |  1.428144485 |
 | C4_es               | es       |          53709.80M | 0.1049686196 |