Update README.md
Browse files
README.md
CHANGED
@@ -57,7 +57,7 @@ set a seed for reproducibility:
|
|
57 |
|
58 |
## Dataset
|
59 |
|
60 |
-
The training data was split evenly
|
61 |
|
62 |
**English and Code**
|
63 |
- [Open-Hermes-2B](https://huggingface.co/datasets/teknium/OpenHermes-2.5)
|
|
|
57 |
|
58 |
## Dataset
|
59 |
|
60 |
+
The training data was split evenly between German and English based on the total number of tokens. We would like to thank [Disco Research](https://huggingface.co/DiscoResearch), [Jan Philipp Harries](https://huggingface.co/jphme), and [Björn Plüster](https://huggingface.co/bjoernp) for making their dataset available to us.
|
61 |
|
62 |
**English and Code**
|
63 |
- [Open-Hermes-2B](https://huggingface.co/datasets/teknium/OpenHermes-2.5)
|