bnjmnmarie
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,8 @@ language:
|
|
3 |
- en
|
4 |
license: apache-2.0
|
5 |
library_name: transformers
|
|
|
|
|
6 |
---
|
7 |
|
8 |
## Model Details
|
@@ -16,4 +18,8 @@ All the details about the Minivoc approach and evaluation in this article:
|
|
16 |
|
17 |
- **Developed by:** [The Kaitchup](https://newsletter.kaitchup.com/)
|
18 |
- **Language(s) (NLP):** English
|
19 |
-
- **License:** apache-2.0
|
|
|
|
|
|
|
|
|
|
3 |
- en
|
4 |
license: apache-2.0
|
5 |
library_name: transformers
|
6 |
+
datasets:
|
7 |
+
- HuggingFaceFW/fineweb-edu
|
8 |
---
|
9 |
|
10 |
## Model Details
|
|
|
18 |
|
19 |
- **Developed by:** [The Kaitchup](https://newsletter.kaitchup.com/)
|
20 |
- **Language(s) (NLP):** English
|
21 |
+
- **License:** apache-2.0
|
22 |
+
|
23 |
+
## Training Data
|
24 |
+
|
25 |
+
The Minivoc approach has two training steps. For this model, I used 0.2B tokens randomly sampled from HuggingFaceFW/fineweb-edu.
|