Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -23,14 +23,12 @@ Details for this model specifically:
|
|
23 |
* Architecture: gpt2
|
24 |
* Parameters: 124770816
|
25 |
* Maximum sequence length: 512 tokens
|
26 |
-
* Training text data (raw): [XXXX]
|
27 |
* Training tokens: 12B
|
28 |
* Vocabulary size: 50000
|
29 |
* Compute cost: ~9 NVIDIA A6000 GPU hours
|
30 |
* CO2 Emission: 1.17 kg
|
31 |
|
32 |
-
Training datasets
|
33 |
-
* 100%: [OSCAR 2021/09](https://huggingface.co/datasets/oscar-corpus/OSCAR-2109)
|
34 |
|
35 |
Checkpoints are taken at training steps: 0, 10000, 20000, 30000, 40000, 50000, 64000, 64010, 64020, 64030, 64040, 64050, 64060, 64070, 64080, 64090, 64100, 64110, 64120, 64130, 64140, 64150, 64160, 64170, 64180, 64190, 64200, 64300, 64400, 64500, 64600, 64700, 64800, 64900, 65000, 66000, 67000, 68000, 69000, 70000, 80000, 90000, 100000, 110000, 120000, 128000.
|
36 |
|
|
|
23 |
* Architecture: gpt2
|
24 |
* Parameters: 124770816
|
25 |
* Maximum sequence length: 512 tokens
|
|
|
26 |
* Training tokens: 12B
|
27 |
* Vocabulary size: 50000
|
28 |
* Compute cost: ~9 NVIDIA A6000 GPU hours
|
29 |
* CO2 Emission: 1.17 kg
|
30 |
|
31 |
+
Training dataset: [OSCAR 2021/09](https://huggingface.co/datasets/oscar-corpus/OSCAR-2109)
|
|
|
32 |
|
33 |
Checkpoints are taken at training steps: 0, 10000, 20000, 30000, 40000, 50000, 64000, 64010, 64020, 64030, 64040, 64050, 64060, 64070, 64080, 64090, 64100, 64110, 64120, 64130, 64140, 64150, 64160, 64170, 64180, 64190, 64200, 64300, 64400, 64500, 64600, 64700, 64800, 64900, 65000, 66000, 67000, 68000, 69000, 70000, 80000, 90000, 100000, 110000, 120000, 128000.
|
34 |
|