Add description about newer model in README.md
#2
by
Eric2333
- opened
README.md
CHANGED
@@ -25,10 +25,11 @@ Firstly, combine the split model using the command `cat model.pt.part-* > model.
|
|
25 |
β βββ example_finetuning.py
|
26 |
β βββ example_pretraining.py
|
27 |
βββ experiments_data
|
28 |
-
βββ model.pt.part-aa # splited bin data of
|
29 |
-
βββ model.pt.part-ab
|
30 |
βββ model.pt.part-ac
|
31 |
βββ model.pt.part-ad
|
|
|
32 |
βββ model.py # define the architecture
|
33 |
βββ sampling.py # script to generate sequences
|
34 |
βββ tokenization.py # preparete data
|
|
|
25 |
β βββ example_finetuning.py
|
26 |
β βββ example_pretraining.py
|
27 |
βββ experiments_data
|
28 |
+
βββ model.pt.part-aa # splited bin data of *HISTORICAL* model (shorter context window, less VRAM comsuption)
|
29 |
+
βββ model.pt.part-ab
|
30 |
βββ model.pt.part-ac
|
31 |
βββ model.pt.part-ad
|
32 |
+
βββ model_updated.pt # *NEWER* model, with longer context windows and being trained on a deduplicated dataset
|
33 |
βββ model.py # define the architecture
|
34 |
βββ sampling.py # script to generate sequences
|
35 |
βββ tokenization.py # preparete data
|