Commit
β’
be067e6
1
Parent(s):
65056db
Add description about newer model in README.md (#2)
Browse files- Add description about newer model in README.md (991fc3f484aaad404966f65fb619de239434c901)
Co-authored-by: YichongEricZhao <[email protected]>
README.md
CHANGED
@@ -25,10 +25,11 @@ Firstly, combine the split model using the command `cat model.pt.part-* > model.
|
|
25 |
β βββ example_finetuning.py
|
26 |
β βββ example_pretraining.py
|
27 |
βββ experiments_data
|
28 |
-
βββ model.pt.part-aa # splited bin data of
|
29 |
-
βββ model.pt.part-ab
|
30 |
βββ model.pt.part-ac
|
31 |
βββ model.pt.part-ad
|
|
|
32 |
βββ model.py # define the architecture
|
33 |
βββ sampling.py # script to generate sequences
|
34 |
βββ tokenization.py # preparete data
|
|
|
25 |
β βββ example_finetuning.py
|
26 |
β βββ example_pretraining.py
|
27 |
βββ experiments_data
|
28 |
+
βββ model.pt.part-aa # splited bin data of *HISTORICAL* model (shorter context window, less VRAM comsuption)
|
29 |
+
βββ model.pt.part-ab
|
30 |
βββ model.pt.part-ac
|
31 |
βββ model.pt.part-ad
|
32 |
+
βββ model_updated.pt # *NEWER* model, with longer context windows and being trained on a deduplicated dataset
|
33 |
βββ model.py # define the architecture
|
34 |
βββ sampling.py # script to generate sequences
|
35 |
βββ tokenization.py # preparete data
|