AdaCodruta
commited on
Commit
•
73431bc
1
Parent(s):
df06711
Model save
Browse files- README.md +1 -5
- model-00001-of-00002.safetensors +1 -1
- model-00002-of-00002.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -40,13 +40,9 @@ The following hyperparameters were used during training:
|
|
40 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
41 |
- lr_scheduler_type: linear
|
42 |
- lr_scheduler_warmup_steps: 500
|
43 |
-
-
|
44 |
- mixed_precision_training: Native AMP
|
45 |
|
46 |
-
### Training results
|
47 |
-
|
48 |
-
|
49 |
-
|
50 |
### Framework versions
|
51 |
|
52 |
- Transformers 4.46.3
|
|
|
40 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
41 |
- lr_scheduler_type: linear
|
42 |
- lr_scheduler_warmup_steps: 500
|
43 |
+
- training_steps: 5000
|
44 |
- mixed_precision_training: Native AMP
|
45 |
|
|
|
|
|
|
|
|
|
46 |
### Framework versions
|
47 |
|
48 |
- Transformers 4.46.3
|
model-00001-of-00002.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4992706480
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d8367f71e5d216a2dec4d825315681c30c471be80bcf94e7701d17e801b0fe5d
|
3 |
size 4992706480
|
model-00002-of-00002.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1180663192
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:764f4e684d3bb3eeb4ea5484922f3213e0c75d75c4bb5f14b7c338f7e6c9a29d
|
3 |
size 1180663192
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5432
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:65105c8fc3073a418a4f9a94b53479295e77ec71b6f16cb4b5c3af28a730c8a3
|
3 |
size 5432
|