End of training
Browse files
README.md
CHANGED
@@ -34,7 +34,7 @@ More information needed
|
|
34 |
### Training hyperparameters
|
35 |
|
36 |
The following hyperparameters were used during training:
|
37 |
-
- learning_rate:
|
38 |
- train_batch_size: 14
|
39 |
- eval_batch_size: 14
|
40 |
- seed: 42
|
@@ -97,7 +97,7 @@ The following hyperparameters were used during training:
|
|
97 |
| 5.6381 | 49.0 | 441 | 5.3355 |
|
98 |
| 5.7056 | 50.0 | 450 | 5.3355 |
|
99 |
| 5.6693 | 51.0 | 459 | 5.3355 |
|
100 |
-
| 5.
|
101 |
| 5.6551 | 53.0 | 477 | 5.3355 |
|
102 |
| 5.5851 | 54.0 | 486 | 5.3355 |
|
103 |
| 5.6209 | 55.0 | 495 | 5.3355 |
|
@@ -133,10 +133,10 @@ The following hyperparameters were used during training:
|
|
133 |
| 5.638 | 85.0 | 765 | 5.3355 |
|
134 |
| 5.6206 | 86.0 | 774 | 5.3355 |
|
135 |
| 5.7414 | 87.0 | 783 | 5.3355 |
|
136 |
-
| 5.
|
137 |
| 5.6389 | 89.0 | 801 | 5.3355 |
|
138 |
| 5.6339 | 90.0 | 810 | 5.3355 |
|
139 |
-
| 5.
|
140 |
| 5.6303 | 92.0 | 828 | 5.3355 |
|
141 |
| 5.6814 | 93.0 | 837 | 5.3355 |
|
142 |
| 5.6435 | 94.0 | 846 | 5.3355 |
|
|
|
34 |
### Training hyperparameters
|
35 |
|
36 |
The following hyperparameters were used during training:
|
37 |
+
- learning_rate: 3.125e-09
|
38 |
- train_batch_size: 14
|
39 |
- eval_batch_size: 14
|
40 |
- seed: 42
|
|
|
97 |
| 5.6381 | 49.0 | 441 | 5.3355 |
|
98 |
| 5.7056 | 50.0 | 450 | 5.3355 |
|
99 |
| 5.6693 | 51.0 | 459 | 5.3355 |
|
100 |
+
| 5.6042 | 52.0 | 468 | 5.3355 |
|
101 |
| 5.6551 | 53.0 | 477 | 5.3355 |
|
102 |
| 5.5851 | 54.0 | 486 | 5.3355 |
|
103 |
| 5.6209 | 55.0 | 495 | 5.3355 |
|
|
|
133 |
| 5.638 | 85.0 | 765 | 5.3355 |
|
134 |
| 5.6206 | 86.0 | 774 | 5.3355 |
|
135 |
| 5.7414 | 87.0 | 783 | 5.3355 |
|
136 |
+
| 5.7041 | 88.0 | 792 | 5.3355 |
|
137 |
| 5.6389 | 89.0 | 801 | 5.3355 |
|
138 |
| 5.6339 | 90.0 | 810 | 5.3355 |
|
139 |
+
| 5.6446 | 91.0 | 819 | 5.3355 |
|
140 |
| 5.6303 | 92.0 | 828 | 5.3355 |
|
141 |
| 5.6814 | 93.0 | 837 | 5.3355 |
|
142 |
| 5.6435 | 94.0 | 846 | 5.3355 |
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 649032520
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4a7e293529faef397a80c2ffbaad4a2a2a7b4a7af9e82cb9d8f0a688381aa273
|
3 |
size 649032520
|
runs/Mar06_16-32-29_Software-AI/events.out.tfevents.1709730150.Software-AI.118212.21
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5b490cb0aac83b233340b34c0f391e2e53988e123f227bebc095951797db63d0
|
3 |
+
size 47443
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4219
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:aa3a8875bffca293226fc92be5f0fe11b06b492736f1d6a9f4bfb0a82fd4b605
|
3 |
size 4219
|