Commit
·
f50496d
1
Parent(s):
617cef1
Training in progress epoch 0
Browse files- README.md +4 -4
- checkpoint/extra_data.pickle +1 -1
- checkpoint/weights.h5 +1 -1
- logs/train/events.out.tfevents.1674577195.ip-10-39-226-229.afrd.expertcity.com.27105.0.v2 +2 -2
- logs/train/events.out.tfevents.1674587149.ip-10-39-226-229.afrd.expertcity.com.22831.0.v2 +3 -0
- logs/validation/events.out.tfevents.1674578951.ip-10-39-226-229.afrd.expertcity.com.27105.1.v2 +2 -2
- logs/validation/events.out.tfevents.1674588886.ip-10-39-226-229.afrd.expertcity.com.22831.1.v2 +3 -0
- tf_model.h5 +1 -1
README.md
CHANGED
@@ -14,8 +14,8 @@ probably proofread and complete it, then remove this comment. -->
|
|
14 |
|
15 |
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
-
- Train Loss: 5.
|
18 |
-
- Validation Loss: 4.
|
19 |
- Epoch: 0
|
20 |
|
21 |
## Model description
|
@@ -35,14 +35,14 @@ More information needed
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
-
- optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps':
|
39 |
- training_precision: mixed_float16
|
40 |
|
41 |
### Training results
|
42 |
|
43 |
| Train Loss | Validation Loss | Epoch |
|
44 |
|:----------:|:---------------:|:-----:|
|
45 |
-
| 5.
|
46 |
|
47 |
|
48 |
### Framework versions
|
|
|
14 |
|
15 |
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
+
- Train Loss: 5.4252
|
18 |
+
- Validation Loss: 4.4731
|
19 |
- Epoch: 0
|
20 |
|
21 |
## Model description
|
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
+
- optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 188670, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
|
39 |
- training_precision: mixed_float16
|
40 |
|
41 |
### Training results
|
42 |
|
43 |
| Train Loss | Validation Loss | Epoch |
|
44 |
|:----------:|:---------------:|:-----:|
|
45 |
+
| 5.4252 | 4.4731 | 0 |
|
46 |
|
47 |
|
48 |
### Framework versions
|
checkpoint/extra_data.pickle
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 748191129
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2744018ab65bcd2e8f0a2caae515bc8df8112c95f135a9135c55df522350a860
|
3 |
size 748191129
|
checkpoint/weights.h5
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 374265936
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d0d32541b09951a13cafb94d831673cdb11ad3bfc3953a12886110a964001274
|
3 |
size 374265936
|
logs/train/events.out.tfevents.1674577195.ip-10-39-226-229.afrd.expertcity.com.27105.0.v2
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d7750f85077000ae1509c0677da56374f3b49285b4e9d65e3271ec2c3e90041c
|
3 |
+
size 1371229
|
logs/train/events.out.tfevents.1674587149.ip-10-39-226-229.afrd.expertcity.com.22831.0.v2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c59f2f0873c42cf62f1c56583d74a26e44405868f7e07a3aaa35bcba3113dc25
|
3 |
+
size 1246658
|
logs/validation/events.out.tfevents.1674578951.ip-10-39-226-229.afrd.expertcity.com.27105.1.v2
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:90d5df1f082c82fabee58a557936e574ab858145d30c503343da7148b5f149cc
|
3 |
+
size 194
|
logs/validation/events.out.tfevents.1674588886.ip-10-39-226-229.afrd.expertcity.com.22831.1.v2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a9b6c5e0f70bb273f3b2aeddad73b038314809e4f85eb56231bebe1af9e88e37
|
3 |
+
size 128
|
tf_model.h5
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 374265936
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d0d32541b09951a13cafb94d831673cdb11ad3bfc3953a12886110a964001274
|
3 |
size 374265936
|