Ashraf-kasem commited on
Commit
f50496d
·
1 Parent(s): 617cef1

Training in progress epoch 0

Browse files
README.md CHANGED
@@ -14,8 +14,8 @@ probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Train Loss: 5.6125
18
- - Validation Loss: 4.9977
19
  - Epoch: 0
20
 
21
  ## Model description
@@ -35,14 +35,14 @@ More information needed
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
- - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 6289, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
39
  - training_precision: mixed_float16
40
 
41
  ### Training results
42
 
43
  | Train Loss | Validation Loss | Epoch |
44
  |:----------:|:---------------:|:-----:|
45
- | 5.6125 | 4.9977 | 0 |
46
 
47
 
48
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Train Loss: 5.4252
18
+ - Validation Loss: 4.4731
19
  - Epoch: 0
20
 
21
  ## Model description
 
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
+ - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 188670, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
39
  - training_precision: mixed_float16
40
 
41
  ### Training results
42
 
43
  | Train Loss | Validation Loss | Epoch |
44
  |:----------:|:---------------:|:-----:|
45
+ | 5.4252 | 4.4731 | 0 |
46
 
47
 
48
  ### Framework versions
checkpoint/extra_data.pickle CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5525bd1c8028ae27f9f44029765534ecb32424ea29e9a4ad3d0291c5439a5a65
3
  size 748191129
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2744018ab65bcd2e8f0a2caae515bc8df8112c95f135a9135c55df522350a860
3
  size 748191129
checkpoint/weights.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:027f221adb36d4468f38589a470ff47568af72b5f53a6e87dead3593b040c648
3
  size 374265936
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0d32541b09951a13cafb94d831673cdb11ad3bfc3953a12886110a964001274
3
  size 374265936
logs/train/events.out.tfevents.1674577195.ip-10-39-226-229.afrd.expertcity.com.27105.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6e18b1f6529faed8526093f15b6b1cd3429a00d144299032fd1a173fb2260a3b
3
- size 1246657
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7750f85077000ae1509c0677da56374f3b49285b4e9d65e3271ec2c3e90041c
3
+ size 1371229
logs/train/events.out.tfevents.1674587149.ip-10-39-226-229.afrd.expertcity.com.22831.0.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c59f2f0873c42cf62f1c56583d74a26e44405868f7e07a3aaa35bcba3113dc25
3
+ size 1246658
logs/validation/events.out.tfevents.1674578951.ip-10-39-226-229.afrd.expertcity.com.27105.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:824aa5fdfdc061f7538eb2a65ef9f4100e7d46138034da213e46f788f09b48ef
3
- size 128
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:90d5df1f082c82fabee58a557936e574ab858145d30c503343da7148b5f149cc
3
+ size 194
logs/validation/events.out.tfevents.1674588886.ip-10-39-226-229.afrd.expertcity.com.22831.1.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a9b6c5e0f70bb273f3b2aeddad73b038314809e4f85eb56231bebe1af9e88e37
3
+ size 128
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:027f221adb36d4468f38589a470ff47568af72b5f53a6e87dead3593b040c648
3
  size 374265936
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0d32541b09951a13cafb94d831673cdb11ad3bfc3953a12886110a964001274
3
  size 374265936