dung1308 committed on
Commit 52d0b32 · 1 Parent(s): 9a546ba

Training in progress epoch 0

README.md CHANGED
@@ -13,8 +13,8 @@ probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [vinai/phobert-base](https://huggingface.co/vinai/phobert-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 4.6440
- - Validation Loss: 4.2231
+ - Train Loss: 5.4299
+ - Validation Loss: 4.7135
  - Epoch: 0
 
  ## Model description
@@ -34,14 +34,14 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - optimizer: {'inner_optimizer': {'class_name': 'AdamWeightDecay', 'config': {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}}, 'dynamic': True, 'initial_scale': 32768.0, 'dynamic_growth_steps': 2000}
+ - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 2e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': -516, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 1000, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
  - training_precision: mixed_float16
 
  ### Training results
 
  | Train Loss | Validation Loss | Epoch |
  |:----------:|:---------------:|:-----:|
- | 4.6440 | 4.2231 | 0 |
+ | 5.4299 | 4.7135 | 0 |
 
 
  ### Framework versions
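
The new optimizer entry wraps a `PolynomialDecay` schedule with `decay_steps: -516` inside a `WarmUp` schedule with `warmup_steps: 1000`. The sketch below reproduces only the warmup phase in plain Python, assuming the rule `initial_learning_rate * (step / warmup_steps) ** power` that the `WarmUp` class in `transformers` applies before handing over to the decay function. The function name `warmup_lr` is hypothetical. The figure of 484 total steps is also an assumption: it follows only if `decay_steps` was computed as total training steps minus warmup steps (1000 - 516 = 484), which the commit itself does not state.

```python
def warmup_lr(step, initial_lr=2e-05, warmup_steps=1000, power=1.0):
    """Learning rate during warmup, using the values from the optimizer config.

    Only the warmup phase is modelled; past warmup_steps the serialized
    PolynomialDecay would take over, and its decay_steps of -516 is not
    a meaningful horizon, so this sketch refuses to guess.
    """
    if step < 0:
        raise ValueError("step must be non-negative")
    if step >= warmup_steps:
        raise ValueError("step is past warmup; the decay schedule applies here")
    return initial_lr * (step / warmup_steps) ** power


# Assumed total of 484 steps (1000 - 516); hypothetical, see the note above.
assumed_total_steps = 484
peak_lr_reached = warmup_lr(assumed_total_steps)
```

Under that assumption the run would end while still warming up, with the learning rate at about 9.68e-06, a little under half of the configured 2e-05. That is one possible reading of the higher epoch-0 losses in this revision, not something the diff confirms.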
checkpoint/extra_data.pickle CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:962468d06d6a93b7cf28d1a7f1353f8898d951a9c82dee9b2f93a436c30af909
+ oid sha256:40c1f2408c77c4a49c361cf62f3ce59ff66278d26dd9cd3277addac1f4ce0d5c
  size 1080533909
checkpoint/weights.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:72950c0f718a9ecbe2ac165e6ad2d9f7c715920adb4c99c04ddce7d2ea4a6500
+ oid sha256:ccf0dd7717b752c19b451f55b9dafe806c6a0d4dbfa0d69031997e0b7f89aafe
  size 737947308
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2bd46ac4d35396d965abc72789cbfe4cac153106545f663e81b6a8b11376c5a8
+ oid sha256:ae6b02454727b165fa9582798a3907f0b0694a16d365c53efdaab3a78b904faa
  size 737947308
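
The three binary files above are stored as Git LFS pointers, so each diff changes only the `oid sha256:` line while `size` stays the same. A minimal sketch of reading such a pointer and checking a downloaded file against it, using only the Python standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or of any Hugging Face tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.strip().split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest the pointer records."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# The new tf_model.h5 pointer from this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:ae6b02454727b165fa9582798a3907f0b0694a16d365c53efdaab3a78b904faa\n"
    "size 737947308\n"
)
```

Calling `matches_pointer("tf_model.h5", pointer)` on a local copy would then indicate whether that copy corresponds to this commit or to the parent; the local path is an assumption.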