SIA86 commited on
Commit
be4ee71
·
1 Parent(s): 15f2640

Training in progress epoch 0

Browse files
Files changed (2) hide show
  1. README.md +7 -16
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -15,11 +15,11 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.3425
19
- - Train Accuracy: 0.9555
20
- - Validation Loss: 0.9625
21
- - Validation Accuracy: 0.7835
22
- - Epoch: 9
23
 
24
  ## Model description
25
 
@@ -38,23 +38,14 @@ More information needed
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
- - optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 4e-05, 'decay_steps': 1160, 'end_learning_rate': 4e-08, 'power': 1.0, 'cycle': False, 'name': None}}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
42
  - training_precision: float32
43
 
44
  ### Training results
45
 
46
  | Train Loss | Train Accuracy | Validation Loss | Validation Accuracy | Epoch |
47
  |:----------:|:--------------:|:---------------:|:-------------------:|:-----:|
48
- | 2.7650 | 0.2801 | 2.4192 | 0.3463 | 0 |
49
- | 2.3111 | 0.3626 | 2.1394 | 0.3680 | 1 |
50
- | 1.8627 | 0.4777 | 1.7134 | 0.5325 | 2 |
51
- | 1.3526 | 0.6580 | 1.4877 | 0.5758 | 3 |
52
- | 1.0242 | 0.7318 | 1.3084 | 0.6623 | 4 |
53
- | 0.7743 | 0.8046 | 1.2363 | 0.6883 | 5 |
54
- | 0.5871 | 0.8784 | 1.0712 | 0.7273 | 6 |
55
- | 0.4679 | 0.9164 | 1.0081 | 0.7749 | 7 |
56
- | 0.3966 | 0.9381 | 0.9864 | 0.7706 | 8 |
57
- | 0.3425 | 0.9555 | 0.9625 | 0.7835 | 9 |
58
 
59
 
60
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 2.8423
19
+ - Train Accuracy: 0.2313
20
+ - Validation Loss: 2.5340
21
+ - Validation Accuracy: 0.3593
22
+ - Epoch: 0
23
 
24
  ## Model description
25
 
 
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
+ - optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 2320, 'end_learning_rate': 0, 'power': 1.0, 'cycle': False, 'name': None}}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
42
  - training_precision: float32
43
 
44
  ### Training results
45
 
46
  | Train Loss | Train Accuracy | Validation Loss | Validation Accuracy | Epoch |
47
  |:----------:|:--------------:|:---------------:|:-------------------:|:-----:|
48
+ | 2.8423 | 0.2313 | 2.5340 | 0.3593 | 0 |
 
 
 
 
 
 
 
 
 
49
 
50
 
51
  ### Framework versions
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d20eb0dc581653a4361caf7747ea05ee88d5a0fadab54f62cf905b42e41a07c1
3
  size 711818664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0508dfb64a5307ecd322af864378a7f6aded18f00b16637bf7a3c7a8c8cb42a5
3
  size 711818664