deanna-emery commited on
Commit
2819637
·
1 Parent(s): 32b0e02

Upload TFT5ForConditionalGeneration

Browse files
Files changed (2) hide show
  1. README.md +14 -13
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -14,15 +14,15 @@ probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [deanna-emery/ASL_t5_movinet_sentence](https://huggingface.co/deanna-emery/ASL_t5_movinet_sentence) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Train Loss: 0.2383
18
- - Train Top 1: 0.9491
19
- - Train Top 5: 0.9730
20
- - Validation Loss: 0.3164
21
- - Validation Top 1: 0.9416
22
- - Validation Top 5: 0.9642
23
- - Train Bleu: 0.5761
24
- - Train Gen Len: 13.5547
25
- - Epoch: 2
26
 
27
  ## Model description
28
 
@@ -41,16 +41,17 @@ More information needed
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
- - optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 1998, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 600}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
45
  - training_precision: float32
46
 
47
  ### Training results
48
 
49
  | Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
50
  |:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
51
- | 0.2845 | 0.9432 | 0.9669 | 0.3092 | 0.9416 | 0.9644 | 0.4712 | 14.4505 | 0 |
52
- | 0.2619 | 0.9459 | 0.9697 | 0.3116 | 0.9417 | 0.9643 | 0.6262 | 13.9115 | 1 |
53
- | 0.2383 | 0.9491 | 0.9730 | 0.3164 | 0.9416 | 0.9642 | 0.5761 | 13.5547 | 2 |
 
54
 
55
 
56
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [deanna-emery/ASL_t5_movinet_sentence](https://huggingface.co/deanna-emery/ASL_t5_movinet_sentence) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Train Loss: 0.1776
18
+ - Train Top 1: 0.9586
19
+ - Train Top 5: 0.9815
20
+ - Validation Loss: 0.3415
21
+ - Validation Top 1: 0.9408
22
+ - Validation Top 5: 0.9633
23
+ - Train Bleu: 1.984
24
+ - Train Gen Len: 12.7656
25
+ - Epoch: 3
26
 
27
  ## Model description
28
 
 
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
+ - optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 3330, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 600}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
45
  - training_precision: float32
46
 
47
  ### Training results
48
 
49
  | Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
50
  |:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
51
+ | 0.2351 | 0.9496 | 0.9736 | 0.3181 | 0.9413 | 0.9640 | 1.1898 | 13.0052 | 0 |
52
+ | 0.2173 | 0.9522 | 0.9760 | 0.3255 | 0.9409 | 0.9638 | 1.525 | 13.0182 | 1 |
53
+ | 0.1965 | 0.9554 | 0.9789 | 0.3372 | 0.9405 | 0.9630 | 1.4326 | 13.8776 | 2 |
54
+ | 0.1776 | 0.9586 | 0.9815 | 0.3415 | 0.9408 | 0.9633 | 1.984 | 12.7656 | 3 |
55
 
56
 
57
  ### Framework versions
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:856df96167528cc61d42046f91f4b8b5a2e18690219278131e9904ab2844c631
3
  size 1089544048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:52f925a56ff4e2fea8cf01fb0c0e951e1db7f152a0899a24637e3e2f9431467c
3
  size 1089544048