metadata
base_model: deanna-emery/ASL_t5_movinet_sentence
tags:
  - generated_from_keras_callback
model-index:
  - name: ASL_t5_movinet_sentence
    results: []

ASL_t5_movinet_sentence

This model is a fine-tuned version of deanna-emery/ASL_t5_movinet_sentence; the training dataset is not specified. After the final epoch (epoch 3) it reports the following training and validation metrics:

  • Train Loss: 0.1776
  • Train Top 1: 0.9586
  • Train Top 5: 0.9815
  • Validation Loss: 0.3415
  • Validation Top 1: 0.9408
  • Validation Top 5: 0.9633
  • Train Bleu: 1.984
  • Train Gen Len: 12.7656
  • Epoch: 3
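
The card does not define "Top 1" and "Top 5". A common reading, assumed here and not confirmed by the source, is token-level top-k accuracy: the fraction of target tokens whose true id is among the k highest-scoring predictions. The sketch below illustrates that reading on toy data; the function name `top_k_accuracy` and the example scores are hypothetical.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of positions whose true label is among the k highest scores.

    scores: list of per-position score lists (one score per vocabulary id)
    labels: list of true vocabulary ids, one per position
    NOTE: this is an assumed definition, not taken from the model card.
    """
    hits = 0
    for row, label in zip(scores, labels):
        # Indices of the k largest scores for this position.
        top_k = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in top_k
    return hits / len(labels)


# Toy example: 3 positions, vocabulary of 4 ids (made-up numbers).
toy_scores = [
    [0.1, 0.7, 0.1, 0.1],  # best id is 1
    [0.4, 0.3, 0.2, 0.1],  # best id is 0, second best is 1
    [0.1, 0.2, 0.3, 0.4],  # best id is 3
]
toy_labels = [1, 1, 0]

top1 = top_k_accuracy(toy_scores, toy_labels, k=1)
top2 = top_k_accuracy(toy_scores, toy_labels, k=2)
print(top1, top2)
```

Under this definition top-k accuracy can only grow with k, which is consistent with the Top 5 values above exceeding the Top 1 values.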

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 3330, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 600}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
  • training_precision: float32
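
To make the learning-rate entry in the optimizer config easier to read, here is a minimal pure-Python sketch of a cosine decay schedule with linear warmup, using the values listed above (initial_learning_rate 0.0005, warmup_target 0.0005, warmup_steps 600, decay_steps 3330, alpha 0.0). It follows the formula as I understand it from the Keras `CosineDecay` documentation and is not the training code itself. It also ignores any adjustment that Adafactor's `relative_step` setting may apply on top of the schedule. The function name is hypothetical.

```python
import math

# Values copied from the optimizer config above.
INITIAL_LR = 0.0005
WARMUP_TARGET = 0.0005
WARMUP_STEPS = 600
DECAY_STEPS = 3330
ALPHA = 0.0


def cosine_decay_with_warmup(step):
    """Approximate learning rate at a given optimizer step (sketch only)."""
    # The schedule is held constant once warmup and decay have both finished.
    step = min(step, WARMUP_STEPS + DECAY_STEPS)
    if step < WARMUP_STEPS:
        # Linear warmup from INITIAL_LR to WARMUP_TARGET.
        return INITIAL_LR + (WARMUP_TARGET - INITIAL_LR) * step / WARMUP_STEPS
    # Cosine decay from WARMUP_TARGET down to ALPHA * WARMUP_TARGET.
    fraction = (step - WARMUP_STEPS) / DECAY_STEPS
    cosine = 0.5 * (1.0 + math.cos(math.pi * fraction))
    return WARMUP_TARGET * ((1.0 - ALPHA) * cosine + ALPHA)


for step in (0, 600, 2265, 3930):
    print(step, cosine_decay_with_warmup(step))
```

Because initial_learning_rate and warmup_target are both 0.0005, the warmup phase in this sketch is flat rather than ramping, and the rate only starts to fall after step 600.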

Training results

| Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
|---|---|---|---|---|---|---|---|---|
| 0.2351 | 0.9496 | 0.9736 | 0.3181 | 0.9413 | 0.9640 | 1.1898 | 13.0052 | 0 |
| 0.2173 | 0.9522 | 0.9760 | 0.3255 | 0.9409 | 0.9638 | 1.525 | 13.0182 | 1 |
| 0.1965 | 0.9554 | 0.9789 | 0.3372 | 0.9405 | 0.9630 | 1.4326 | 13.8776 | 2 |
| 0.1776 | 0.9586 | 0.9815 | 0.3415 | 0.9408 | 0.9633 | 1.984 | 12.7656 | 3 |
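
As a reading aid for the results above, the following sketch computes the gap between validation loss and train loss for each epoch, using only the numbers reported in this card. The variable names are hypothetical. Treating a widening gap as a sign of overfitting is an interpretation, not something the card states.

```python
# (epoch, train_loss, validation_loss) copied from the training results.
rows = [
    (0, 0.2351, 0.3181),
    (1, 0.2173, 0.3255),
    (2, 0.1965, 0.3372),
    (3, 0.1776, 0.3415),
]

# Validation loss minus train loss, per epoch.
gaps = [(epoch, round(val - train, 4)) for epoch, train, val in rows]

# Epoch with the lowest reported validation loss.
best_epoch = min(rows, key=lambda row: row[2])[0]

for epoch, gap in gaps:
    print(f"epoch {epoch}: gap = {gap}")
print("lowest validation loss at epoch", best_epoch)
```

In these four epochs the train loss falls steadily while the validation loss rises slightly, so the gap grows each epoch. This may suggest that further epochs would not improve validation performance, though four data points are too few to say so with confidence.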

Framework versions

  • Transformers 4.34.1
  • TensorFlow 2.13.0
  • Datasets 2.15.0
  • Tokenizers 0.14.1