	---
	base_model: deanna-emery/ASL_t5_movinet_sentence
	tags:
	- generated_from_keras_callback
	model-index:
	- name: ASL_t5_movinet_sentence
	  results: []
	---

	<!-- This model card has been generated automatically according to the information Keras had access to. You should
	probably proofread and complete it, then remove this comment. -->

	# ASL_t5_movinet_sentence

	This model is a fine-tuned version of [deanna-emery/ASL_t5_movinet_sentence](https://huggingface.co/deanna-emery/ASL_t5_movinet_sentence) on an unknown dataset.
	After the final epoch (epoch 3), it achieves the following results on the training and validation sets:
	- Train Loss: 0.1776
	- Train Top 1: 0.9586
	- Train Top 5: 0.9815
	- Validation Loss: 0.3415
	- Validation Top 1: 0.9408
	- Validation Top 5: 0.9633
	- Train Bleu: 1.984
	- Train Gen Len: 12.7656
	- Epoch: 3

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 3330, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 600}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
	- training_precision: float32
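
The optimizer entry above records a `CosineDecay` learning-rate schedule with warmup. As a rough illustration, the sketch below re-implements that schedule in plain Python, following the documented behavior of Keras's `CosineDecay` (linear warmup from `initial_learning_rate` to `warmup_target`, then cosine decay over `decay_steps`). The function name `cosine_decay_with_warmup` is hypothetical and not part of the original training code; only the constants come from the logged config. Because `initial_learning_rate` equals `warmup_target` here, the warmup phase is effectively flat.

```python
import math

# Values copied from the logged optimizer config.
INITIAL_LR = 0.0005
WARMUP_TARGET = 0.0005
WARMUP_STEPS = 600
DECAY_STEPS = 3330
ALPHA = 0.0


def cosine_decay_with_warmup(step: int) -> float:
    """Hypothetical pure-Python approximation of the logged schedule."""
    # The rate is held at its final value once warmup + decay steps have elapsed.
    step = min(step, WARMUP_STEPS + DECAY_STEPS)
    if step < WARMUP_STEPS:
        # Linear warmup from INITIAL_LR to WARMUP_TARGET.
        fraction = step / WARMUP_STEPS
        return INITIAL_LR + (WARMUP_TARGET - INITIAL_LR) * fraction
    # Cosine decay from WARMUP_TARGET down to ALPHA * WARMUP_TARGET.
    fraction = (step - WARMUP_STEPS) / DECAY_STEPS
    cosine = 0.5 * (1.0 + math.cos(math.pi * fraction))
    return WARMUP_TARGET * ((1.0 - ALPHA) * cosine + ALPHA)


if __name__ == "__main__":
    for s in (0, 600, 2265, 3930):
        print(s, cosine_decay_with_warmup(s))
```

Under these assumptions the rate stays at 0.0005 for the first 600 steps, falls to roughly half of that at step 2265 (the midpoint of the decay phase), and reaches zero at step 3930.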

	### Training results

	| Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
	|:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
	| 0.2351 | 0.9496 | 0.9736 | 0.3181 | 0.9413 | 0.9640 | 1.1898 | 13.0052 | 0 |
	| 0.2173 | 0.9522 | 0.9760 | 0.3255 | 0.9409 | 0.9638 | 1.525 | 13.0182 | 1 |
	| 0.1965 | 0.9554 | 0.9789 | 0.3372 | 0.9405 | 0.9630 | 1.4326 | 13.8776 | 2 |
	| 0.1776 | 0.9586 | 0.9815 | 0.3415 | 0.9408 | 0.9633 | 1.984 | 12.7656 | 3 |
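
To make the trend in the table easier to read, here is a small sketch that copies the train and validation losses into Python and computes the per-epoch gap between them. The variable and function names are illustrative; only the numbers come from the table.

```python
# (epoch, train_loss, validation_loss) copied from the results table.
RESULTS = [
    (0, 0.2351, 0.3181),
    (1, 0.2173, 0.3255),
    (2, 0.1965, 0.3372),
    (3, 0.1776, 0.3415),
]


def loss_gaps(results):
    """Return {epoch: validation_loss - train_loss}, rounded to 4 decimals."""
    return {epoch: round(val - train, 4) for epoch, train, val in results}


def best_epoch_by_validation_loss(results):
    """Return the epoch with the lowest validation loss."""
    return min(results, key=lambda row: row[2])[0]


if __name__ == "__main__":
    print(loss_gaps(RESULTS))
    print(best_epoch_by_validation_loss(RESULTS))
```

On these numbers the gap widens from 0.0830 at epoch 0 to 0.1639 at epoch 3, and the lowest validation loss occurs at epoch 0. That may indicate the later epochs are starting to overfit, although validation Top 1 accuracy stays essentially flat across all four epochs.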


	### Framework versions

	- Transformers 4.34.1
	- TensorFlow 2.13.0
	- Datasets 2.15.0
	- Tokenizers 0.14.1
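
When trying to reproduce this environment, it can help to compare the installed package versions against the list above. The following is a minimal sketch using the standard library's `importlib.metadata`. The pip distribution names (`transformers`, `tensorflow`, `datasets`, `tokenizers`) are the usual ones but are an assumption here, and `find_version_mismatches` is a hypothetical helper, not something shipped with the model.

```python
from importlib import metadata

# Versions listed in the model card; distribution names are assumed.
EXPECTED = {
    "transformers": "4.34.1",
    "tensorflow": "2.13.0",
    "datasets": "2.15.0",
    "tokenizers": "0.14.1",
}


def find_version_mismatches(expected, get_version=metadata.version):
    """Return {package: (expected, installed_or_None)} for every mismatch."""
    mismatches = {}
    for package, wanted in expected.items():
        try:
            installed = get_version(package)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed
        if installed != wanted:
            mismatches[package] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, installed) in find_version_mismatches(EXPECTED).items():
        print(f"{name}: expected {wanted}, found {installed}")
```

The `get_version` parameter exists only so the lookup can be swapped out in tests; in normal use the default reads the versions from the active environment.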