Model save

739c9b8 verified about 1 year ago

3.32 kB

	---
	tags:
	- generated_from_trainer
	model-index:
	- name: pp-mod-subj-lstm-1
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# pp-mod-subj-lstm-1

	This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
	It achieves the following results on the evaluation set:
	- Loss: 4.0224

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 5e-05
	- train_batch_size: 32
	- eval_batch_size: 32
	- seed: 1
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- training_steps: 3052726

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \|
	\|:-------------:\|:-----:\|:-------:\|:---------------:\|
	\| 4.7849 \| 0.03 \| 76319 \| 4.8022 \|
	\| 4.5004 \| 1.03 \| 152638 \| 4.5227 \|
	\| 4.3599 \| 0.03 \| 228957 \| 4.3896 \|
	\| 4.2696 \| 1.03 \| 305276 \| 4.3071 \|
	\| 4.2077 \| 0.03 \| 381595 \| 4.2521 \|
	\| 4.159 \| 1.03 \| 457914 \| 4.2115 \|
	\| 4.1216 \| 0.03 \| 534233 \| 4.1809 \|
	\| 4.0933 \| 1.03 \| 610552 \| 4.1563 \|
	\| 4.0655 \| 0.03 \| 686871 \| 4.1367 \|
	\| 4.04 \| 0.03 \| 763190 \| 4.1205 \|
	\| 4.0235 \| 1.03 \| 839509 \| 4.1073 \|
	\| 3.9961 \| 0.03 \| 915829 \| 4.0956 \|
	\| 3.9757 \| 1.03 \| 992149 \| 4.0857 \|
	\| 3.9626 \| 2.03 \| 1068469 \| 4.0777 \|
	\| 3.9615 \| 0.03 \| 1144789 \| 4.0696 \|
	\| 3.95 \| 1.03 \| 1221109 \| 4.0640 \|
	\| 3.9353 \| 0.03 \| 1297429 \| 4.0601 \|
	\| 3.9269 \| 1.03 \| 1373749 \| 4.0546 \|
	\| 3.92 \| 0.03 \| 1450069 \| 4.0511 \|
	\| 3.9153 \| 1.03 \| 1526389 \| 4.0480 \|
	\| 3.9133 \| 2.03 \| 1602709 \| 4.0449 \|
	\| 3.9024 \| 0.03 \| 1679029 \| 4.0422 \|
	\| 3.8976 \| 1.03 \| 1755349 \| 4.0404 \|
	\| 3.893 \| 2.03 \| 1831669 \| 4.0375 \|
	\| 3.8841 \| 0.03 \| 1907989 \| 4.0360 \|
	\| 3.8781 \| 1.03 \| 1984309 \| 4.0336 \|
	\| 3.8733 \| 0.03 \| 2060629 \| 4.0318 \|
	\| 3.8696 \| 0.03 \| 2136949 \| 4.0307 \|
	\| 3.8654 \| 1.03 \| 2213269 \| 4.0296 \|
	\| 3.8611 \| 2.03 \| 2289589 \| 4.0286 \|
	\| 3.8572 \| 0.03 \| 2365909 \| 4.0275 \|
	\| 3.8535 \| 0.03 \| 2442229 \| 4.0267 \|
	\| 3.8476 \| 0.03 \| 2518549 \| 4.0260 \|
	\| 3.8458 \| 1.03 \| 2594869 \| 4.0250 \|
	\| 3.8425 \| 0.03 \| 2671189 \| 4.0245 \|
	\| 3.8468 \| 1.03 \| 2747509 \| 4.0237 \|
	\| 3.847 \| 2.03 \| 2823829 \| 4.0235 \|
	\| 3.8412 \| 0.03 \| 2900149 \| 4.0230 \|
	\| 3.8407 \| 1.03 \| 2976469 \| 4.0225 \|
	\| 3.8391 \| 2.02 \| 3052726 \| 4.0224 \|


	### Framework versions

	- Transformers 4.33.3
	- Pytorch 2.0.1
	- Datasets 2.12.0
	- Tokenizers 0.13.3

	---
	tags:
	- generated_from_trainer
	model-index:
	- name: pp-mod-subj-lstm-1
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# pp-mod-subj-lstm-1

	This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
	It achieves the following results on the evaluation set:
	- Loss: 4.0224

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 5e-05
	- train_batch_size: 32
	- eval_batch_size: 32
	- seed: 1
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- training_steps: 3052726

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \|
	\|:-------------:\|:-----:\|:-------:\|:---------------:\|
	\| 4.7849 \| 0.03 \| 76319 \| 4.8022 \|
	\| 4.5004 \| 1.03 \| 152638 \| 4.5227 \|
	\| 4.3599 \| 0.03 \| 228957 \| 4.3896 \|
	\| 4.2696 \| 1.03 \| 305276 \| 4.3071 \|
	\| 4.2077 \| 0.03 \| 381595 \| 4.2521 \|
	\| 4.159 \| 1.03 \| 457914 \| 4.2115 \|
	\| 4.1216 \| 0.03 \| 534233 \| 4.1809 \|
	\| 4.0933 \| 1.03 \| 610552 \| 4.1563 \|
	\| 4.0655 \| 0.03 \| 686871 \| 4.1367 \|
	\| 4.04 \| 0.03 \| 763190 \| 4.1205 \|
	\| 4.0235 \| 1.03 \| 839509 \| 4.1073 \|
	\| 3.9961 \| 0.03 \| 915829 \| 4.0956 \|
	\| 3.9757 \| 1.03 \| 992149 \| 4.0857 \|
	\| 3.9626 \| 2.03 \| 1068469 \| 4.0777 \|
	\| 3.9615 \| 0.03 \| 1144789 \| 4.0696 \|
	\| 3.95 \| 1.03 \| 1221109 \| 4.0640 \|
	\| 3.9353 \| 0.03 \| 1297429 \| 4.0601 \|
	\| 3.9269 \| 1.03 \| 1373749 \| 4.0546 \|
	\| 3.92 \| 0.03 \| 1450069 \| 4.0511 \|
	\| 3.9153 \| 1.03 \| 1526389 \| 4.0480 \|
	\| 3.9133 \| 2.03 \| 1602709 \| 4.0449 \|
	\| 3.9024 \| 0.03 \| 1679029 \| 4.0422 \|
	\| 3.8976 \| 1.03 \| 1755349 \| 4.0404 \|
	\| 3.893 \| 2.03 \| 1831669 \| 4.0375 \|
	\| 3.8841 \| 0.03 \| 1907989 \| 4.0360 \|
	\| 3.8781 \| 1.03 \| 1984309 \| 4.0336 \|
	\| 3.8733 \| 0.03 \| 2060629 \| 4.0318 \|
	\| 3.8696 \| 0.03 \| 2136949 \| 4.0307 \|
	\| 3.8654 \| 1.03 \| 2213269 \| 4.0296 \|
	\| 3.8611 \| 2.03 \| 2289589 \| 4.0286 \|
	\| 3.8572 \| 0.03 \| 2365909 \| 4.0275 \|
	\| 3.8535 \| 0.03 \| 2442229 \| 4.0267 \|
	\| 3.8476 \| 0.03 \| 2518549 \| 4.0260 \|
	\| 3.8458 \| 1.03 \| 2594869 \| 4.0250 \|
	\| 3.8425 \| 0.03 \| 2671189 \| 4.0245 \|
	\| 3.8468 \| 1.03 \| 2747509 \| 4.0237 \|
	\| 3.847 \| 2.03 \| 2823829 \| 4.0235 \|
	\| 3.8412 \| 0.03 \| 2900149 \| 4.0230 \|
	\| 3.8407 \| 1.03 \| 2976469 \| 4.0225 \|
	\| 3.8391 \| 2.02 \| 3052726 \| 4.0224 \|


	### Framework versions

	- Transformers 4.33.3
	- Pytorch 2.0.1
	- Datasets 2.12.0
	- Tokenizers 0.13.3