JuwonOh committed
Commit 23c0bf2 · Parent(s): cbac854

update model card README.md

Files changed (1):
  1. README.md +6 -3
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
- - Loss: 0.4919
+ - Loss: 0.0078
 
 ## Model description
 
@@ -42,16 +42,19 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 1000
- - num_epochs: 10
+ - num_epochs: 100
 - mixed_precision_training: Native AMP
 
 ### Training results
 
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:----:|:---------------:|
+ | 0.3681        | 74.63 | 5000 | 0.0080          |
 
 
 ### Framework versions
 
 - Transformers 4.26.1
- - Pytorch 1.7.1+cu110
+ - Pytorch 1.10.1+cu111
 - Datasets 2.10.1
 - Tokenizers 0.13.2
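The card specifies `lr_scheduler_type: cosine` with `lr_scheduler_warmup_steps: 1000`. A minimal sketch of that schedule's shape (the base learning rate and total step count below are placeholders, not values taken from the card):

```python
import math

def cosine_schedule_with_warmup(step, base_lr, warmup_steps, total_steps):
    """Linear warmup to base_lr over warmup_steps, then cosine decay to 0
    by total_steps -- the behavior implied by the card's scheduler settings."""
    if step < warmup_steps:
        # Linear warmup: LR rises from 0 to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Cosine decay from base_lr down to 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))

# Example with the card's 1000 warmup steps; 1e-4 and 6700 are illustrative.
lr_at_warmup_end = cosine_schedule_with_warmup(1000, 1e-4, 1000, 6700)
```

With these settings the learning rate peaks exactly at step 1000 and decays smoothly to zero by the final step, which is why long runs (here, num_epochs: 100) pair naturally with a fixed warmup count rather than a warmup ratio.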