End of training
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss:
+- Loss: 6.9251
 
 ## Model description
 
@@ -43,15 +43,18 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- num_epochs:
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.9655 | 7    |
+| No log        | 0.9655 | 7    | 7.3408          |
+| 6.9261        | 1.9310 | 14   | 7.0712          |
+| 6.1889        | 2.8966 | 21   | 6.9572          |
+| 6.1889        | 4.0    | 29   | 6.9370          |
+| 5.8938        | 4.8276 | 35   | 6.9251          |
 
 
 ### Framework versions
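
The hyperparameter list in the diff above maps fairly directly onto 🤗 Transformers `TrainingArguments`. The following is a minimal sketch for readers who want to set up a similar run, not the exact script behind this commit: the per-device batch size and gradient-accumulation split are assumptions (only their product, `total_train_batch_size: 64`, is reported), and `output_dir` is a placeholder.

```python
from transformers import TrainingArguments

# Minimal sketch of a TrainingArguments config matching the card's hyperparameters.
# Values marked "assumption" or "placeholder" are not stated in the card.
training_args = TrainingArguments(
    output_dir="gpt2-finetuned",    # placeholder, not from the card
    per_device_train_batch_size=8,  # assumption: 8 per device x 8 accumulation steps = 64 total
    gradient_accumulation_steps=8,  # assumption; only total_train_batch_size=64 is reported
    num_train_epochs=5,             # from the card
    lr_scheduler_type="cosine",     # from the card
    adam_beta1=0.9,                 # from the card
    adam_beta2=0.999,               # from the card
    adam_epsilon=1e-08,             # from the card
    fp16=True,                      # "Native AMP" mixed-precision training
)
```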
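
Assuming the reported loss is the Trainer's usual mean token-level cross-entropy (in nats), a rough evaluation perplexity can be read off by exponentiating it; the snippet below is just that arithmetic, not an evaluation script.

```python
import math

# The card reports an evaluation loss of 6.9251 (assumed cross-entropy, nats per token);
# for a causal language model, perplexity is exp(loss).
eval_loss = 6.9251
print(f"perplexity ≈ {math.exp(eval_loss):.1f}")  # ≈ 1017.4
```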