cotysong113
/

my_awesome_eli5_clm-model

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Metrics Training metrics Community

cotysong113 commited on Dec 15, 2024

Commit

6f5ece8

·

verified ·

1 Parent(s): 2a930e9

End of training

Files changed (2) hide show

README.md +6 -6
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7019
 ## Model description
@@ -49,14 +49,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 273  | 3.7034          |
-| 3.7201        | 2.0   | 546  | 3.7009          |
-| 3.7201        | 3.0   | 819  | 3.7019          |
 ### Framework versions
 - Transformers 4.47.0
-- Pytorch 2.5.1
-- Datasets 3.1.0
 - Tokenizers 0.21.0

 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.7168
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 274  | 3.7314          |
+| 3.8123        | 2.0   | 548  | 3.7202          |
+| 3.8123        | 3.0   | 822  | 3.7168          |
 ### Framework versions
 - Transformers 4.47.0
+- Pytorch 2.5.1+cu124
+- Datasets 2.21.0
 - Tokenizers 0.21.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3cccca07435ee476b7ac0d61ba8cd058518f89adb11bc409c67e33a5c7000b8a
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:3760aa5c3e042acb000dbb07b08e73ea538c151fca440f0ef69ba2672fb195c6
 size 327657928