cotysong113 committed on
Commit 6f5ece8 · verified · 1 Parent(s): 2a930e9

End of training

Files changed (2)
  1. README.md +6 -6
  2. model.safetensors +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
  It achieves the following results on the evaluation set:
- - Loss: 3.7019
+ - Loss: 3.7168
 
  ## Model description
 
@@ -49,14 +49,14 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | No log | 1.0 | 273 | 3.7034 |
- | 3.7201 | 2.0 | 546 | 3.7009 |
- | 3.7201 | 3.0 | 819 | 3.7019 |
+ | No log | 1.0 | 274 | 3.7314 |
+ | 3.8123 | 2.0 | 548 | 3.7202 |
+ | 3.8123 | 3.0 | 822 | 3.7168 |
 
 
  ### Framework versions
 
  - Transformers 4.47.0
- - Pytorch 2.5.1
- - Datasets 3.1.0
+ - Pytorch 2.5.1+cu124
+ - Datasets 2.21.0
  - Tokenizers 0.21.0
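
Review note: the evaluation loss in the README.md diff moves from 3.7019 to 3.7168. For a causal language model this number is often easier to compare as perplexity. The sketch below assumes the reported loss is a mean per-token cross-entropy in nats, which the model card does not state explicitly; the variable and function names are illustrative only.

```python
import math

# Evaluation losses copied from the removed and added lines of the README.md diff.
old_eval_loss = 3.7019
new_eval_loss = 3.7168


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (assumed to be in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


old_ppl = perplexity(old_eval_loss)
new_ppl = perplexity(new_eval_loss)

# The ratio shows the relative change in perplexity between the two runs.
print(f"old: {old_ppl:.2f}  new: {new_ppl:.2f}  ratio: {new_ppl / old_ppl:.4f}")
```

Under that assumption the two runs differ by roughly 1.5% in perplexity, which may be within run-to-run variation given that the Datasets version and the step count per epoch also changed.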
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3cccca07435ee476b7ac0d61ba8cd058518f89adb11bc409c67e33a5c7000b8a
+ oid sha256:3760aa5c3e042acb000dbb07b08e73ea538c151fca440f0ef69ba2672fb195c6
  size 327657928
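
Review note: model.safetensors is stored as a Git LFS pointer, so the diff only shows the `oid` changing while `size` stays at 327657928 bytes. A downloaded copy of the weights can be checked against the pointer by hashing it. This is a minimal sketch; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Hugging Face or Git LFS tool, and the local file path is something you would supply yourself.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(
        line.strip().split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's byte size and hash both match the pointer."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the added side of the model.safetensors diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:3760aa5c3e042acb000dbb07b08e73ea538c151fca440f0ef69ba2672fb195c6
size 327657928
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

With a local copy of the weights, `matches_pointer(Path("model.safetensors"), pointer)` would report whether the file corresponds to this commit rather than its parent.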