irishzhang committed on
Commit 0ce19c7 · 1 Parent(s): fb30a13

End of training

Files changed (2)
  1. README.md +5 -5
  2. pytorch_model.bin +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7124
+- Loss: 3.9182
 
 ## Model description
 
@@ -34,7 +34,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
+- learning_rate: 0.0002
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -46,9 +46,9 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.8725 | 1.0 | 1132 | 3.7322 |
-| 3.7786 | 2.0 | 2264 | 3.7163 |
-| 3.7259 | 3.0 | 3396 | 3.7124 |
+| No log | 1.0 | 228 | 3.8485 |
+| No log | 2.0 | 456 | 3.8676 |
+| 3.7056 | 3.0 | 684 | 3.9182 |
 
 
 ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:412979808c2fbc5a9741d4e4082d2afd0a87999f5280f1fbd11e1ea912461047
+oid sha256:e26a20f7e3fc65cf5199616b6c8558b59fe4d7bdd49b0e8a36cf2f07df571fdf
 size 327674194
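
Note on reading the README.md numbers: the diff reports losses and step counts but not what they imply. The sketch below is a rough reading aid, not part of the commit. It assumes the reported losses are mean per-token cross-entropy in nats (the usual convention for causal language models, not stated in the diff), so perplexity is exp(loss). It also assumes a single device and no gradient accumulation (neither appears in the hunks shown), so steps per epoch times train_batch_size bounds the training set size from above. All helper names are hypothetical.

```python
import math

# Values copied from the README.md diff in this commit.
OLD_RUN = {"steps_per_epoch": 1132, "val_loss": [3.7322, 3.7163, 3.7124]}
NEW_RUN = {"steps_per_epoch": 228, "val_loss": [3.8485, 3.8676, 3.9182]}
TRAIN_BATCH_SIZE = 8  # train_batch_size, unchanged by the commit


def perplexity(loss: float) -> float:
    """Perplexity, ASSUMING loss is mean per-token cross-entropy in nats."""
    return math.exp(loss)


def max_train_examples(steps_per_epoch: int, batch_size: int) -> int:
    """Upper bound on training examples per epoch.

    ASSUMES one device and no gradient accumulation; the last batch
    may be partial, so the true count can be slightly lower.
    """
    return steps_per_epoch * batch_size


def rises_every_epoch(val_losses: list[float]) -> bool:
    """True if validation loss increased at every epoch boundary."""
    return all(later > earlier for earlier, later in zip(val_losses, val_losses[1:]))


if __name__ == "__main__":
    for name, run in (("old", OLD_RUN), ("new", NEW_RUN)):
        final_loss = run["val_loss"][-1]
        print(
            name,
            f"final perplexity={perplexity(final_loss):.2f}",
            f"max examples={max_train_examples(run['steps_per_epoch'], TRAIN_BATCH_SIZE)}",
            f"val loss rises every epoch={rises_every_epoch(run['val_loss'])}",
        )
```

Under those assumptions, final validation perplexity moves from about 41 to about 50, the new run's validation loss rises at every epoch while the old run's fell, and the new training set is roughly a fifth the size of the previous one.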