koshirowada commited on
Commit
d17bade
·
verified ·
1 Parent(s): f85d51c

End of training

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [EleutherAI/pythia-14m](https://huggingface.co/EleutherAI/pythia-14m) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 2.5974
18
 
19
  ## Model description
20
 
@@ -34,8 +34,8 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
- - train_batch_size: 4
38
- - eval_batch_size: 8
39
  - seed: 42
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
@@ -43,11 +43,11 @@ The following hyperparameters were used during training:
43
 
44
  ### Training results
45
 
46
- | Training Loss | Epoch | Step | Validation Loss |
47
- |:-------------:|:-----:|:-----:|:---------------:|
48
- | 2.6472 | 1.0 | 4750 | 2.9652 |
49
- | 2.3754 | 2.0 | 9500 | 2.7470 |
50
- | 1.9951 | 3.0 | 14250 | 2.5974 |
51
 
52
 
53
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [EleutherAI/pythia-14m](https://huggingface.co/EleutherAI/pythia-14m) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 2.4367
18
 
19
  ## Model description
20
 
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
+ - train_batch_size: 32
38
+ - eval_batch_size: 64
39
  - seed: 42
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
 
43
 
44
  ### Training results
45
 
46
+ | Training Loss | Epoch | Step | Validation Loss |
47
+ |:-------------:|:-----:|:----:|:---------------:|
48
+ | 2.5074 | 1.0 | 594 | 2.5707 |
49
+ | 2.2083 | 2.0 | 1188 | 2.4775 |
50
+ | 2.1035 | 3.0 | 1782 | 2.4367 |
51
 
52
 
53
  ### Framework versions