stojchet committed on
Commit 69a6a32 · verified · 1 Parent(s): f725401

End of training

Files changed (1)
  1. README.md +7 -7
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4472
- - Eval/rewards/chosen: -3.2197
- - Eval/logps/chosen: -135.3643
- - Eval/rewards/rejected: -13.7858
- - Eval/logps/rejected: -268.6870
- - Eval/rewards/margins: 10.5660
+ - Loss: 0.4491
+ - Eval/rewards/chosen: -3.2146
+ - Eval/logps/chosen: -135.3127
+ - Eval/rewards/rejected: -13.3743
+ - Eval/logps/rejected: -264.5731
+ - Eval/rewards/margins: 10.1598
  - Eval/kl: 0.0
 
  ## Model description
@@ -59,7 +59,7 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | |
  |:-------------:|:------:|:----:|:---------------:|:---:|
- | 0.1139 | 1.7058 | 100 | 0.4472 | 0.0 |
+ | 0.1138 | 1.7058 | 100 | 0.4491 | 0.0 |
 
 
  ### Framework versions
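
A quick sanity check on the numbers in this diff, as a minimal sketch: it assumes that `Eval/rewards/margins` is the mean chosen reward minus the mean rejected reward, which is how preference-tuning trainers commonly define it but is not stated in the README itself. The dictionary names `old` and `new` and the helper `margin` are illustrative only; the values are copied from the removed and added lines above.

```python
# Evaluation metrics copied from the removed (-) and added (+) lines of the diff.
old = {"rewards/chosen": -3.2197, "rewards/rejected": -13.7858, "rewards/margins": 10.5660}
new = {"rewards/chosen": -3.2146, "rewards/rejected": -13.3743, "rewards/margins": 10.1598}


def margin(metrics):
    """Assumed definition: chosen reward minus rejected reward."""
    return metrics["rewards/chosen"] - metrics["rewards/rejected"]


for label, metrics in (("old", old), ("new", new)):
    computed = margin(metrics)
    reported = metrics["rewards/margins"]
    # The README rounds each metric to four decimals, so a small gap is expected.
    print(f"{label}: computed {computed:.4f}, reported {reported:.4f}")
```

Under that assumption, the computed and reported margins agree to within rounding for both versions of the README, so the commit reflects a consistent re-evaluation rather than a change to a single figure.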