stojchet
/

jk1

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

stojchet commited on Jul 18, 2024

Commit

69a6a32

·

verified ·

1 Parent(s): f725401

End of training

Files changed (1) hide show

README.md +7 -7

README.md CHANGED Viewed

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4472
-- Eval/rewards/chosen: -3.2197
-- Eval/logps/chosen: -135.3643
-- Eval/rewards/rejected: -13.7858
-- Eval/logps/rejected: -268.6870
-- Eval/rewards/margins: 10.5660
 - Eval/kl: 0.0
 ## Model description
@@ -59,7 +59,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |     |
 |:-------------:|:------:|:----:|:---------------:|:---:|
-| 0.1139        | 1.7058 | 100  | 0.4472          | 0.0 |
 ### Framework versions

 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4491
+- Eval/rewards/chosen: -3.2146
+- Eval/logps/chosen: -135.3127
+- Eval/rewards/rejected: -13.3743
+- Eval/logps/rejected: -264.5731
+- Eval/rewards/margins: 10.1598
 - Eval/kl: 0.0
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |     |
 |:-------------:|:------:|:----:|:---------------:|:---:|
+| 0.1138        | 1.7058 | 100  | 0.4491          | 0.0 |
 ### Framework versions