UNISG-MCS
/

NLP

Luca9100 commited on May 11

Commit

bafed37

verified ·

1 Parent(s): 90dcd04

End of training

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 library_name: peft
 license: other
-base_model: deepseek-ai/deepseek-coder-33b-instruct
 tags:
 - generated_from_trainer
 model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
 # NLP
-This model is a fine-tuned version of [deepseek-ai/deepseek-coder-33b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4692
 ## Model description
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 300  | 1.5509          |
-| 1.7095        | 2.0   | 600  | 1.4852          |
-| 1.7095        | 3.0   | 900  | 1.4692          |
 ### Framework versions

 ---
 library_name: peft
 license: other
+base_model: deepseek-ai/deepseek-coder-7b-instruct
 tags:
 - generated_from_trainer
 model-index:
 # NLP
+This model is a fine-tuned version of [deepseek-ai/deepseek-coder-7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-7b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6388
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 300  | 1.7294          |
+| 1.8968        | 2.0   | 600  | 1.6561          |
+| 1.8968        | 3.0   | 900  | 1.6388          |
 ### Framework versions