UNISG-MCS/NLP

Luca9100 committed on May 10

Commit 5aafaf6 (verified) · 1 parent: f515a87

End of training

Files changed (1)

README.md CHANGED

@@ -1,7 +1,7 @@
 ---
 library_name: peft
 license: other
-base_model: deepseek-ai/deepseek-coder-7b-instruct
+base_model: deepseek-ai/deepseek-coder-33b-instruct
 tags:
 - generated_from_trainer
 model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
 # NLP
-This model is a fine-tuned version of [deepseek-ai/deepseek-coder-7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-7b-instruct) on the None dataset.
+This model is a fine-tuned version of [deepseek-ai/deepseek-coder-33b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6407
+- Loss: 1.4692
 ## Model description
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 300  | 1.7307          |
-| 1.8979        | 2.0   | 600  | 1.6580          |
-| 1.8979        | 3.0   | 900  | 1.6407          |
+| No log        | 1.0   | 300  | 1.5509          |
+| 1.7095        | 2.0   | 600  | 1.4852          |
+| 1.7095        | 3.0   | 900  | 1.4692          |
 ### Framework versions
@@ -60,5 +60,5 @@ The following hyperparameters were used during training:
 - PEFT 0.15.2
 - Transformers 4.51.3
 - Pytorch 2.6.0+cu124
-- Datasets 3.5.1
+- Datasets 3.6.0
 - Tokenizers 0.21.1
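
Reviewer note: to put the change in final validation loss (1.6407 to 1.4692) in more familiar terms, the sketch below converts both values to perplexity. It assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual case for causal language model fine-tuning but is not stated in the model card. The constant and function names are illustrative only. The diff also does not say whether both runs used the same evaluation set, so the comparison is indicative at best.

```python
import math

# Final validation losses copied from the diff (epoch 3.0, step 900).
LOSS_7B = 1.6407   # previous card: deepseek-coder-7b-instruct base
LOSS_33B = 1.4692  # updated card: deepseek-coder-33b-instruct base


def perplexity(mean_nll_nats: float) -> float:
    """Convert a mean per-token loss to perplexity.

    Assumption: the loss is cross-entropy measured in nats.
    """
    return math.exp(mean_nll_nats)


ppl_7b = perplexity(LOSS_7B)
ppl_33b = perplexity(LOSS_33B)

# Absolute drop in loss and relative drop in perplexity between the two cards.
loss_drop = LOSS_7B - LOSS_33B
rel_ppl_drop = 1.0 - ppl_33b / ppl_7b

print(f"perplexity (7b base):  {ppl_7b:.3f}")
print(f"perplexity (33b base): {ppl_33b:.3f}")
print(f"loss drop: {loss_drop:.4f}, relative perplexity drop: {rel_ppl_drop:.1%}")
```

Because perplexity is the exponential of the loss, the relative perplexity drop depends only on the loss difference, not on the two absolute values.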