Luca9100 committed on
Commit bafed37 · verified · 1 parent: 90dcd04

End of training

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -1,7 +1,7 @@
 ---
 library_name: peft
 license: other
-base_model: deepseek-ai/deepseek-coder-33b-instruct
+base_model: deepseek-ai/deepseek-coder-7b-instruct
 tags:
 - generated_from_trainer
 model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 # NLP
 
-This model is a fine-tuned version of [deepseek-ai/deepseek-coder-33b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct) on the None dataset.
+This model is a fine-tuned version of [deepseek-ai/deepseek-coder-7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-7b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4692
+- Loss: 1.6388
 
 ## Model description
 
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 1.0 | 300 | 1.5509 |
-| 1.7095 | 2.0 | 600 | 1.4852 |
-| 1.7095 | 3.0 | 900 | 1.4692 |
+| No log | 1.0 | 300 | 1.7294 |
+| 1.8968 | 2.0 | 600 | 1.6561 |
+| 1.8968 | 3.0 | 900 | 1.6388 |
 
 
 ### Framework versions