Luca9100 commited on
Commit
5aafaf6
·
verified ·
1 Parent(s): f515a87

End of training

Browse files
Files changed (1) hide show
  1. README.md +7 -7
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  library_name: peft
3
  license: other
4
- base_model: deepseek-ai/deepseek-coder-7b-instruct
5
  tags:
6
  - generated_from_trainer
7
  model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # NLP
16
 
17
- This model is a fine-tuned version of [deepseek-ai/deepseek-coder-7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-7b-instruct) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.6407
20
 
21
  ## Model description
22
 
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
- | No log | 1.0 | 300 | 1.7307 |
54
- | 1.8979 | 2.0 | 600 | 1.6580 |
55
- | 1.8979 | 3.0 | 900 | 1.6407 |
56
 
57
 
58
  ### Framework versions
@@ -60,5 +60,5 @@ The following hyperparameters were used during training:
60
  - PEFT 0.15.2
61
  - Transformers 4.51.3
62
  - Pytorch 2.6.0+cu124
63
- - Datasets 3.5.1
64
  - Tokenizers 0.21.1
 
1
  ---
2
  library_name: peft
3
  license: other
4
+ base_model: deepseek-ai/deepseek-coder-33b-instruct
5
  tags:
6
  - generated_from_trainer
7
  model-index:
 
14
 
15
  # NLP
16
 
17
+ This model is a fine-tuned version of [deepseek-ai/deepseek-coder-33b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.4692
20
 
21
  ## Model description
22
 
 
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
+ | No log | 1.0 | 300 | 1.5509 |
54
+ | 1.7095 | 2.0 | 600 | 1.4852 |
55
+ | 1.7095 | 3.0 | 900 | 1.4692 |
56
 
57
 
58
  ### Framework versions
 
60
  - PEFT 0.15.2
61
  - Transformers 4.51.3
62
  - Pytorch 2.6.0+cu124
63
+ - Datasets 3.6.0
64
  - Tokenizers 0.21.1