sahlebrahim committed
Commit cc951b3 · verified · 1 Parent(s): 4fbbdee

Model save

Files changed (1)
  1. README.md +7 -7
README.md CHANGED
@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the dialogstudio dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1617
+- Loss: 1.8479
 
 ## Model description
 
@@ -45,17 +45,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
-- num_epochs: 7
+- num_epochs: 3
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.7993 | 1.4 | 77 | 1.8588 |
-| 1.5367 | 2.8 | 154 | 1.8706 |
-| 1.1499 | 4.2 | 231 | 2.0429 |
-| 1.1036 | 5.6 | 308 | 2.1262 |
-| 0.9217 | 7.0 | 385 | 2.1617 |
+| 1.8459 | 0.6 | 33 | 1.8965 |
+| 1.7921 | 1.2 | 66 | 1.8626 |
+| 1.7595 | 1.8 | 99 | 1.8452 |
+| 1.6381 | 2.4 | 132 | 1.8500 |
+| 1.6185 | 3.0 | 165 | 1.8479 |
 
 
 ### Framework versions
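
For readers comparing the two runs in this diff, the following is a minimal sketch (not part of the commit) that reads the numbers straight from the two "Training results" tables. It finds the evaluation step with the lowest validation loss in each run, estimates the steps per epoch, and measures how far the final validation loss sits above the best one. The names `OLD_RUN`, `NEW_RUN`, `best_row`, `steps_per_epoch`, and `final_gap` are illustrative assumptions, not identifiers from the repository.

```python
# Rows copied from the removed (-) and added (+) table rows in the diff:
# (training_loss, epoch, step, validation_loss)
OLD_RUN = [  # num_epochs: 7
    (1.7993, 1.4, 77, 1.8588),
    (1.5367, 2.8, 154, 1.8706),
    (1.1499, 4.2, 231, 2.0429),
    (1.1036, 5.6, 308, 2.1262),
    (0.9217, 7.0, 385, 2.1617),
]
NEW_RUN = [  # num_epochs: 3
    (1.8459, 0.6, 33, 1.8965),
    (1.7921, 1.2, 66, 1.8626),
    (1.7595, 1.8, 99, 1.8452),
    (1.6381, 2.4, 132, 1.8500),
    (1.6185, 3.0, 165, 1.8479),
]


def best_row(rows):
    """Return the logged row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row."""
    _, epoch, step, _ = rows[-1]
    return round(step / epoch)


def final_gap(rows):
    """Final validation loss minus the best validation loss (4 decimals)."""
    return round(rows[-1][3] - best_row(rows)[3], 4)


if __name__ == "__main__":
    for name, rows in (("old", OLD_RUN), ("new", NEW_RUN)):
        _, epoch, step, val = best_row(rows)
        print(
            f"{name}: best val loss {val} at step {step} (epoch {epoch}), "
            f"{steps_per_epoch(rows)} steps/epoch, final gap {final_gap(rows)}"
        )
```

In both versions of the card, the headline "Loss" equals the validation loss of the last table row, not the lowest logged value. In the 7-epoch run the validation loss climbs after the first evaluation while the training loss keeps falling, which is consistent with overfitting. The 3-epoch run ends close to its best logged value.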