frankmorales2020 commited on
Commit
45a978d
·
verified ·
1 Parent(s): 0130469

Model save

Browse files
Files changed (1) hide show
  1. README.md +11 -11
README.md CHANGED
@@ -19,9 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 4.8939
23
- - Bleu: 0.4256
24
- - F1: 0.9813
 
 
25
 
26
  ## Model description
27
 
@@ -49,17 +51,15 @@ The following hyperparameters were used during training:
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: constant
51
  - lr_scheduler_warmup_ratio: 0.03
52
- - num_epochs: 5
53
 
54
  ### Training results
55
 
56
- | Training Loss | Epoch | Step | Validation Loss | Bleu | F1 |
57
- |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
58
- | 6.5778 | 1.0 | 25 | 4.9328 | 0.2183 | 0.9821 |
59
- | 4.9001 | 2.0 | 50 | 4.8955 | 0.3297 | 0.9820 |
60
- | 4.8728 | 3.0 | 75 | 4.8911 | 0.4116 | 0.9816 |
61
- | 4.8634 | 4.0 | 100 | 4.8931 | 0.4004 | 0.9813 |
62
- | 4.8598 | 5.0 | 125 | 4.8939 | 0.4256 | 0.9813 |
63
 
64
 
65
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 10.2218
23
+ - Bleu: 0.3549
24
+ - Rougel: 0.4821
25
+ - F1: 0.0005
26
+ - Perplexity: 30611.9414
27
 
28
  ## Model description
29
 
 
51
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: constant
53
  - lr_scheduler_warmup_ratio: 0.03
54
+ - num_epochs: 3
55
 
56
  ### Training results
57
 
58
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Rougel | F1 | Perplexity |
59
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:----------:|
60
+ | 10.2565 | 1.0 | 25 | 10.2347 | 0.2507 | 0.3703 | 0.0009 | 29825.1777 |
61
+ | 10.1919 | 2.0 | 50 | 10.2232 | 0.3097 | 0.4539 | 0.0008 | 30276.8613 |
62
+ | 10.1739 | 3.0 | 75 | 10.2218 | 0.3549 | 0.4821 | 0.0005 | 30611.9414 |
 
 
63
 
64
 
65
  ### Framework versions