frankmorales2020
/

Mistral-7B-v0.1_AviationQA

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

frankmorales2020 commited on Mar 5

Commit

45a978d

·

verified ·

1 Parent(s): 0130469

Model save

Files changed (1) hide show

README.md +11 -11

README.md CHANGED Viewed

@@ -19,9 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.8939
-- Bleu: 0.4256
-- F1: 0.9813
 ## Model description
@@ -49,17 +51,15 @@ The following hyperparameters were used during training:
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_ratio: 0.03
-- num_epochs: 5
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Bleu   | F1     |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
-| 6.5778        | 1.0   | 25   | 4.9328          | 0.2183 | 0.9821 |
-| 4.9001        | 2.0   | 50   | 4.8955          | 0.3297 | 0.9820 |
-| 4.8728        | 3.0   | 75   | 4.8911          | 0.4116 | 0.9816 |
-| 4.8634        | 4.0   | 100  | 4.8931          | 0.4004 | 0.9813 |
-| 4.8598        | 5.0   | 125  | 4.8939          | 0.4256 | 0.9813 |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 10.2218
+- Bleu: 0.3549
+- Rougel: 0.4821
+- F1: 0.0005
+- Perplexity: 30611.9414
 ## Model description
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_ratio: 0.03
+- num_epochs: 3
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bleu   | Rougel | F1     | Perplexity |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:----------:|
+| 10.2565       | 1.0   | 25   | 10.2347         | 0.2507 | 0.3703 | 0.0009 | 29825.1777 |
+| 10.1919       | 2.0   | 50   | 10.2232         | 0.3097 | 0.4539 | 0.0008 | 30276.8613 |
+| 10.1739       | 3.0   | 75   | 10.2218         | 0.3549 | 0.4821 | 0.0005 | 30611.9414 |
 ### Framework versions