Grigorij
/

mistral_instruct_generation

Generated from Trainer

Model card Files Files and versions

Grigorij commited on Jan 4, 2024

Commit

e0bbcdb

·

1 Parent(s): 5bb1d8a

Model save

Files changed (1) hide show

README.md +17 -19

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7197
 ## Model description
@@ -44,29 +44,27 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
-- training_steps: 170
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.784         | 0.4   | 10   | 2.0951          |
-| 1.6947        | 0.8   | 20   | 1.2317          |
-| 1.0477        | 1.2   | 30   | 0.9772          |
-| 0.9452        | 1.6   | 40   | 0.9071          |
-| 0.9092        | 2.0   | 50   | 0.8293          |
-| 0.7453        | 2.4   | 60   | 0.7538          |
-| 0.7201        | 2.8   | 70   | 0.6689          |
-| 0.5659        | 3.2   | 80   | 0.6524          |
-| 0.5609        | 3.6   | 90   | 0.6494          |
-| 0.6039        | 4.0   | 100  | 0.6349          |
-| 0.5073        | 4.4   | 110  | 0.6542          |
-| 0.4997        | 4.8   | 120  | 0.6404          |
-| 0.495         | 5.2   | 130  | 0.6557          |
-| 0.453         | 5.6   | 140  | 0.6815          |
-| 0.4803        | 6.0   | 150  | 0.6361          |
-| 0.4233        | 6.4   | 160  | 0.7303          |
-| 0.4209        | 6.8   | 170  | 0.7197          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5788
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
+- training_steps: 150
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.0261        | 0.4   | 10   | 2.8697          |
+| 2.2616        | 0.8   | 20   | 1.6009          |
+| 1.1871        | 1.2   | 30   | 0.9599          |
+| 0.8522        | 1.6   | 40   | 0.7228          |
+| 0.7375        | 2.0   | 50   | 0.6601          |
+| 0.5916        | 2.4   | 60   | 0.6184          |
+| 0.6219        | 2.8   | 70   | 0.5957          |
+| 0.5025        | 3.2   | 80   | 0.5980          |
+| 0.5148        | 3.6   | 90   | 0.5849          |
+| 0.5502        | 4.0   | 100  | 0.5639          |
+| 0.4414        | 4.4   | 110  | 0.5875          |
+| 0.4423        | 4.8   | 120  | 0.5847          |
+| 0.43          | 5.2   | 130  | 0.5902          |
+| 0.3843        | 5.6   | 140  | 0.6223          |
+| 0.4173        | 6.0   | 150  | 0.5788          |
 ### Framework versions