shawgpt-ft-lr2e-05-wd0.001

Files changed (3) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.1471
 ## Model description
@@ -52,15 +52,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 25.5434       | 0.5714 | 1    | 4.2401          |
-| 25.8648       | 1.5714 | 2    | 4.2344          |
-| 25.5005       | 2.5714 | 3    | 4.2157          |
-| 25.4701       | 3.5714 | 4    | 4.1987          |
-| 25.2595       | 4.5714 | 5    | 4.1841          |
-| 25.199        | 5.5714 | 6    | 4.1720          |
-| 25.1152       | 6.5714 | 7    | 4.1624          |
-| 25.2176       | 7.5714 | 8    | 4.1549          |
-| 25.1768       | 8.5714 | 9    | 4.1497          |
-| 16.3551       | 9.5714 | 10   | 4.1471          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.1499
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 25.5434       | 0.5714 | 1    | 4.2401          |
+| 25.8658       | 1.5714 | 2    | 4.2350          |
+| 25.5046       | 2.5714 | 3    | 4.2167          |
+| 25.48         | 3.5714 | 4    | 4.2006          |
+| 25.2746       | 4.5714 | 5    | 4.1864          |
+| 25.2152       | 5.5714 | 6    | 4.1744          |
+| 25.1326       | 6.5714 | 7    | 4.1646          |
+| 25.2366       | 7.5714 | 8    | 4.1574          |
+| 25.1957       | 8.5714 | 9    | 4.1522          |
+| 16.3685       | 9.5714 | 10   | 4.1499          |
 ### Framework versions

runs/Feb18_20-08-13_9a3887f9873e/events.out.tfevents.1739909293.9a3887f9873e.3474.18 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:be2ef5deeefb1009e707a9a5438d7ff7869a9eda05c7823ed2fbab79a37ef118
+size 10847

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5946e9099210bd7ac2e26d3b39a3e530ae19a0aa9cd4a0d43e4c3912831f450b
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:7ad0b17c3549fa415bdc4e6560e9dd19624d3505aaeff66ce0827daeeafcee4a
 size 5368