shawgpt-ft-epoch-17

Browse files

Files changed (3) hide show

README.md +20 -20
runs/Feb18_19-17-59_9a3887f9873e/events.out.tfevents.1739906279.9a3887f9873e.3474.6 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.6168
 ## Model description
@@ -51,29 +51,29 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
-| 8.5018        | 0.5714  | 1    | 4.2401          |
-| 8.5358        | 1.5714  | 2    | 4.1591          |
-| 8.0877        | 2.5714  | 3    | 3.9716          |
-| 7.7629        | 3.5714  | 4    | 3.7879          |
-| 7.3443        | 4.5714  | 5    | 3.6202          |
-| 7.0525        | 5.5714  | 6    | 3.4621          |
-| 6.7639        | 6.5714  | 7    | 3.3163          |
-| 6.4843        | 7.5714  | 8    | 3.1840          |
-| 6.2574        | 8.5714  | 9    | 3.0666          |
-| 6.0887        | 9.5714  | 10   | 2.9645          |
-| 5.8105        | 10.5714 | 11   | 2.8762          |
-| 5.6722        | 11.5714 | 12   | 2.8019          |
-| 5.538         | 12.5714 | 13   | 2.7407          |
-| 5.4225        | 13.5714 | 14   | 2.6918          |
-| 5.3219        | 14.5714 | 15   | 2.6548          |
-| 5.2683        | 15.5714 | 16   | 2.6297          |
-| 2.6273        | 16.5714 | 17   | 2.6168          |
 ### Framework versions
 - PEFT 0.14.0
-- Transformers 4.48.3
-- Pytorch 2.5.1+cu124
 - Datasets 3.3.1
 - Tokenizers 0.21.0

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.6155
 ## Model description
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
+| 25.5434       | 0.5714  | 1    | 4.2401          |
+| 25.6938       | 1.5714  | 2    | 4.1565          |
+| 24.646        | 2.5714  | 3    | 3.9657          |
+| 23.5063       | 3.5714  | 4    | 3.7821          |
+| 22.2803       | 4.5714  | 5    | 3.6134          |
+| 21.3242       | 5.5714  | 6    | 3.4549          |
+| 20.3798       | 6.5714  | 7    | 3.3075          |
+| 19.658        | 7.5714  | 8    | 3.1749          |
+| 18.9316       | 8.5714  | 9    | 3.0579          |
+| 18.1952       | 9.5714  | 10   | 2.9563          |
+| 17.5537       | 10.5714 | 11   | 2.8690          |
+| 17.0554       | 11.5714 | 12   | 2.7957          |
+| 16.6773       | 12.5714 | 13   | 2.7354          |
+| 16.3041       | 13.5714 | 14   | 2.6879          |
+| 15.9872       | 14.5714 | 15   | 2.6520          |
+| 15.7942       | 15.5714 | 16   | 2.6279          |
+| 10.5046       | 16.5714 | 17   | 2.6155          |
 ### Framework versions
 - PEFT 0.14.0
+- Transformers 4.47.1
+- Pytorch 2.5.1+cu121
 - Datasets 3.3.1
 - Tokenizers 0.21.0

runs/Feb18_19-17-59_9a3887f9873e/events.out.tfevents.1739906279.9a3887f9873e.3474.6 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b75b916ccd96e87f5b15b77e2b7e0c8d4b65262e77f5c659e69cf524a9ad55e
+size 14137

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a43561b7d9eb4946dc4ae3a4dbeb826674d9d177a9e4121b581dfb63836f0789
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:4e5e1b4231f9ec15c59266c9e9531f234583cf3cca0e3a2d75960600d846942c
 size 5304