jd0g/Mistral-7B-NLI-v0.1

Files changed (6)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-v0.1-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-v0.1-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3549
+- Loss: 0.6735
 ## Model description
@@ -44,16 +44,23 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs: 3
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 0.7334        | 0.992 | 31   | 0.3914          |
-| 0.3755        | 1.984 | 62   | 0.3592          |
-| 0.348         | 2.976 | 93   | 0.3549          |
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 1.8799        | 0.9231 | 3    | 1.6354          |
+| 1.6206        | 1.8462 | 6    | 1.3630          |
+| 1.3224        | 2.7692 | 9    | 1.1313          |
+| 0.8177        | 4.0    | 13   | 0.9223          |
+| 0.9144        | 4.9231 | 16   | 0.8115          |
+| 0.801         | 5.8462 | 19   | 0.7444          |
+| 0.7393        | 6.7692 | 22   | 0.7097          |
+| 0.5279        | 8.0    | 26   | 0.6836          |
+| 0.6872        | 8.9231 | 29   | 0.6745          |
+| 0.4589        | 9.2308 | 30   | 0.6735          |
 ### Framework versions
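
The Epoch and Step columns of the new training-results table can be cross-checked with a little arithmetic. The sketch below is not part of the commit; it assumes the fractional epoch is simply the step count divided by a constant number of optimizer steps per epoch, and it derives that constant from the row logged at exactly epoch 4.0 (step 13). The helper name `epoch_at` is hypothetical.

```python
# (step, epoch) pairs copied from the new training-results table.
rows = [
    (3, 0.9231), (6, 1.8462), (9, 2.7692), (13, 4.0), (16, 4.9231),
    (19, 5.8462), (22, 6.7692), (26, 8.0), (29, 8.9231), (30, 9.2308),
]

# Assumption: a constant number of optimizer steps per epoch.
# Step 13 is logged at exactly epoch 4.0, which fixes the ratio.
steps_per_epoch = 13 / 4.0


def epoch_at(step: int) -> float:
    """Fractional epoch implied by a step count, rounded like the table."""
    return round(step / steps_per_epoch, 4)


# Rows whose logged epoch disagrees with the constant-ratio assumption.
mismatches = [(s, e) for s, e in rows if abs(epoch_at(s) - e) > 1e-4]

print("steps per epoch:", steps_per_epoch)
print("mismatching rows:", mismatches)
print("epoch at final step:", epoch_at(rows[-1][0]))
```

Under that assumption every logged row is consistent with 3.25 steps per epoch, and the last logged step (30) corresponds to roughly epoch 9.23 rather than a full 10.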

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60bcd217837893d8bc57b929d3816e0d2ce1552f7dc581097bd9c7f2b9e833e8
+oid sha256:186a8f496eea24fe331caa52f969f5ce4ee60952d14104dc3c4e38fbb66e5026
 size 4203824

runs/Apr25_13-03-59_8ce629014f90/events.out.tfevents.1714050239.8ce629014f90.284.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:729fb2458cb87f4d0c1251e3a75ca1dacc79e36fe3fb37258b7376f43aec97d1
+size 5330

runs/Apr25_13-04-30_8ce629014f90/events.out.tfevents.1714050271.8ce629014f90.284.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:f56b50b400d5366d38d61dde62a2fa8b41025d47baf3db01ab2c85147878c8e0
+size 5330

runs/Apr25_13-05-49_8ce629014f90/events.out.tfevents.1714050349.8ce629014f90.3864.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:fdbf8abfdf2669a2cab51716cb761fd992dfe5e837a52353757d3f7c1c5e7d47
+size 10408

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:de351c59c2e0a2e4cdcf07e7ba361eb5e866e4c5848eb10e3e5be204d4502329
+oid sha256:1ccb48c341715f03da45d6376940a73d2f4b4ac536ed63d81b2b56365c52b7f6
 size 4984
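
The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, the object ID (hash algorithm and digest), and the size in bytes. As a rough illustration of how to read such a diff, the sketch below parses the old and new pointers for `training_args.bin` and compares them. The helper `parse_lfs_pointer` is hypothetical and handles only the three fields shown here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    # Each non-empty line is "<key> <value>", split on the first space.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for training_args.bin, copied from the diff.
old = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:de351c59c2e0a2e4cdcf07e7ba361eb5e866e4c5848eb10e3e5be204d4502329
size 4984
""")
new = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:1ccb48c341715f03da45d6376940a73d2f4b4ac536ed63d81b2b56365c52b7f6
size 4984
""")

# Same byte size but a different digest: the content changed in place.
print("size unchanged:", old["size"] == new["size"])
print("content changed:", old["digest"] != new["digest"])
```

The same reading applies to `adapter_model.safetensors`: its size stays at 4203824 bytes while the digest changes, so the file was replaced by one of identical length.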