lillybak/mistral-7binstruct-summary-100s

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.5951
 ## Model description
@@ -45,14 +45,19 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
-- lr_scheduler_warmup_steps: 2
-- training_steps: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.7489        | 0.08  | 10   | 1.5951          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4327
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
+- lr_scheduler_warmup_steps: 0.03
+- training_steps: 125
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.7572        | 0.17  | 20   | 1.5089          |
+| 1.5374        | 0.34  | 40   | 1.4566          |
+| 1.4774        | 0.51  | 60   | 1.4456          |
+| 1.5517        | 0.68  | 80   | 1.4398          |
+| 1.5103        | 0.85  | 100  | 1.4347          |
+| 1.4976        | 1.02  | 120  | 1.4327          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5217f764be78fb61e7c8eb130d9da235f856717619021a27606e76c6375021d6
 size 27280152

 version https://git-lfs.github.com/spec/v1
+oid sha256:e0ad9b2c83ad86fb1ceca5cc43c7a1339b0160b1389763fe3ffda29ead074ba7
 size 27280152

runs/Mar01_18-32-34_e5ccb9cdd337/events.out.tfevents.1709317955.e5ccb9cdd337.2969.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0ce8dad1129b4ac3ac71bf19bb6d76b76a099a0c9c40de67b46dd1e379685259
+size 8238

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c1d6f64e30b14cc94e2303d53562000be29480f76fc6f5787d21002c98b9090
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:a2a7e3d80807f33f36db8cd2093e0f105934cec6246227ed60c3609b552f01ce
 size 4920