TheBloke/CodeLlama-7B-Instruct-AWQ-FaVe-rank32-2epochs-v2

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/CodeLlama-7B-Instruct-AWQ](https://huggingface.co/TheBloke/CodeLlama-7B-Instruct-AWQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4073
 ## Model description
@@ -52,13 +52,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.2685 | 10   | 1.3934          |
-| 1.5978        | 0.5369 | 20   | 0.6803          |
-| 1.5978        | 0.8054 | 30   | 0.5716          |
-| 0.5694        | 1.0738 | 40   | 0.5086          |
-| 0.5694        | 1.3423 | 50   | 0.4432          |
-| 0.399         | 1.6107 | 60   | 0.4193          |
-| 0.399         | 1.8792 | 70   | 0.4073          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/CodeLlama-7B-Instruct-AWQ](https://huggingface.co/TheBloke/CodeLlama-7B-Instruct-AWQ) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4323
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| No log        | 0.2685 | 10   | 1.4509          |
+| 1.5667        | 0.5369 | 20   | 0.7012          |
+| 1.5667        | 0.8054 | 30   | 0.6017          |
+| 0.5581        | 1.0738 | 40   | 0.5068          |
+| 0.5581        | 1.3423 | 50   | 0.4670          |
+| 0.4108        | 1.6107 | 60   | 0.4462          |
+| 0.4108        | 1.8792 | 70   | 0.4323          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0d92caffd61b02f9dfd53278021742a16adac3fce12c78e1fbd052b3465bc9f6
 size 67126104

 version https://git-lfs.github.com/spec/v1
+oid sha256:5cfd5d2c9119c85b745ea155ee95083135ab5de1a0ee7b04c9e79b1019babd69
 size 67126104

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e62c3c20150901c635a5516eb51e97553fcbafc63c3a24ad2837029014ddecd5
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:3440519dfedb7d40d27c25c94d82b1aa7dbc10caa7e22e77de4f1265c709f3d3
 size 5112