mistral-lp2-org_aug_b

Browse files

Files changed (3) hide show

README.md +20 -20
adapter_config.json +3 -3
adapter_model.safetensors +2 -2

README.md CHANGED Viewed

@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8643
-- F1 Micro: 0.6606
-- F1 Macro: 0.6479
-- F1 Weighted: 0.6611
 ## Model description
@@ -50,22 +50,22 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | F1 Micro | F1 Macro | F1 Weighted |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:--------:|:-----------:|
-| 1.6117        | 0.0154 | 25   | 1.3679          | 0.5541   | 0.4980   | 0.5313      |
-| 1.2903        | 0.0308 | 50   | 1.2094          | 0.6156   | 0.5779   | 0.6029      |
-| 1.164         | 0.0462 | 75   | 1.0987          | 0.6206   | 0.5931   | 0.6141      |
-| 1.1168        | 0.0615 | 100  | 1.1057          | 0.6376   | 0.5883   | 0.6165      |
-| 1.026         | 0.0769 | 125  | 0.9896          | 0.6314   | 0.6196   | 0.6328      |
-| 0.9481        | 0.0923 | 150  | 0.9619          | 0.6438   | 0.6173   | 0.6373      |
-| 0.9797        | 0.1077 | 175  | 0.9549          | 0.6514   | 0.6191   | 0.6411      |
-| 1.045         | 0.1231 | 200  | 0.9121          | 0.6541   | 0.6403   | 0.6543      |
-| 0.8954        | 0.1385 | 225  | 0.8991          | 0.6595   | 0.6418   | 0.6576      |
-| 0.9245        | 0.1538 | 250  | 0.8887          | 0.6588   | 0.6433   | 0.6580      |
-| 0.8636        | 0.1692 | 275  | 0.8824          | 0.6602   | 0.6458   | 0.6600      |
-| 0.846         | 0.1846 | 300  | 0.8793          | 0.6672   | 0.6451   | 0.6627      |
-| 0.8885        | 0.2    | 325  | 0.8820          | 0.6696   | 0.6431   | 0.6624      |
-| 0.8323        | 0.2154 | 350  | 0.8652          | 0.6618   | 0.6474   | 0.6616      |
-| 0.9313        | 0.2308 | 375  | 0.8654          | 0.6601   | 0.6477   | 0.6608      |
-| 0.857         | 0.2462 | 400  | 0.8643          | 0.6606   | 0.6479   | 0.6611      |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0305
+- F1 Micro: 0.7988
+- F1 Macro: 0.7745
+- F1 Weighted: 0.8091
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss | F1 Micro | F1 Macro | F1 Weighted |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:--------:|:-----------:|
+| 1.7847        | 0.0064 | 25   | 1.4983          | 0.7827   | 0.7547   | 0.7929      |
+| 1.3333        | 0.0127 | 50   | 1.2986          | 0.7926   | 0.7660   | 0.8031      |
+| 1.2721        | 0.0191 | 75   | 1.2255          | 0.7755   | 0.7520   | 0.7862      |
+| 1.127         | 0.0255 | 100  | 1.1722          | 0.7945   | 0.7694   | 0.8053      |
+| 1.1108        | 0.0318 | 125  | 1.1561          | 0.7922   | 0.7556   | 0.7971      |
+| 1.0969        | 0.0382 | 150  | 1.1181          | 0.7875   | 0.7581   | 0.7955      |
+| 1.0714        | 0.0446 | 175  | 1.1001          | 0.7884   | 0.7658   | 0.7993      |
+| 1.0219        | 0.0510 | 200  | 1.0758          | 0.8000   | 0.7727   | 0.8091      |
+| 1.0979        | 0.0573 | 225  | 1.0671          | 0.7973   | 0.7656   | 0.8040      |
+| 1.0846        | 0.0637 | 250  | 1.0632          | 0.7866   | 0.7582   | 0.7944      |
+| 0.9977        | 0.0701 | 275  | 1.0590          | 0.7934   | 0.7600   | 0.7991      |
+| 1.1262        | 0.0764 | 300  | 1.0404          | 0.7984   | 0.7699   | 0.8066      |
+| 1.0066        | 0.0828 | 325  | 1.0396          | 0.7981   | 0.7681   | 0.8053      |
+| 1.0534        | 0.0892 | 350  | 1.0360          | 0.8005   | 0.7768   | 0.8113      |
+| 1.0302        | 0.0955 | 375  | 1.0320          | 0.7993   | 0.7754   | 0.8099      |
+| 1.0965        | 0.1019 | 400  | 1.0305          | 0.7988   | 0.7745   | 0.8091      |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,10 +20,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj",
     "k_proj",
-    "o_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "k_proj",
+    "o_proj",
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0789a908ecfc7cdc60018ed5a135a610ddf9a4edd3e2dc3aebcfb909c83c39fe
-size 578881968

 version https://git-lfs.github.com/spec/v1
+oid sha256:dd16b2361941c5bc72f4955f731ac6362dc2d0c5c2146d96d059e1e7e8b15828
+size 578898352