**kwargs

Files changed (11) hide show

README.md CHANGED Viewed

@@ -16,9 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7111
-- Exact Match Ratio: 0.0
-- Sequence Accuracy: 0.0
 ## Model description
@@ -38,20 +36,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Exact Match Ratio | Sequence Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:-----------------:|:-----------------:|
-| 1.3755        | 1.0   | 500  | 0.7111          | 0.0               | 0.0               |
-| 1.1658        | 2.0   | 1000 | 1.0461          | 0.0               | 0.0               |
-| 1.0217        | 3.0   | 1500 | 0.7995          | 0.0               | 0.0               |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6422
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 24
+- eval_batch_size: 24
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.7655        | 1.0   | 167  | 0.6422          |
+| 0.6496        | 2.0   | 334  | 0.6717          |
+| 0.6479        | 3.0   | 501  | 0.6706          |
+| 0.8804        | 4.0   | 668  | 0.8621          |
+| 0.8283        | 5.0   | 835  | 0.7388          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -26,10 +26,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "qkv_proj",
-    "down_proj",
     "gate_up_proj",
-    "o_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "o_proj",
     "gate_up_proj",
+    "qkv_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:23ff80ac0881d0b146286fe51c47ae8c659ded1fb958af8712edcd2195100df7
 size 888703384

 version https://git-lfs.github.com/spec/v1
+oid sha256:ede0c3d8772bde5cc0c9aa4f9d1561ba9ebfd4d0f82db956ba7f9390d6cc51a6
 size 888703384

runs/Dec06_23-06-28_default/events.out.tfevents.1733526389.default.1802.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ae9a5d43eb280fd8beff8228d2e6ec2a0139be91899afd4ef38a82acaf869d7b
+size 8409

runs/Dec06_23-08-25_default/events.out.tfevents.1733526505.default.2093.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e236dee036aefa0ce4549cbcf592eaf6248c4443da7a804c2b1e8fc0f5edf49
+size 8409

runs/Dec06_23-12-46_default/events.out.tfevents.1733526766.default.2379.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:3557c21060c159f2af6aafaf68aeb01e25308c9c92d40ed5187c473982aa5f57
+size 8409

runs/Dec06_23-14-02_default/events.out.tfevents.1733526842.default.2799.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:24a8c7065dabef127ba142558025ece4cf211acbf74e40e879b5890046c8bc1d
+size 8409

runs/Dec06_23-21-30_default/events.out.tfevents.1733527290.default.3178.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:06e5fe9e880333d6bf698e4927acbe3a5e6fec7141eb1d36438b57794bda4cd6
+size 10210

runs/Dec06_23-48-58_default/events.out.tfevents.1733528939.default.3685.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:96a04fcc2a4f0b5e530e08e15396abc827c8d66216adf29635177305d43b528d
+size 11173

runs/Dec06_23-48-58_default/events.out.tfevents.1733530992.default.3685.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:7a973a7f924f8a5d30127df64e0edee7517b5fee3dd5780911436da399417a7e
+size 359

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:12510a0bc410ceb3b1133e6fd17f26c3a9340e3895a0abf3208ec5cf3e778526
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:d668e3c511e10a8a666786c1b9ffdbe2254bb587205cb613b44ff9aab8b28d70
 size 5304