eskayML/electra_interview_new

Files changed (6) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mrm8488/electra-small-finetuned-squadv2](https://huggingface.co/mrm8488/electra-small-finetuned-squadv2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0631
-- Accuracy: 0.5556
 ## Model description
@@ -50,21 +50,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 54   | 2.2794          | 0.1667   |
-| No log        | 2.0   | 108  | 2.2574          | 0.3241   |
-| No log        | 3.0   | 162  | 2.2288          | 0.3148   |
-| No log        | 4.0   | 216  | 2.1978          | 0.3796   |
-| No log        | 5.0   | 270  | 2.1605          | 0.4537   |
-| No log        | 6.0   | 324  | 2.1250          | 0.4537   |
-| No log        | 7.0   | 378  | 2.1010          | 0.4444   |
-| No log        | 8.0   | 432  | 2.0799          | 0.5093   |
-| No log        | 9.0   | 486  | 2.0676          | 0.5185   |
-| 2.1978        | 10.0  | 540  | 2.0631          | 0.5556   |
 ### Framework versions
-- Transformers 4.46.2
 - Pytorch 2.5.1+cu121
-- Datasets 3.1.0
-- Tokenizers 0.20.3

 This model is a fine-tuned version of [mrm8488/electra-small-finetuned-squadv2](https://huggingface.co/mrm8488/electra-small-finetuned-squadv2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4590
+- Accuracy: 0.6164
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 2.782         | 1.0   | 579  | 2.6037          | 0.1983   |
+| 2.5903        | 2.0   | 1158 | 2.4794          | 0.1983   |
+| 2.5114        | 3.0   | 1737 | 2.3349          | 0.2586   |
+| 2.3676        | 4.0   | 2316 | 2.1538          | 0.4569   |
+| 2.2466        | 5.0   | 2895 | 1.9574          | 0.4526   |
+| 2.0461        | 6.0   | 3474 | 1.7796          | 0.5690   |
+| 1.7791        | 7.0   | 4053 | 1.6913          | 0.5776   |
+| 1.7205        | 8.0   | 4632 | 1.5485          | 0.5733   |
+| 1.59          | 9.0   | 5211 | 1.4805          | 0.6121   |
+| 1.5614        | 10.0  | 5790 | 1.4590          | 0.6164   |
 ### Framework versions
+- Transformers 4.47.1
 - Pytorch 2.5.1+cu121
+- Datasets 3.2.0
+- Tokenizers 0.21.0

config.json CHANGED Viewed

@@ -10,30 +10,50 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 256,
   "id2label": {
-    "0": "Provider Characteristics",
-    "1": "Finanicial Impact",
-    "2": "Imaging modalities in general",
-    "3": "Clinical utility & efficiency-Provider perspective",
-    "4": "Health System Characteristics",
-    "5": "Training",
-    "6": "Value equation",
-    "7": "Workflow related problems",
-    "8": "Credentialing / Quality Assurance Infrastructure",
-    "9": "Patient/Physican interaction in LUS"
   },
   "initializer_range": 0.02,
   "intermediate_size": 1024,
   "label2id": {
-    "Clinical utility & efficiency-Provider perspective": 3,
-    "Credentialing / Quality Assurance Infrastructure": 8,
-    "Finanicial Impact": 1,
-    "Health System Characteristics": 4,
-    "Imaging modalities in general": 2,
-    "Patient/Physican interaction in LUS": 9,
-    "Provider Characteristics": 0,
-    "Training": 5,
-    "Value equation": 6,
-    "Workflow related problems": 7
   },
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,
@@ -48,7 +68,7 @@
   "summary_type": "first",
   "summary_use_proj": true,
   "torch_dtype": "float32",
-  "transformers_version": "4.46.2",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

   "hidden_dropout_prob": 0.1,
   "hidden_size": 256,
   "id2label": {
+    "0": 0,
+    "1": 1,
+    "2": 2,
+    "3": 3,
+    "4": 4,
+    "5": 5,
+    "6": 6,
+    "7": 7,
+    "8": 8,
+    "9": 9,
+    "10": 10,
+    "11": 11,
+    "12": 12,
+    "13": 13,
+    "14": 14,
+    "15": 15,
+    "16": 16,
+    "17": 17,
+    "18": 18,
+    "19": 19
   },
   "initializer_range": 0.02,
   "intermediate_size": 1024,
   "label2id": {
+    "0": 0,
+    "1": 1,
+    "2": 2,
+    "3": 3,
+    "4": 4,
+    "5": 5,
+    "6": 6,
+    "7": 7,
+    "8": 8,
+    "9": 9,
+    "10": 10,
+    "11": 11,
+    "12": 12,
+    "13": 13,
+    "14": 14,
+    "15": 15,
+    "16": 16,
+    "17": 17,
+    "18": 18,
+    "19": 19
   },
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,
   "summary_type": "first",
   "summary_use_proj": true,
   "torch_dtype": "float32",
+  "transformers_version": "4.47.1",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9c3989cc54663ab8a4685264596ef604d10917172939f260f605377a0a2ec728
-size 54229432

 version https://git-lfs.github.com/spec/v1
+oid sha256:5678dd28a6ad932eda4117ba2efc7a01bb83e3a55364844c3a361908bb1a5748
+size 54239712

runs/Dec19_00-55-11_8bd16529ee80/events.out.tfevents.1734569714.8bd16529ee80.968.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d56c9ec70887777f4a799317b53fd2f5be4c2dbbf7ffca476fe1a7fd4dcdf8b3
+size 5777

tokenizer_config.json CHANGED Viewed

@@ -45,6 +45,7 @@
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,
   "do_lower_case": true,
   "mask_token": "[MASK]",
   "max_length": 512,
   "model_max_length": 512,

   "cls_token": "[CLS]",
   "do_basic_tokenize": true,
   "do_lower_case": true,
+  "extra_special_tokens": {},
   "mask_token": "[MASK]",
   "max_length": 512,
   "model_max_length": 512,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cb6f33767ac8212e22b50aa60d39f04f60fa3b7704f3f0e45b980d635315c2d1
-size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:696c8c84281637804e271799054d0fb3f501144c057011eeb79843e891e858f0
+size 5304