End of training


Files changed (6)

README.md +30 -25
all_results.json +8 -8
predict_results.json +8 -8
predictions.txt +0 -0
runs/Nov17_22-14-34_347f825b15ea/events.out.tfevents.1731881675.347f825b15ea.3217.1 +2 -2
runs/Nov17_22-14-34_347f825b15ea/events.out.tfevents.1731884167.347f825b15ea.3217.2 +3 -0

README.md CHANGED

@@ -22,16 +22,16 @@ model-index:
     metrics:
     - name: Precision
       type: precision
-      value: 0.9399720800372267
     - name: Recall
       type: recall
-      value: 0.9465791940018744
     - name: F1
       type: f1
-      value: 0.9432640672425869
     - name: Accuracy
       type: accuracy
-      value: 0.9813340410474168
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -41,11 +41,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [layoutlmv3](https://huggingface.co/layoutlmv3) on the mp-02/cord-sroie dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0970
-- Precision: 0.9400
-- Recall: 0.9466
-- F1: 0.9433
-- Accuracy: 0.9813
 ## Model description
@@ -65,29 +65,34 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - training_steps: 4000
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
-|:-------------:|:-------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| No log        | 2.2222  | 100  | 0.3258          | 0.8171    | 0.7685 | 0.7921 | 0.9363   |
-| No log        | 4.4444  | 200  | 0.1516          | 0.9078    | 0.8946 | 0.9011 | 0.9694   |
-| No log        | 6.6667  | 300  | 0.1085          | 0.9315    | 0.9175 | 0.9245 | 0.9761   |
-| No log        | 8.8889  | 400  | 0.1000          | 0.9382    | 0.9456 | 0.9419 | 0.9817   |
-| 0.4015        | 11.1111 | 500  | 0.0970          | 0.9400    | 0.9466 | 0.9433 | 0.9813   |
-| 0.4015        | 13.3333 | 600  | 0.1064          | 0.9505    | 0.9358 | 0.9431 | 0.9814   |
-| 0.4015        | 15.5556 | 700  | 0.1095          | 0.9465    | 0.9372 | 0.9418 | 0.9812   |
 ### Framework versions
-- Transformers 4.44.2
-- Pytorch 2.4.0+cu118
-- Datasets 2.21.0
-- Tokenizers 0.19.1

     metrics:
     - name: Precision
       type: precision
+      value: 0.9036656236030398
     - name: Recall
       type: recall
+      value: 0.9578298981284056
     - name: F1
       type: f1
+      value: 0.9299597469810236
     - name: Accuracy
       type: accuracy
+      value: 0.9736783204261605
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [layoutlmv3](https://huggingface.co/layoutlmv3) on the mp-02/cord-sroie dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0967
+- Precision: 0.9037
+- Recall: 0.9578
+- F1: 0.9300
+- Accuracy: 0.9737
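The updated F1 can be cross-checked against the reported precision and recall. This is a minimal sketch, assuming the metrics are micro-averaged so that F1 is the harmonic mean of the two; the helper name `f1_from` is hypothetical and not part of the training code.

```python
def f1_from(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the updated model-index metadata in README.md.
precision = 0.9036656236030398
recall = 0.9578298981284056
reported_f1 = 0.9299597469810236

f1 = f1_from(precision, recall)
print(f"recomputed F1: {f1:.6f}, reported F1: {reported_f1:.6f}")
```

The same check applies to the `predict_*` values in all_results.json.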
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - training_steps: 4000
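To make the `linear` scheduler setting concrete, the sketch below computes the learning rate at a few steps. It assumes zero warmup (no warmup value is listed in the card) and decay to zero at `training_steps`; the function name `linear_lr` is hypothetical.

```python
def linear_lr(step: int, base_lr: float = 2e-05, total_steps: int = 4000) -> float:
    """Learning rate under linear decay to zero, assuming no warmup."""
    remaining = max(0, total_steps - step)
    return base_lr * remaining / total_steps


# Steps chosen to match the start, two logged evaluation steps, and the configured end.
for step in (0, 1000, 1200, 4000):
    print(step, linear_lr(step))
```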
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 0.7937 | 100  | 0.4387          | 0.6226    | 0.5785 | 0.5998 | 0.9037   |
+| No log        | 1.5873 | 200  | 0.2236          | 0.8925    | 0.8439 | 0.8675 | 0.9562   |
+| No log        | 2.3810 | 300  | 0.1342          | 0.9127    | 0.8965 | 0.9045 | 0.9692   |
+| No log        | 3.1746 | 400  | 0.1054          | 0.9119    | 0.9273 | 0.9195 | 0.9735   |
+| 0.6635        | 3.9683 | 500  | 0.1341          | 0.8555    | 0.9495 | 0.9001 | 0.9630   |
+| 0.6635        | 4.7619 | 600  | 0.1060          | 0.9059    | 0.9493 | 0.9271 | 0.9739   |
+| 0.6635        | 5.5556 | 700  | 0.1066          | 0.9080    | 0.9420 | 0.9247 | 0.9738   |
+| 0.6635        | 6.3492 | 800  | 0.1008          | 0.9078    | 0.9564 | 0.9315 | 0.9746   |
+| 0.6635        | 7.1429 | 900  | 0.0988          | 0.9086    | 0.9517 | 0.9296 | 0.9738   |
+| 0.0995        | 7.9365 | 1000 | 0.0967          | 0.9037    | 0.9578 | 0.9300 | 0.9737   |
+| 0.0995        | 8.7302 | 1100 | 0.1224          | 0.8777    | 0.9642 | 0.9189 | 0.9690   |
+| 0.0995        | 9.5238 | 1200 | 0.1263          | 0.8879    | 0.9536 | 0.9196 | 0.9694   |
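The headline evaluation numbers in the card (Loss 0.0967, F1 0.9300) match the step-1000 row. A small sketch over the new table, with values copied by hand, shows that this row has the lowest validation loss while the highest F1 is at step 800. The steps-per-epoch figure is an estimate derived from the rounded Epoch column, not a value stated in the card.

```python
# (epoch, step, validation_loss, f1) copied from the updated training results table.
rows = [
    (0.7937, 100, 0.4387, 0.5998),
    (1.5873, 200, 0.2236, 0.8675),
    (2.3810, 300, 0.1342, 0.9045),
    (3.1746, 400, 0.1054, 0.9195),
    (3.9683, 500, 0.1341, 0.9001),
    (4.7619, 600, 0.1060, 0.9271),
    (5.5556, 700, 0.1066, 0.9247),
    (6.3492, 800, 0.1008, 0.9315),
    (7.1429, 900, 0.0988, 0.9296),
    (7.9365, 1000, 0.0967, 0.9300),
    (8.7302, 1100, 0.1224, 0.9189),
    (9.5238, 1200, 0.1263, 0.9196),
]

best_by_loss = min(rows, key=lambda r: r[2])  # lowest validation loss
best_by_f1 = max(rows, key=lambda r: r[3])    # highest F1

# Optimizer steps per epoch, estimated from step / epoch.
steps_per_epoch = round(best_by_loss[1] / best_by_loss[0])

print("lowest validation loss at step", best_by_loss[1])
print("highest F1 at step", best_by_f1[1])
print("estimated steps per epoch:", steps_per_epoch)
```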
 ### Framework versions
+- Transformers 4.46.2
+- Pytorch 2.5.1+cu121
+- Datasets 3.1.0
+- Tokenizers 0.20.3

all_results.json CHANGED

@@ -1,10 +1,10 @@
 {
-    "predict_accuracy": 0.9703861414884352,
-    "predict_f1": 0.923658709524935,
-    "predict_loss": 0.14045372605323792,
-    "predict_precision": 0.9067285382830627,
-    "predict_recall": 0.941233140655106,
-    "predict_runtime": 12.1468,
-    "predict_samples_per_second": 11.279,
-    "predict_steps_per_second": 0.741
 }

 {
+    "predict_accuracy": 0.9776242335499222,
+    "predict_f1": 0.9420884632922936,
+    "predict_loss": 0.09829691052436829,
+    "predict_precision": 0.9247985675917636,
+    "predict_recall": 0.9600371747211895,
+    "predict_runtime": 37.955,
+    "predict_samples_per_second": 7.193,
+    "predict_steps_per_second": 0.922
 }
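The runtime and throughput fields can be used to estimate how many samples and batches the prediction run covered. This is a sketch only: the rates are rounded in the JSON, so the counts are estimates, and the batch size of 8 is taken from `eval_batch_size` in the updated README.md on the assumption of a single device.

```python
import math

# Values copied from the updated all_results.json.
results = {
    "predict_runtime": 37.955,
    "predict_samples_per_second": 7.193,
    "predict_steps_per_second": 0.922,
}
eval_batch_size = 8  # from the updated README.md; single device assumed

# Rate multiplied by runtime gives an approximate count.
est_samples = round(results["predict_runtime"] * results["predict_samples_per_second"])
est_steps = round(results["predict_runtime"] * results["predict_steps_per_second"])

print("estimated samples:", est_samples)
print("estimated steps:", est_steps)
print("batches implied by sample count:", math.ceil(est_samples / eval_batch_size))
```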

predict_results.json CHANGED

@@ -1,10 +1,10 @@
 {
-    "predict_accuracy": 0.9703861414884352,
-    "predict_f1": 0.923658709524935,
-    "predict_loss": 0.14045372605323792,
-    "predict_precision": 0.9067285382830627,
-    "predict_recall": 0.941233140655106,
-    "predict_runtime": 12.1468,
-    "predict_samples_per_second": 11.279,
-    "predict_steps_per_second": 0.741
 }

 {
+    "predict_accuracy": 0.9776242335499222,
+    "predict_f1": 0.9420884632922936,
+    "predict_loss": 0.09829691052436829,
+    "predict_precision": 0.9247985675917636,
+    "predict_recall": 0.9600371747211895,
+    "predict_runtime": 37.955,
+    "predict_samples_per_second": 7.193,
+    "predict_steps_per_second": 0.922
 }

predictions.txt CHANGED

The diff for this file is too large to render.

runs/Nov17_22-14-34_347f825b15ea/events.out.tfevents.1731881675.347f825b15ea.3217.1 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d6b6073a4262ac8fcb04c9cae175e9e94d58c26a59da99198456bb7ffe28c345
-size 13069

 version https://git-lfs.github.com/spec/v1
+oid sha256:7c2d30b02608afd522ef2602b1c2e4e8db78f49e0af43655dbdf776f6c1d88f7
+size 14367

runs/Nov17_22-14-34_347f825b15ea/events.out.tfevents.1731884167.347f825b15ea.3217.2 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8ca9fb8e32b34c0662eb44a0bc52d1012613c242673bd8602a8b993cffa9b0b7
+size 560
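The TensorBoard event files are stored as Git LFS pointers, so the diff shows only a three-line text stub (spec version, object hash, byte size) rather than the binary log. A minimal sketch of reading such a pointer follows; the function name `parse_lfs_pointer` is hypothetical, and the sketch handles only the three keys shown here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content of the newly added event file in this commit.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8ca9fb8e32b34c0662eb44a0bc52d1012613c242673bd8602a8b993cffa9b0b7
size 560
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```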