End of training

Files changed (3)

README.md +21 -9
model.safetensors +1 -1
runs/Oct02_05-00-35_67bb1d30543c/events.out.tfevents.1727845239.67bb1d30543c.706.11 +2 -2

README.md

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0431
-- Wer Score: 1.1979
 ## Model description
@@ -37,21 +37,33 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Wer Score |
-|:-------------:|:------:|:----:|:---------------:|:---------:|
-| 0.0043        | 8.3333 | 50   | 0.0431          | 1.1979    |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0627
+- Wer Score: 8.5567
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 30
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss | Wer Score |
+|:-------------:|:-------:|:----:|:---------------:|:---------:|
+| 2.2863        | 2.1277  | 50   | 0.3771          | 0.4680    |
+| 0.1088        | 4.2553  | 100  | 0.0445          | 0.4631    |
+| 0.0219        | 6.3830  | 150  | 0.0438          | 0.4483    |
+| 0.0152        | 8.5106  | 200  | 0.0437          | 0.4532    |
+| 0.0124        | 10.6383 | 250  | 0.0474          | 0.4877    |
+| 0.0101        | 12.7660 | 300  | 0.0499          | 2.7241    |
+| 0.008         | 14.8936 | 350  | 0.0512          | 4.0493    |
+| 0.0064        | 17.0213 | 400  | 0.0535          | 5.2857    |
+| 0.0039        | 19.1489 | 450  | 0.0574          | 7.3103    |
+| 0.0025        | 21.2766 | 500  | 0.0587          | 7.6847    |
+| 0.0015        | 23.4043 | 550  | 0.0620          | 8.0443    |
+| 0.0011        | 25.5319 | 600  | 0.0617          | 9.0788    |
+| 0.0009        | 27.6596 | 650  | 0.0627          | 8.5567    |
 ### Framework versions
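
Note on the results table: the updated README reports the last evaluation row (step 650, loss 0.0627, WER 8.5567) as the headline result, while the table shows both metrics bottoming out much earlier. A minimal sketch for picking the best row, with the values copied by hand from the table in this commit; the names `ROWS` and `best_by` are illustrative and not part of the training code:

```python
# Evaluation rows copied from the "Training results" table in this commit.
# Each tuple is (step, epoch, validation_loss, wer_score).
ROWS = [
    (50, 2.1277, 0.3771, 0.4680),
    (100, 4.2553, 0.0445, 0.4631),
    (150, 6.3830, 0.0438, 0.4483),
    (200, 8.5106, 0.0437, 0.4532),
    (250, 10.6383, 0.0474, 0.4877),
    (300, 12.7660, 0.0499, 2.7241),
    (350, 14.8936, 0.0512, 4.0493),
    (400, 17.0213, 0.0535, 5.2857),
    (450, 19.1489, 0.0574, 7.3103),
    (500, 21.2766, 0.0587, 7.6847),
    (550, 23.4043, 0.0620, 8.0443),
    (600, 25.5319, 0.0617, 9.0788),
    (650, 27.6596, 0.0627, 8.5567),
]

STEP, EPOCH, VAL_LOSS, WER = range(4)


def best_by(rows, column):
    """Return the row with the smallest value in the given column."""
    return min(rows, key=lambda row: row[column])


best_wer = best_by(ROWS, WER)
best_loss = best_by(ROWS, VAL_LOSS)
final = ROWS[-1]

print(f"lowest WER:      step {best_wer[STEP]} (WER {best_wer[WER]})")
print(f"lowest val loss: step {best_loss[STEP]} (loss {best_loss[VAL_LOSS]})")
print(f"final row:       step {final[STEP]} (WER {final[WER]})")
```

Training loss keeps falling after step 250 while validation loss and WER rise, which is consistent with overfitting. The commit does not show whether intermediate checkpoints were saved, so whether the step-150 or step-200 weights can be recovered is an open question.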

model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8b36584469887ae33939a1e5197f8c5a8797640436d90e398b9600251eba097b
 size 706516040

 version https://git-lfs.github.com/spec/v1
+oid sha256:b54c4ee58e89c866217abb67359b3b31b46e252d9a28b6540dc55576b03e572e
 size 706516040

runs/Oct02_05-00-35_67bb1d30543c/events.out.tfevents.1727845239.67bb1d30543c.706.11

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:102467b5c3fa1194b9061216af7a3388aef320f04cb7ccb4e188282fc84bc57e
-size 12062

 version https://git-lfs.github.com/spec/v1
+oid sha256:5ea8a31a71dfdffb77d5382a7a138ee62f07911f817507eae643a7a426fef065
+size 12416
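
The two binary files above are stored as Git LFS pointers, so the diff only shows a changed `oid` (a SHA-256 digest) and `size`. For `model.safetensors` the size is unchanged at 706516040 bytes while the digest differs, which is what one would expect from retrained weights of the same architecture. A rough sketch for checking a downloaded file against a pointer, using only the standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are made up for this example:

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file has the pointer's size and SHA-256 digest."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for model.safetensors, copied from this commit.
NEW_MODEL_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b54c4ee58e89c866217abb67359b3b31b46e252d9a28b6540dc55576b03e572e
size 706516040
"""

pointer = parse_lfs_pointer(NEW_MODEL_POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy (the path is an assumption about where the file was downloaded) would confirm that the file corresponds to this commit rather than the previous one.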