Trkkk
/

git-base-bdd100k

@@ -1,76 +1,121 @@
----
-base_model: microsoft/git-base
-library_name: transformers
-license: mit
-tags:
-- generated_from_trainer
-model-index:
-- name: git-base-bdd100k
-  results: []
----
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# git-base-bdd100k
-This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.4317
-- Wer Score: 0.7406
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 0.0003
-- train_batch_size: 10
-- eval_batch_size: 10
-- seed: 42
-- gradient_accumulation_steps: 2
-- total_train_batch_size: 20
-- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type: linear
-- num_epochs: 15
-- mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer Score |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|
-| 10.8372       | 1.0   | 3    | 10.0676         | 12.2594   |
-| 9.8981        | 2.0   | 6    | 8.6733          | 9.4194    |
-| 8.1063        | 3.0   | 9    | 6.8677          | 0.8955    |
-| 6.5282        | 4.0   | 12   | 5.5489          | 3.5342    |
-| 5.2838        | 5.0   | 15   | 4.3715          | 2.2077    |
-| 4.16          | 6.0   | 18   | 3.3203          | 3.32      |
-| 3.1554        | 7.0   | 21   | 2.3917          | 1.5897    |
-| 2.2691        | 8.0   | 24   | 1.6361          | 0.7832    |
-| 1.553         | 9.0   | 27   | 1.1034          | 0.7703    |
-| 1.0453        | 10.0  | 30   | 0.7820          | 0.7781    |
-| 0.7256        | 11.0  | 33   | 0.6073          | 0.7703    |
-| 0.5425        | 12.0  | 36   | 0.5168          | 0.7548    |
-| 0.4393        | 13.0  | 39   | 0.4689          | 0.7419    |
-| 0.3801        | 14.0  | 42   | 0.4449          | 0.7445    |
-| 0.3404        | 15.0  | 45   | 0.4317          | 0.7406    |
-### Framework versions
-- Transformers 4.46.0.dev0
-- Pytorch 2.0.1+cu117
-- Datasets 3.0.1
-- Tokenizers 0.20.1

+---
+library_name: transformers
+license: mit
+base_model: microsoft/git-base
+tags:
+- generated_from_trainer
+model-index:
+- name: git-base-bdd100k
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# git-base-bdd100k
+This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.7580
+- Wer Score: 2.4791
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 25
+- eval_batch_size: 25
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 50
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 60
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer Score |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|
+| 10.5887       | 1.0   | 3    | 9.3421          | 6.9151    |
+| 9.214         | 2.0   | 6    | 8.8516          | 6.0480    |
+| 8.7067        | 3.0   | 9    | 8.2609          | 5.4057    |
+| 8.1888        | 4.0   | 12   | 7.8240          | 6.3589    |
+| 7.7891        | 5.0   | 15   | 7.4809          | 7.0127    |
+| 7.4625        | 6.0   | 18   | 7.1805          | 8.0121    |
+| 7.1743        | 7.0   | 21   | 6.9085          | 8.0926    |
+| 6.9065        | 8.0   | 24   | 6.6502          | 7.8826    |
+| 6.6509        | 9.0   | 27   | 6.4007          | 6.7806    |
+| 6.4049        | 10.0  | 30   | 6.1576          | 5.3401    |
+| 6.163         | 11.0  | 33   | 5.9207          | 3.7040    |
+| 5.9257        | 12.0  | 36   | 5.6873          | 2.8456    |
+| 5.6928        | 13.0  | 39   | 5.4572          | 2.7166    |
+| 5.4619        | 14.0  | 42   | 5.2319          | 2.2856    |
+| 5.235         | 15.0  | 45   | 5.0112          | 2.5138    |
+| 5.0122        | 16.0  | 48   | 4.7931          | 2.0353    |
+| 4.7935        | 17.0  | 51   | 4.5815          | 2.0843    |
+| 4.5784        | 18.0  | 54   | 4.3751          | 2.1378    |
+| 4.3684        | 19.0  | 57   | 4.1720          | 1.9609    |
+| 4.1622        | 20.0  | 60   | 3.9752          | 1.9994    |
+| 3.9616        | 21.0  | 63   | 3.7828          | 2.1312    |
+| 3.764         | 22.0  | 66   | 3.5945          | 2.1163    |
+| 3.5738        | 23.0  | 69   | 3.4124          | 2.1417    |
+| 3.3868        | 24.0  | 72   | 3.2380          | 2.2707    |
+| 3.2067        | 25.0  | 75   | 3.0658          | 2.2205    |
+| 3.0298        | 26.0  | 78   | 2.9021          | 2.2029    |
+| 2.8614        | 27.0  | 81   | 2.7425          | 2.3682    |
+| 2.6981        | 28.0  | 84   | 2.5918          | 2.2133    |
+| 2.5412        | 29.0  | 87   | 2.4445          | 2.2889    |
+| 2.3899        | 30.0  | 90   | 2.3042          | 2.2795    |
+| 2.2443        | 31.0  | 93   | 2.1726          | 2.3831    |
+| 2.1068        | 32.0  | 96   | 2.0445          | 2.3649    |
+| 1.975         | 33.0  | 99   | 1.9276          | 2.3291    |
+| 1.8509        | 34.0  | 102  | 1.8173          | 2.3252    |
+| 1.733         | 35.0  | 105  | 1.7116          | 2.3809    |
+| 1.6231        | 36.0  | 108  | 1.6166          | 2.3743    |
+| 1.5204        | 37.0  | 111  | 1.5221          | 2.4256    |
+| 1.4227        | 38.0  | 114  | 1.4396          | 2.4305    |
+| 1.3334        | 39.0  | 117  | 1.3620          | 2.5766    |
+| 1.2509        | 40.0  | 120  | 1.2913          | 2.4140    |
+| 1.1736        | 41.0  | 123  | 1.2291          | 2.4140    |
+| 1.1027        | 42.0  | 126  | 1.1664          | 2.4162    |
+| 1.0378        | 43.0  | 129  | 1.1151          | 2.4531    |
+| 0.9774        | 44.0  | 132  | 1.0686          | 2.4013    |
+| 0.9234        | 45.0  | 135  | 1.0257          | 2.4548    |
+| 0.8731        | 46.0  | 138  | 0.9856          | 2.4603    |
+| 0.8301        | 47.0  | 141  | 0.9499          | 2.5463    |
+| 0.7886        | 48.0  | 144  | 0.9213          | 2.3953    |
+| 0.7511        | 49.0  | 147  | 0.8932          | 2.5083    |
+| 0.7193        | 50.0  | 150  | 0.8675          | 2.4542    |
+| 0.6894        | 51.0  | 153  | 0.8475          | 2.4713    |
+| 0.664         | 52.0  | 156  | 0.8284          | 2.4030    |
+| 0.6405        | 53.0  | 159  | 0.8146          | 2.4548    |
+| 0.6205        | 54.0  | 162  | 0.7990          | 2.5424    |
+| 0.6042        | 55.0  | 165  | 0.7881          | 2.4961    |
+| 0.5893        | 56.0  | 168  | 0.7785          | 2.4664    |
+| 0.5766        | 57.0  | 171  | 0.7710          | 2.4598    |
+| 0.5664        | 58.0  | 174  | 0.7650          | 2.4564    |
+| 0.5598        | 59.0  | 177  | 0.7613          | 2.4895    |
+| 0.5539        | 60.0  | 180  | 0.7580          | 2.4791    |
+### Framework versions
+- Transformers 4.45.2
+- Pytorch 2.1.0+cu118
+- Datasets 3.0.1
+- Tokenizers 0.20.1

generation_config.json CHANGED Viewed

@@ -1,7 +1,7 @@
-{
-  "_from_model_config": true,
-  "bos_token_id": 101,
-  "eos_token_id": 102,
-  "pad_token_id": 0,
-  "transformers_version": "4.46.0.dev0"
-}

+{
+  "_from_model_config": true,
+  "bos_token_id": 101,
+  "eos_token_id": 102,
+  "pad_token_id": 0,
+  "transformers_version": "4.45.2"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e75e8688a40c9d96faa6c02427f95df2466642c893f7a5157951c3d99539813e
 size 706516040

 version https://git-lfs.github.com/spec/v1
+oid sha256:0e06eca1e70144ebcaed81fb57ea39a29ad6764cf3bd572e4e99b821e60e4a58
 size 706516040