End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 7.3033
-- Bleu: 0.6555
-- Gen Len: 12.15
 ## Model description
@@ -49,18 +49,18 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
-| No log        | 1.0   | 125  | 7.9945          | 0.0    | 19.0    |
-| No log        | 2.0   | 250  | 7.8580          | 0.0325 | 14.53   |
-| No log        | 3.0   | 375  | 7.6932          | 0.028  | 13.445  |
-| 7.9046        | 4.0   | 500  | 7.5535          | 0.0495 | 15.705  |
-| 7.9046        | 5.0   | 625  | 7.4692          | 0.4831 | 14.0    |
-| 7.9046        | 6.0   | 750  | 7.3836          | 0.5158 | 14.055  |
-| 7.9046        | 7.0   | 875  | 7.3553          | 0.6008 | 12.765  |
-| 7.2416        | 8.0   | 1000 | 7.3290          | 0.4591 | 11.815  |
-| 7.2416        | 9.0   | 1125 | 7.3026          | 0.709  | 13.095  |
-| 7.2416        | 10.0  | 1250 | 7.3033          | 0.6555 | 12.15   |
 ### Framework versions

 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.4274
+- Bleu: 4.4727
+- Gen Len: 16.0017
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step   | Validation Loss | Bleu   | Gen Len |
+|:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
+| 5.0           | 1.0   | 17734  | 4.7335          | 2.2286 | 15.5907 |
+| 4.4395        | 2.0   | 35468  | 4.2401          | 2.9281 | 15.7406 |
+| 4.1509        | 3.0   | 53202  | 3.9709          | 3.206  | 16.1203 |
+| 3.9609        | 4.0   | 70936  | 3.7968          | 3.6191 | 15.8338 |
+| 3.8746        | 5.0   | 88670  | 3.6712          | 3.8795 | 16.0679 |
+| 3.7316        | 6.0   | 106404 | 3.5811          | 3.9517 | 15.9977 |
+| 3.7038        | 7.0   | 124138 | 3.5185          | 4.2873 | 16.0255 |
+| 3.5782        | 8.0   | 141872 | 3.4695          | 4.3817 | 16.0927 |
+| 3.5957        | 9.0   | 159606 | 3.4387          | 4.4197 | 16.0783 |
+| 3.564         | 10.0  | 177340 | 3.4274          | 4.4727 | 16.0017 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8506b123037a7460afd80b7411bfd9bb6fddaa34813280521f638ee144ac38db
 size 191081512

 version https://git-lfs.github.com/spec/v1
+oid sha256:14d93818b254100cb7503c2e4353164b9d45ae107bdab3e7aa64b3376840ad3e
 size 191081512

runs/Apr27_09-29-00_a13489f1ea0f/events.out.tfevents.1714210141.a13489f1ea0f.24.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:91ba13dfac2f0ed7bcc77b62faccaef800ab2962b9e03c8275993a28a5f50fd7
+size 85666

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e14d1f10456295e290544a3e00f9120e42ebea4a043f30cece639eb2e76a278e
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:75f7ea09b70d0e93b7fe6de0bab2000d62989352d1636f4a12be1cb2f817986f
 size 5048