rujengelal commited on
Commit
9a8da27
·
verified ·
1 Parent(s): b8efea4

End of training

Browse files
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 7.3033
21
- - Bleu: 0.6555
22
- - Gen Len: 12.15
23
 
24
  ## Model description
25
 
@@ -49,18 +49,18 @@ The following hyperparameters were used during training:
49
 
50
  ### Training results
51
 
52
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
- |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
54
- | No log | 1.0 | 125 | 7.9945 | 0.0 | 19.0 |
55
- | No log | 2.0 | 250 | 7.8580 | 0.0325 | 14.53 |
56
- | No log | 3.0 | 375 | 7.6932 | 0.028 | 13.445 |
57
- | 7.9046 | 4.0 | 500 | 7.5535 | 0.0495 | 15.705 |
58
- | 7.9046 | 5.0 | 625 | 7.4692 | 0.4831 | 14.0 |
59
- | 7.9046 | 6.0 | 750 | 7.3836 | 0.5158 | 14.055 |
60
- | 7.9046 | 7.0 | 875 | 7.3553 | 0.6008 | 12.765 |
61
- | 7.2416 | 8.0 | 1000 | 7.3290 | 0.4591 | 11.815 |
62
- | 7.2416 | 9.0 | 1125 | 7.3026 | 0.709 | 13.095 |
63
- | 7.2416 | 10.0 | 1250 | 7.3033 | 0.6555 | 12.15 |
64
 
65
 
66
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 3.4274
21
+ - Bleu: 4.4727
22
+ - Gen Len: 16.0017
23
 
24
  ## Model description
25
 
 
49
 
50
  ### Training results
51
 
52
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
+ |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
54
+ | 5.0 | 1.0 | 17734 | 4.7335 | 2.2286 | 15.5907 |
55
+ | 4.4395 | 2.0 | 35468 | 4.2401 | 2.9281 | 15.7406 |
56
+ | 4.1509 | 3.0 | 53202 | 3.9709 | 3.206 | 16.1203 |
57
+ | 3.9609 | 4.0 | 70936 | 3.7968 | 3.6191 | 15.8338 |
58
+ | 3.8746 | 5.0 | 88670 | 3.6712 | 3.8795 | 16.0679 |
59
+ | 3.7316 | 6.0 | 106404 | 3.5811 | 3.9517 | 15.9977 |
60
+ | 3.7038 | 7.0 | 124138 | 3.5185 | 4.2873 | 16.0255 |
61
+ | 3.5782 | 8.0 | 141872 | 3.4695 | 4.3817 | 16.0927 |
62
+ | 3.5957 | 9.0 | 159606 | 3.4387 | 4.4197 | 16.0783 |
63
+ | 3.564 | 10.0 | 177340 | 3.4274 | 4.4727 | 16.0017 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8506b123037a7460afd80b7411bfd9bb6fddaa34813280521f638ee144ac38db
3
  size 191081512
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14d93818b254100cb7503c2e4353164b9d45ae107bdab3e7aa64b3376840ad3e
3
  size 191081512
runs/Apr27_09-29-00_a13489f1ea0f/events.out.tfevents.1714210141.a13489f1ea0f.24.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91ba13dfac2f0ed7bcc77b62faccaef800ab2962b9e03c8275993a28a5f50fd7
3
+ size 85666
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e14d1f10456295e290544a3e00f9120e42ebea4a043f30cece639eb2e76a278e
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:75f7ea09b70d0e93b7fe6de0bab2000d62989352d1636f4a12be1cb2f817986f
3
  size 5048