ThuyNT03 commited on
Commit
8232440
1 Parent(s): f51bed5

End of training

Browse files
Files changed (2) hide show
  1. README.md +19 -14
  2. pytorch_model.bin +1 -1
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.6954
21
- - Rouge1: 74.1444
22
- - Rouge2: 67.6733
23
- - Rougel: 73.6458
24
- - Rougelsum: 73.7053
25
- - Gen Len: 7.3623
26
 
27
  ## Model description
28
 
@@ -42,22 +42,27 @@ More information needed
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 2e-05
45
- - train_batch_size: 4
46
- - eval_batch_size: 4
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
- - num_epochs: 5
51
 
52
  ### Training results
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
56
- | No log | 1.0 | 472 | 0.7246 | 72.5851 | 65.9384 | 72.1305 | 72.0232 | 8.4407 |
57
- | 1.0847 | 2.0 | 944 | 0.6714 | 73.9038 | 67.1961 | 73.5409 | 73.5136 | 6.214 |
58
- | 0.5906 | 3.0 | 1416 | 0.6565 | 74.0155 | 67.4387 | 73.6696 | 73.6203 | 7.2754 |
59
- | 0.464 | 4.0 | 1888 | 0.6696 | 74.3779 | 67.7236 | 73.9367 | 74.0007 | 7.214 |
60
- | 0.389 | 5.0 | 2360 | 0.6954 | 74.1444 | 67.6733 | 73.6458 | 73.7053 | 7.3623 |
 
 
 
 
 
61
 
62
 
63
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.9951
21
+ - Rouge1: 74.1102
22
+ - Rouge2: 67.9199
23
+ - Rougel: 73.686
24
+ - Rougelsum: 73.7568
25
+ - Gen Len: 7.0148
26
 
27
  ## Model description
28
 
 
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 2e-05
45
+ - train_batch_size: 16
46
+ - eval_batch_size: 16
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
+ - num_epochs: 10
51
 
52
  ### Training results
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
56
+ | No log | 1.0 | 118 | 0.7373 | 74.1623 | 67.7624 | 73.6071 | 73.6764 | 7.3326 |
57
+ | No log | 2.0 | 236 | 0.7758 | 74.1167 | 67.7666 | 73.7039 | 73.8076 | 7.0869 |
58
+ | No log | 3.0 | 354 | 0.8174 | 73.8958 | 67.4854 | 73.3437 | 73.4362 | 7.1822 |
59
+ | No log | 4.0 | 472 | 0.8195 | 74.8085 | 68.4703 | 74.3389 | 74.4854 | 6.7903 |
60
+ | 0.2234 | 5.0 | 590 | 0.8848 | 74.1319 | 67.6899 | 73.5608 | 73.6273 | 7.2013 |
61
+ | 0.2234 | 6.0 | 708 | 0.9413 | 73.4933 | 67.0495 | 73.0176 | 73.0687 | 7.2839 |
62
+ | 0.2234 | 7.0 | 826 | 0.9167 | 74.1512 | 67.7638 | 73.7512 | 73.8058 | 6.9703 |
63
+ | 0.2234 | 8.0 | 944 | 0.9577 | 73.8412 | 67.3981 | 73.3697 | 73.4324 | 7.1525 |
64
+ | 0.1303 | 9.0 | 1062 | 0.9869 | 73.9929 | 67.64 | 73.4942 | 73.5355 | 7.2309 |
65
+ | 0.1303 | 10.0 | 1180 | 0.9951 | 74.1102 | 67.9199 | 73.686 | 73.7568 | 7.0148 |
66
 
67
 
68
  ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:35bc72a26205d1c8deb4faa5b0a1b6181c484420110c12331623f8c9e457c8f6
3
  size 903892625
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c1b786a5862a80735c4239c9a60d8d9396d97a6edffdf086699120e0b3be4e3
3
  size 903892625