nishita committed on
Commit
27bbd33
·
1 Parent(s): d5f52c1

update model card README.md

Files changed (1)
  1. README.md +112 -4
README.md CHANGED
@@ -1,6 +1,8 @@
1
  ---
2
  tags:
3
  - generated_from_trainer
4
  model-index:
5
  - name: outputs
6
  results: []
@@ -12,6 +14,13 @@ should probably proofread and complete it, then remove this comment. -->
12
  # outputs
13
 
14
  This model is a fine-tuned version of [mrm8488/t5-base-finetuned-common_gen](https://huggingface.co/mrm8488/t5-base-finetuned-common_gen) on an unknown dataset.
15
 
16
  ## Model description
17
 
@@ -36,13 +45,112 @@ The following hyperparameters were used during training:
36
  - seed: 42
37
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
38
  - lr_scheduler_type: linear
39
- - num_epochs: 1
40
 
41
  ### Training results
42
 
43
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
44
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
45
- | No log | 1.0 | 241 | 2.3602 | 73.3459 | 48.755 | 62.4776 | 62.4645 | 16.5806 |
46
 
47
 
48
  ### Framework versions
 
1
  ---
2
  tags:
3
  - generated_from_trainer
4
+ metrics:
5
+ - rouge
6
  model-index:
7
  - name: outputs
8
  results: []
 
14
  # outputs
15
 
16
  This model is a fine-tuned version of [mrm8488/t5-base-finetuned-common_gen](https://huggingface.co/mrm8488/t5-base-finetuned-common_gen) on an unknown dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 3.2833
19
+ - Rouge1: 79.0721
20
+ - Rouge2: 59.355
21
+ - Rougel: 70.9787
22
+ - Rougelsum: 70.9177
23
+ - Gen Len: 16.3819
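
For readers unfamiliar with the Rouge1 figure above, here is a minimal sketch of a unigram-overlap F1 score on the same 0 to 100 scale. It is a simplification and an assumption on our part: it uses lowercase whitespace tokenization and no stemming, so it will not reproduce the exact numbers of the ROUGE implementation used to build this card. The function name `rouge1_f1` and the example sentences are hypothetical.

```python
from collections import Counter


def rouge1_f1(prediction: str, reference: str) -> float:
    """Simplified ROUGE-1 F1 (0-100): clipped unigram overlap between two strings."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        return 0.0
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 100.0 * 2 * precision * recall / (precision + recall)


# Hypothetical example pair, not taken from the evaluation set.
score = rouge1_f1(
    "a dog catches a frisbee",
    "the dog catches a frisbee in the park",
)
print(round(score, 4))
```

Rouge2 follows the same pattern with bigrams in place of unigrams.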
24
 
25
  ## Model description
26
 
 
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - num_epochs: 100
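
The training results table logs step 241 at epoch 1.0, so 100 epochs correspond to 24100 optimizer steps. The sketch below shows how a linear schedule would scale the base learning rate over that run. It assumes zero warmup steps, since this excerpt does not state a warmup setting, and `linear_decay_factor` is a hypothetical helper name rather than a library function.

```python
def linear_decay_factor(step: int, total_steps: int, warmup_steps: int = 0) -> float:
    """Multiplier applied to the base learning rate at a given optimizer step."""
    if step < warmup_steps:
        # Linear ramp from 0 to 1 during warmup (unused when warmup_steps == 0).
        return step / max(1, warmup_steps)
    # Linear decay from 1 at the end of warmup to 0 at the final step.
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


STEPS_PER_EPOCH = 241  # step logged at epoch 1.0 in the results table
NUM_EPOCHS = 100       # num_epochs after this commit
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS

for epoch in (0, 25, 50, 100):
    step = epoch * STEPS_PER_EPOCH
    print(epoch, linear_decay_factor(step, TOTAL_STEPS))
```

Under these assumptions the learning rate is at half its initial value by epoch 50 and reaches zero at step 24100.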
49
 
50
  ### Training results
51
 
52
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
53
+ |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
54
+ | No log | 1.0 | 241 | 2.3934 | 75.6585 | 53.2221 | 65.7693 | 65.7796 | 16.4214 |
55
+ | No log | 2.0 | 482 | 2.4223 | 76.75 | 55.466 | 67.3377 | 67.3425 | 16.4381 |
56
+ | 0.5668 | 3.0 | 723 | 2.5147 | 77.5314 | 56.5654 | 68.4518 | 68.4263 | 16.4235 |
57
+ | 0.5668 | 4.0 | 964 | 2.5164 | 78.3267 | 57.4609 | 69.1759 | 69.1835 | 16.4381 |
58
+ | 0.4546 | 5.0 | 1205 | 2.6058 | 78.6699 | 58.2725 | 69.8265 | 69.8802 | 16.358 |
59
+ | 0.4546 | 6.0 | 1446 | 2.6295 | 78.8943 | 58.182 | 69.5933 | 69.619 | 16.4464 |
60
+ | 0.4014 | 7.0 | 1687 | 2.6592 | 79.0497 | 58.7135 | 70.2046 | 70.2047 | 16.4152 |
61
+ | 0.4014 | 8.0 | 1928 | 2.6610 | 79.1756 | 59.143 | 70.5631 | 70.5854 | 16.4235 |
62
+ | 0.3652 | 9.0 | 2169 | 2.6636 | 79.2156 | 59.3209 | 70.5418 | 70.5757 | 16.4308 |
63
+ | 0.3652 | 10.0 | 2410 | 2.6647 | 78.9922 | 59.0252 | 70.49 | 70.5262 | 16.3569 |
64
+ | 0.341 | 11.0 | 2651 | 2.7105 | 79.0372 | 58.7891 | 70.2666 | 70.3333 | 16.4131 |
65
+ | 0.341 | 12.0 | 2892 | 2.7829 | 79.2323 | 59.4941 | 70.6877 | 70.7283 | 16.3684 |
66
+ | 0.3199 | 13.0 | 3133 | 2.7344 | 79.0326 | 58.8869 | 70.2648 | 70.2983 | 16.4329 |
67
+ | 0.3199 | 14.0 | 3374 | 2.7568 | 79.172 | 59.1695 | 70.5229 | 70.5505 | 16.435 |
68
+ | 0.301 | 15.0 | 3615 | 2.7773 | 78.9152 | 58.8776 | 70.368 | 70.3922 | 16.3923 |
69
+ | 0.301 | 16.0 | 3856 | 2.8178 | 79.1939 | 59.1455 | 70.6938 | 70.7153 | 16.4693 |
70
+ | 0.2883 | 17.0 | 4097 | 2.8105 | 79.1307 | 59.114 | 70.6573 | 70.67 | 16.4287 |
71
+ | 0.2883 | 18.0 | 4338 | 2.8360 | 78.8425 | 58.6959 | 70.4532 | 70.3955 | 16.358 |
72
+ | 0.2779 | 19.0 | 4579 | 2.8606 | 79.3713 | 59.3462 | 70.6531 | 70.648 | 16.3632 |
73
+ | 0.2779 | 20.0 | 4820 | 2.8513 | 79.0083 | 58.817 | 70.5779 | 70.5775 | 16.435 |
74
+ | 0.2664 | 21.0 | 5061 | 2.8882 | 79.1277 | 59.1771 | 70.8215 | 70.7963 | 16.4194 |
75
+ | 0.2664 | 22.0 | 5302 | 2.8852 | 79.0191 | 59.0226 | 70.6604 | 70.6499 | 16.3715 |
76
+ | 0.258 | 23.0 | 5543 | 2.9253 | 79.3219 | 59.1689 | 70.8438 | 70.8462 | 16.41 |
77
+ | 0.258 | 24.0 | 5784 | 2.9020 | 79.1941 | 59.044 | 70.6849 | 70.6449 | 16.3996 |
78
+ | 0.2494 | 25.0 | 6025 | 2.9579 | 79.5266 | 59.8762 | 71.5284 | 71.5085 | 16.4422 |
79
+ | 0.2494 | 26.0 | 6266 | 2.9660 | 79.1653 | 59.3299 | 70.7888 | 70.7755 | 16.4475 |
80
+ | 0.2415 | 27.0 | 6507 | 2.9682 | 79.3267 | 59.7158 | 71.1062 | 71.0864 | 16.411 |
81
+ | 0.2415 | 28.0 | 6748 | 3.0086 | 79.0646 | 59.1926 | 70.8172 | 70.78 | 16.4422 |
82
+ | 0.2415 | 29.0 | 6989 | 3.0190 | 78.9127 | 58.8431 | 70.4618 | 70.4597 | 16.4308 |
83
+ | 0.2361 | 30.0 | 7230 | 2.9958 | 79.1999 | 59.5494 | 70.9411 | 70.9 | 16.3684 |
84
+ | 0.2361 | 31.0 | 7471 | 2.9824 | 79.0219 | 59.0778 | 70.6465 | 70.6445 | 16.3809 |
85
+ | 0.2295 | 32.0 | 7712 | 3.0079 | 79.1633 | 59.4146 | 71.0352 | 70.9144 | 16.4017 |
86
+ | 0.2295 | 33.0 | 7953 | 2.9894 | 79.106 | 59.0905 | 70.644 | 70.622 | 16.4069 |
87
+ | 0.2247 | 34.0 | 8194 | 3.0256 | 78.8789 | 59.0956 | 70.8588 | 70.796 | 16.3954 |
88
+ | 0.2247 | 35.0 | 8435 | 3.0451 | 78.7977 | 58.889 | 70.6925 | 70.6408 | 16.3975 |
89
+ | 0.2196 | 36.0 | 8676 | 3.0475 | 78.8942 | 58.8944 | 70.4803 | 70.4858 | 16.3871 |
90
+ | 0.2196 | 37.0 | 8917 | 3.0289 | 78.7668 | 58.7055 | 70.3606 | 70.3058 | 16.411 |
91
+ | 0.2157 | 38.0 | 9158 | 3.0748 | 78.833 | 58.6404 | 70.271 | 70.2338 | 16.4266 |
92
+ | 0.2157 | 39.0 | 9399 | 3.0665 | 79.2216 | 59.3481 | 70.9647 | 70.9076 | 16.4058 |
93
+ | 0.2104 | 40.0 | 9640 | 3.0773 | 78.9717 | 58.8784 | 70.6392 | 70.6359 | 16.3704 |
94
+ | 0.2104 | 41.0 | 9881 | 3.0823 | 78.901 | 58.9511 | 70.737 | 70.7051 | 16.3788 |
95
+ | 0.2061 | 42.0 | 10122 | 3.0637 | 78.8689 | 58.8098 | 70.754 | 70.7783 | 16.36 |
96
+ | 0.2061 | 43.0 | 10363 | 3.0965 | 79.4409 | 59.5757 | 71.2304 | 71.2553 | 16.4422 |
97
+ | 0.2027 | 44.0 | 10604 | 3.1178 | 78.8964 | 58.9388 | 70.5416 | 70.5367 | 16.4048 |
98
+ | 0.2027 | 45.0 | 10845 | 3.1078 | 79.2306 | 59.5613 | 71.1202 | 71.0741 | 16.4089 |
99
+ | 0.1998 | 46.0 | 11086 | 3.1432 | 79.2393 | 59.5587 | 71.1138 | 71.0502 | 16.4485 |
100
+ | 0.1998 | 47.0 | 11327 | 3.1493 | 79.3553 | 59.5023 | 71.0459 | 71.0508 | 16.4256 |
101
+ | 0.1966 | 48.0 | 11568 | 3.1670 | 79.0395 | 59.1524 | 70.9374 | 70.921 | 16.4006 |
102
+ | 0.1966 | 49.0 | 11809 | 3.1580 | 79.2359 | 59.2038 | 71.0061 | 70.9725 | 16.4568 |
103
+ | 0.1941 | 50.0 | 12050 | 3.1458 | 79.1914 | 59.4751 | 71.2448 | 71.2131 | 16.437 |
104
+ | 0.1941 | 51.0 | 12291 | 3.1379 | 79.2891 | 59.6776 | 71.3416 | 71.2787 | 16.4287 |
105
+ | 0.1903 | 52.0 | 12532 | 3.1618 | 79.4208 | 59.7335 | 71.4722 | 71.4196 | 16.4173 |
106
+ | 0.1903 | 53.0 | 12773 | 3.1712 | 79.2976 | 59.736 | 71.1151 | 71.0491 | 16.4454 |
107
+ | 0.188 | 54.0 | 13014 | 3.1795 | 79.2538 | 59.2768 | 71.0137 | 70.9602 | 16.4183 |
108
+ | 0.188 | 55.0 | 13255 | 3.1836 | 79.4504 | 59.3201 | 71.0908 | 71.0614 | 16.4277 |
109
+ | 0.188 | 56.0 | 13496 | 3.1800 | 79.067 | 59.1309 | 70.7069 | 70.6896 | 16.3996 |
110
+ | 0.1858 | 57.0 | 13737 | 3.1660 | 79.2369 | 59.5092 | 71.0077 | 70.9537 | 16.3871 |
111
+ | 0.1858 | 58.0 | 13978 | 3.1886 | 79.0404 | 58.8652 | 70.6484 | 70.6162 | 16.3798 |
112
+ | 0.1849 | 59.0 | 14219 | 3.1413 | 79.1468 | 59.1964 | 70.8362 | 70.8076 | 16.4121 |
113
+ | 0.1849 | 60.0 | 14460 | 3.1778 | 79.0189 | 58.9894 | 70.597 | 70.5447 | 16.3788 |
114
+ | 0.1824 | 61.0 | 14701 | 3.1674 | 78.8303 | 58.7354 | 70.4517 | 70.4009 | 16.3871 |
115
+ | 0.1824 | 62.0 | 14942 | 3.1756 | 78.8914 | 58.7551 | 70.3439 | 70.3031 | 16.3965 |
116
+ | 0.1793 | 63.0 | 15183 | 3.1943 | 78.9093 | 58.8599 | 70.415 | 70.3998 | 16.4152 |
117
+ | 0.1793 | 64.0 | 15424 | 3.1970 | 78.8353 | 58.8096 | 70.4231 | 70.3813 | 16.4194 |
118
+ | 0.1768 | 65.0 | 15665 | 3.2301 | 79.0958 | 59.2143 | 70.837 | 70.7973 | 16.4162 |
119
+ | 0.1768 | 66.0 | 15906 | 3.2415 | 79.1112 | 59.1872 | 70.7534 | 70.7145 | 16.4173 |
120
+ | 0.177 | 67.0 | 16147 | 3.2332 | 79.0807 | 59.3005 | 70.6878 | 70.6351 | 16.3757 |
121
+ | 0.177 | 68.0 | 16388 | 3.2415 | 79.0587 | 59.1872 | 70.6691 | 70.6295 | 16.3819 |
122
+ | 0.1744 | 69.0 | 16629 | 3.2454 | 79.3829 | 59.5478 | 70.997 | 70.9835 | 16.4037 |
123
+ | 0.1744 | 70.0 | 16870 | 3.2239 | 79.1289 | 59.1661 | 70.7455 | 70.684 | 16.4412 |
124
+ | 0.1715 | 71.0 | 17111 | 3.2056 | 79.2074 | 59.1811 | 70.7366 | 70.682 | 16.4339 |
125
+ | 0.1715 | 72.0 | 17352 | 3.2101 | 79.1756 | 59.1883 | 70.8208 | 70.821 | 16.4079 |
126
+ | 0.1722 | 73.0 | 17593 | 3.2086 | 79.4652 | 59.7631 | 71.3223 | 71.2917 | 16.4214 |
127
+ | 0.1722 | 74.0 | 17834 | 3.2269 | 79.0154 | 59.0411 | 70.6754 | 70.6184 | 16.3621 |
128
+ | 0.17 | 75.0 | 18075 | 3.2389 | 79.1212 | 59.3143 | 70.7629 | 70.6935 | 16.3444 |
129
+ | 0.17 | 76.0 | 18316 | 3.2387 | 79.1148 | 59.4001 | 70.897 | 70.8525 | 16.3663 |
130
+ | 0.1682 | 77.0 | 18557 | 3.2505 | 79.0955 | 59.2033 | 70.8455 | 70.8303 | 16.3632 |
131
+ | 0.1682 | 78.0 | 18798 | 3.2670 | 79.1781 | 59.1849 | 70.8704 | 70.8286 | 16.3611 |
132
+ | 0.1686 | 79.0 | 19039 | 3.2825 | 79.0246 | 59.1434 | 70.7866 | 70.7353 | 16.3559 |
133
+ | 0.1686 | 80.0 | 19280 | 3.2816 | 79.0431 | 58.9944 | 70.6819 | 70.6257 | 16.3486 |
134
+ | 0.1664 | 81.0 | 19521 | 3.2567 | 79.0287 | 59.0805 | 70.7245 | 70.6889 | 16.3392 |
135
+ | 0.1664 | 82.0 | 19762 | 3.2750 | 78.9783 | 59.1123 | 70.6961 | 70.6369 | 16.3465 |
136
+ | 0.1659 | 83.0 | 20003 | 3.2757 | 78.9774 | 59.2248 | 70.7978 | 70.7407 | 16.3455 |
137
+ | 0.1659 | 84.0 | 20244 | 3.2800 | 79.0205 | 59.2371 | 70.8016 | 70.7532 | 16.3704 |
138
+ | 0.1659 | 85.0 | 20485 | 3.2723 | 78.9507 | 59.1004 | 70.6419 | 70.5825 | 16.3663 |
139
+ | 0.165 | 86.0 | 20726 | 3.2723 | 78.9698 | 59.1362 | 70.7738 | 70.7018 | 16.3652 |
140
+ | 0.165 | 87.0 | 20967 | 3.2740 | 79.0156 | 59.2383 | 70.7799 | 70.7075 | 16.384 |
141
+ | 0.1637 | 88.0 | 21208 | 3.2628 | 79.0806 | 59.3468 | 70.8158 | 70.753 | 16.3725 |
142
+ | 0.1637 | 89.0 | 21449 | 3.2605 | 78.9712 | 59.223 | 70.7653 | 70.7021 | 16.3621 |
143
+ | 0.1642 | 90.0 | 21690 | 3.2738 | 79.0793 | 59.3309 | 70.8753 | 70.8282 | 16.385 |
144
+ | 0.1642 | 91.0 | 21931 | 3.2674 | 78.9877 | 59.1468 | 70.7031 | 70.6598 | 16.3694 |
145
+ | 0.1636 | 92.0 | 22172 | 3.2706 | 79.006 | 59.1366 | 70.7069 | 70.6638 | 16.3757 |
146
+ | 0.1636 | 93.0 | 22413 | 3.2783 | 79.0874 | 59.2523 | 70.7881 | 70.7397 | 16.3746 |
147
+ | 0.1629 | 94.0 | 22654 | 3.2801 | 79.1126 | 59.2826 | 70.8352 | 70.7865 | 16.3715 |
148
+ | 0.1629 | 95.0 | 22895 | 3.2797 | 79.0586 | 59.3089 | 70.8126 | 70.7697 | 16.3757 |
149
+ | 0.1625 | 96.0 | 23136 | 3.2826 | 79.1019 | 59.3294 | 70.8399 | 70.803 | 16.3777 |
150
+ | 0.1625 | 97.0 | 23377 | 3.2844 | 79.0739 | 59.3167 | 70.9062 | 70.846 | 16.3767 |
151
+ | 0.1622 | 98.0 | 23618 | 3.2826 | 79.0739 | 59.3167 | 70.9062 | 70.846 | 16.3767 |
152
+ | 0.1622 | 99.0 | 23859 | 3.2830 | 79.0721 | 59.355 | 70.9787 | 70.9177 | 16.3819 |
153
+ | 0.161 | 100.0 | 24100 | 3.2833 | 79.0721 | 59.355 | 70.9787 | 70.9177 | 16.3819 |
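
In the table above, validation loss is lowest at epoch 1 (2.3934) and rises to 3.2833 by epoch 100 while training loss keeps falling, whereas Rouge1 peaks at epoch 25 (79.5266). A minimal sketch of comparing checkpoints under both criteria follows. It uses only a handful of rows copied from the table, and the `EpochLog` type is a hypothetical name introduced for illustration.

```python
from typing import NamedTuple, Optional


class EpochLog(NamedTuple):
    epoch: int
    train_loss: Optional[float]  # None for rows logged as "No log"
    val_loss: float
    rouge1: float


# Subset of rows copied from the training results table.
LOGS = [
    EpochLog(1, None, 2.3934, 75.6585),
    EpochLog(3, 0.5668, 2.5147, 77.5314),
    EpochLog(25, 0.2494, 2.9579, 79.5266),
    EpochLog(50, 0.1941, 3.1458, 79.1914),
    EpochLog(100, 0.161, 3.2833, 79.0721),
]

# The two selection criteria pick different epochs.
best_by_val_loss = min(LOGS, key=lambda row: row.val_loss)
best_by_rouge1 = max(LOGS, key=lambda row: row.rouge1)

print("lowest validation loss:", best_by_val_loss.epoch)
print("highest Rouge1:", best_by_rouge1.epoch)
```

The two criteria disagree here, so which checkpoint is preferable depends on whether loss or ROUGE matters more for the intended use.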
154
 
155
 
156
  ### Framework versions