update model card README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,4 @@
|
|
1 |
---
|
2 |
-
license: apache-2.0
|
3 |
tags:
|
4 |
- generated_from_trainer
|
5 |
metrics:
|
@@ -14,14 +13,14 @@ should probably proofread and complete it, then remove this comment. -->
|
|
14 |
|
15 |
# flan-t5-large-extraction-cnndm_4000-all
|
16 |
|
17 |
-
This model
|
18 |
It achieves the following results on the evaluation set:
|
19 |
-
- Loss: 1.
|
20 |
-
- Rouge1: 35.
|
21 |
-
- Rouge2: 15.
|
22 |
-
- Rougel: 30.
|
23 |
-
- Rougelsum: 30.
|
24 |
-
- Gen Len:
|
25 |
|
26 |
## Model description
|
27 |
|
@@ -46,20 +45,23 @@ The following hyperparameters were used during training:
|
|
46 |
- seed: 1799
|
47 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
48 |
- lr_scheduler_type: linear
|
49 |
-
- num_epochs:
|
50 |
|
51 |
### Training results
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
|
55 |
-
|
|
56 |
-
| 1.
|
57 |
-
| 1.
|
58 |
-
| 1.
|
59 |
-
| 1.
|
60 |
-
| 1.
|
61 |
-
| 1.
|
62 |
-
| 1.
|
|
|
|
|
|
|
63 |
|
64 |
|
65 |
### Framework versions
|
|
|
1 |
---
|
|
|
2 |
tags:
|
3 |
- generated_from_trainer
|
4 |
metrics:
|
|
|
13 |
|
14 |
# flan-t5-large-extraction-cnndm_4000-all
|
15 |
|
16 |
+
This model was trained from scratch on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
+
- Loss: 1.7531
|
19 |
+
- Rouge1: 35.0348
|
20 |
+
- Rouge2: 15.5615
|
21 |
+
- Rougel: 30.2628
|
22 |
+
- Rougelsum: 30.218
|
23 |
+
- Gen Len: 18.994
|
24 |
|
25 |
## Model description
|
26 |
|
|
|
45 |
- seed: 1799
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
+
- num_epochs: 20
|
49 |
|
50 |
### Training results
|
51 |
|
52 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
53 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
|
54 |
+
| 1.6013 | 0.4 | 200 | 1.7818 | 34.9655 | 15.0698 | 29.8381 | 29.7984 | 19.0 |
|
55 |
+
| 1.5864 | 0.8 | 400 | 1.7662 | 33.9055 | 14.3357 | 29.3382 | 29.3206 | 19.0 |
|
56 |
+
| 1.5214 | 1.2 | 600 | 1.7988 | 34.856 | 15.4214 | 29.9006 | 29.8346 | 19.0 |
|
57 |
+
| 1.4759 | 1.6 | 800 | 1.8195 | 33.7856 | 14.6215 | 29.2806 | 29.2144 | 19.0 |
|
58 |
+
| 1.5068 | 2.0 | 1000 | 1.7686 | 34.828 | 14.8614 | 29.761 | 29.7316 | 18.998 |
|
59 |
+
| 1.5696 | 2.4 | 1200 | 1.7531 | 35.0348 | 15.5615 | 30.2628 | 30.218 | 18.994 |
|
60 |
+
| 1.5671 | 2.8 | 1400 | 1.7651 | 34.0963 | 15.2973 | 29.8032 | 29.7499 | 19.0 |
|
61 |
+
| 1.5385 | 3.2 | 1600 | 1.7834 | 33.9286 | 14.8702 | 29.2844 | 29.2384 | 19.0 |
|
62 |
+
| 1.4972 | 3.6 | 1800 | 1.7808 | 34.6569 | 15.1071 | 29.8532 | 29.8168 | 19.0 |
|
63 |
+
| 1.4991 | 4.0 | 2000 | 1.7640 | 34.7095 | 15.0358 | 29.6992 | 29.672 | 19.0 |
|
64 |
+
| 1.4504 | 4.4 | 2200 | 1.7944 | 34.9119 | 15.3942 | 29.9696 | 29.9097 | 19.0 |
|
65 |
|
66 |
|
67 |
### Framework versions
|