Zekunli commited on
Commit
f390ebe
1 Parent(s): 0122669

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -17
README.md CHANGED
@@ -1,5 +1,4 @@
1
  ---
2
- license: apache-2.0
3
  tags:
4
  - generated_from_trainer
5
  metrics:
@@ -14,14 +13,14 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # flan-t5-large-extraction-cnndm_4000-all
16
 
17
- This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.7290
20
- - Rouge1: 35.0775
21
- - Rouge2: 15.2209
22
- - Rougel: 30.1796
23
- - Rougelsum: 30.1599
24
- - Gen Len: 19.0
25
 
26
  ## Model description
27
 
@@ -46,20 +45,23 @@ The following hyperparameters were used during training:
46
  - seed: 1799
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 10
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
55
- | 2.1464 | 0.4 | 200 | 1.8323 | 35.2242 | 15.3495 | 30.142 | 30.1331 | 19.0 |
56
- | 1.9817 | 0.8 | 400 | 1.7729 | 34.3798 | 14.7287 | 29.5447 | 29.6052 | 18.986 |
57
- | 1.8842 | 1.2 | 600 | 1.7602 | 34.5807 | 15.1707 | 29.7768 | 29.8081 | 18.986 |
58
- | 1.8129 | 1.6 | 800 | 1.7629 | 34.5103 | 15.231 | 29.9182 | 29.9333 | 19.0 |
59
- | 1.8238 | 2.0 | 1000 | 1.7290 | 35.0775 | 15.2209 | 30.1796 | 30.1599 | 19.0 |
60
- | 1.7199 | 2.4 | 1200 | 1.7354 | 34.6552 | 15.7256 | 30.1894 | 30.2207 | 18.998 |
61
- | 1.7128 | 2.8 | 1400 | 1.7407 | 34.7198 | 15.5771 | 30.0585 | 30.0442 | 19.0 |
62
- | 1.6816 | 3.2 | 1600 | 1.7508 | 34.9611 | 15.5792 | 30.3518 | 30.3638 | 19.0 |
 
 
 
63
 
64
 
65
  ### Framework versions
 
1
  ---
 
2
  tags:
3
  - generated_from_trainer
4
  metrics:
 
13
 
14
  # flan-t5-large-extraction-cnndm_4000-all
15
 
16
+ This model was trained from scratch on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.7531
19
+ - Rouge1: 35.0348
20
+ - Rouge2: 15.5615
21
+ - Rougel: 30.2628
22
+ - Rougelsum: 30.218
23
+ - Gen Len: 18.994
24
 
25
  ## Model description
26
 
 
45
  - seed: 1799
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - num_epochs: 20
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
54
+ | 1.6013 | 0.4 | 200 | 1.7818 | 34.9655 | 15.0698 | 29.8381 | 29.7984 | 19.0 |
55
+ | 1.5864 | 0.8 | 400 | 1.7662 | 33.9055 | 14.3357 | 29.3382 | 29.3206 | 19.0 |
56
+ | 1.5214 | 1.2 | 600 | 1.7988 | 34.856 | 15.4214 | 29.9006 | 29.8346 | 19.0 |
57
+ | 1.4759 | 1.6 | 800 | 1.8195 | 33.7856 | 14.6215 | 29.2806 | 29.2144 | 19.0 |
58
+ | 1.5068 | 2.0 | 1000 | 1.7686 | 34.828 | 14.8614 | 29.761 | 29.7316 | 18.998 |
59
+ | 1.5696 | 2.4 | 1200 | 1.7531 | 35.0348 | 15.5615 | 30.2628 | 30.218 | 18.994 |
60
+ | 1.5671 | 2.8 | 1400 | 1.7651 | 34.0963 | 15.2973 | 29.8032 | 29.7499 | 19.0 |
61
+ | 1.5385 | 3.2 | 1600 | 1.7834 | 33.9286 | 14.8702 | 29.2844 | 29.2384 | 19.0 |
62
+ | 1.4972 | 3.6 | 1800 | 1.7808 | 34.6569 | 15.1071 | 29.8532 | 29.8168 | 19.0 |
63
+ | 1.4991 | 4.0 | 2000 | 1.7640 | 34.7095 | 15.0358 | 29.6992 | 29.672 | 19.0 |
64
+ | 1.4504 | 4.4 | 2200 | 1.7944 | 34.9119 | 15.3942 | 29.9696 | 29.9097 | 19.0 |
65
 
66
 
67
  ### Framework versions