zera09 commited on
Commit
daab84c
·
verified ·
1 Parent(s): 046c217

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.4039
 
 
 
 
 
19
 
20
  ## Model description
21
 
@@ -40,22 +45,15 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 10
44
 
45
  ### Training results
46
 
47
- | Training Loss | Epoch | Step | Validation Loss |
48
- |:-------------:|:-----:|:----:|:---------------:|
49
- | No log | 1.0 | 394 | 1.9051 |
50
- | 3.4843 | 2.0 | 788 | 1.4999 |
51
- | 1.8807 | 3.0 | 1182 | 1.4607 |
52
- | 1.7485 | 4.0 | 1576 | 1.4434 |
53
- | 1.7485 | 5.0 | 1970 | 1.4264 |
54
- | 1.6669 | 6.0 | 2364 | 1.4211 |
55
- | 1.6346 | 7.0 | 2758 | 1.4134 |
56
- | 1.6131 | 8.0 | 3152 | 1.4101 |
57
- | 1.6039 | 9.0 | 3546 | 1.4045 |
58
- | 1.6039 | 10.0 | 3940 | 1.4039 |
59
 
60
 
61
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.5497
19
+ - Rouge Rouge1: 0.3896
20
+ - Rouge Rouge2: 0.1402
21
+ - Rouge Rougel: 0.227
22
+ - Rouge Rougelsum: 0.2269
23
+ - Gen Len: 392.0152
24
 
25
  ## Model description
26
 
 
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - num_epochs: 3
49
 
50
  ### Training results
51
 
52
+ | Training Loss | Epoch | Step | Validation Loss | Rouge Rouge1 | Rouge Rouge2 | Rouge Rougel | Rouge Rougelsum | Gen Len |
53
+ |:-------------:|:-----:|:----:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:--------:|
54
+ | No log | 1.0 | 394 | 1.9062 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
55
+ | 3.7289 | 2.0 | 788 | 1.5960 | 0.279 | 0.0991 | 0.1665 | 0.1662 | 320.4091 |
56
+ | 2.0261 | 3.0 | 1182 | 1.5497 | 0.3896 | 0.1402 | 0.227 | 0.2269 | 392.0152 |
 
 
 
 
 
 
 
57
 
58
 
59
  ### Framework versions
generation_config.json CHANGED
@@ -1,5 +1,4 @@
1
  {
2
- "_from_model_config": true,
3
  "decoder_start_token_id": 0,
4
  "eos_token_id": 1,
5
  "max_length": 400,
 
1
  {
 
2
  "decoder_start_token_id": 0,
3
  "eos_token_id": 1,
4
  "max_length": 400,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:efe4d62d8ebc7335fcfbaac1eda551f987d2096042dd3ecb79449fa31ed0b980
3
  size 990386200
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:443ac3fb0c9b55856cf261149ebf2e1678c32bc2f3f117f695424070ae9aa34d
3
  size 990386200
runs/Dec03_21-15-34_user/events.out.tfevents.1733240735.user.1394782.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a46d1f80b226f86f263cb60985bd29c4d2a6629cb2c7e4405822a1462fbf7081
3
- size 6163
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94f43d4bab0cc2558c9604215b8d33c21c7659da782052f74840383f742434d5
3
+ size 7066