pszemraj
/

long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP13

text2text-generation

Model card Files Files and versions Community

pszemraj commited on Aug 30, 2022

Commit

33c9eba

·

1 Parent(s): 2496e86

Update README.md

Files changed (1) hide show

README.md +3 -5

README.md CHANGED Viewed

@@ -6,12 +6,10 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP12-ft3-booksum
-This model is a fine-tuned version of [pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP12](https://huggingface.co/pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP12) on the None dataset.
 ## Model description
@@ -40,7 +38,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.01
-- num_epochs: 3
 ### Framework versions

   results: []
 ---
+# long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP13
+This model is a fine-tuned version of [pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP12](https://huggingface.co/pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP12) on the `kmfoda/booksum`. Evaluating some metric results before merging with the "main" wip version
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.01
+- num_epochs: 1.1
 ### Framework versions