format
README.md CHANGED
@@ -88,7 +88,7 @@ A fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/goo
 
 - At the time of writing, the model is not _fully converged_ despite training for 20+ epochs. This checkpoint is serviceable enough (see examples).
 - I plan to update this page with newer checkpoints and post some metrics over time.
-- Compare performance to [LED-base](https://huggingface.co/pszemraj/led-base-book-summary) trained on the same dataset.
+- Compare performance to [LED-base](https://huggingface.co/pszemraj/led-base-book-summary) trained on the same dataset (API gen parameters are the same).
 
 ## Training and evaluation data
 
@@ -111,7 +111,7 @@ The following hyperparameters were used during the **final** training round\*:
 - lr_scheduler_warmup_ratio: 0.02
 - num_epochs: 2
 
-\*_Prior training sessions used roughly similar parameters, multiple sessions were required as this takes eons to
+\*_Prior training sessions used roughly similar parameters, multiple sessions were required as this takes eons to train_
 
 ### Training results
 
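The updated bullet in the first hunk notes that the LED-base comparison uses the same generation ("API gen") parameters. Below is a minimal sketch of such a like-for-like comparison with the transformers summarization pipeline; the generation values and the second checkpoint id are assumptions for illustration, as only pszemraj/led-base-book-summary is named in this diff.

```python
from transformers import pipeline

# Checkpoints to compare. "pszemraj/led-base-book-summary" is the comparison
# model named in the card; the second id is a placeholder for the long-t5
# checkpoint this card describes (not specified in this diff).
model_ids = [
    "pszemraj/led-base-book-summary",
    "your-long-t5-book-summary-checkpoint",  # placeholder
]

# Shared generation parameters so the comparison is like-for-like.
# These particular values are illustrative, not taken from the card.
gen_kwargs = {
    "max_length": 256,
    "min_length": 8,
    "no_repeat_ngram_size": 3,
    "num_beams": 4,
}

long_text = "..."  # a long document or book chapter to summarize

for model_id in model_ids:
    summarizer = pipeline("summarization", model=model_id)
    result = summarizer(long_text, **gen_kwargs)
    print(f"== {model_id} ==\n{result[0]['summary_text']}\n")
```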
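For the hyperparameter bullets visible in the second hunk, here is a hedged sketch of how lr_scheduler_warmup_ratio and num_epochs could map onto transformers Seq2SeqTrainingArguments; every other value below is an assumed placeholder, not something stated in this diff.

```python
from transformers import Seq2SeqTrainingArguments

# Only warmup_ratio and num_train_epochs correspond to values shown in the
# card's hyperparameter list; the rest are placeholders for illustration.
training_args = Seq2SeqTrainingArguments(
    output_dir="long-t5-book-summary",   # placeholder
    warmup_ratio=0.02,                   # card: lr_scheduler_warmup_ratio: 0.02
    num_train_epochs=2,                  # card: num_epochs: 2
    learning_rate=1e-4,                  # placeholder, not in this diff
    per_device_train_batch_size=1,       # placeholder, not in this diff
)
```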