ch4m3l1on/pegasus-samsum

Text2Text Generation

Generated from Trainer


Update README.md

ch4m3l1on committed on Dec 15, 2024

Commit e1b0758 · verified · 1 parent: 234644e

Files changed (1)

README.md +16 -4


@@ -13,21 +13,33 @@ should probably proofread and complete it, then remove this comment. -->
 # pegasus-samsum
-This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.3986
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 # pegasus-samsum
+This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the SAMSum dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.3986
 ## Model description
+The model starts from the PEGASUS checkpoint trained on CNN/DailyMail and is fine-tuned on the SAMSum dataset to produce summaries of conversations.
 ## Intended uses & limitations
+Summarization of conversations.
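A minimal usage sketch for the intended use above, assuming the checkpoint is published under the repository name shown on this page (`ch4m3l1on/pegasus-samsum`) and that the `transformers` library is installed. The helper names `format_dialogue` and `summarize` are hypothetical, the sample dialogue is made up, and the one-line-per-turn `Speaker: utterance` layout is an assumption about the input format, not something the README states.

```python
from typing import Iterable, Tuple


def format_dialogue(turns: Iterable[Tuple[str, str]]) -> str:
    """Join (speaker, utterance) pairs into one 'Speaker: utterance' line per turn."""
    return "\n".join(f"{speaker}: {utterance}" for speaker, utterance in turns)


def summarize(dialogue: str, model_id: str = "ch4m3l1on/pegasus-samsum") -> str:
    """Summarize a dialogue with the fine-tuned checkpoint (downloads the model)."""
    # Imported lazily so the formatting helper works without transformers installed.
    from transformers import pipeline

    summarizer = pipeline("summarization", model=model_id)
    return summarizer(dialogue)[0]["summary_text"]


# Made-up conversation, used only to show the assumed input layout.
dialogue = format_dialogue([
    ("Anna", "Are we still meeting at six?"),
    ("Ben", "Yes, but I might be ten minutes late."),
    ("Anna", "No problem, I'll grab a table."),
])
print(dialogue)

# Uncomment to run the model; the summary text depends on the checkpoint.
# print(summarize(dialogue))
```

The model call is left commented out because it downloads the weights; only the formatting step runs by default.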
 ## Training and evaluation data
+The training args are the following:
+                                 num_train_epochs=1,
+                                 warmup_steps=500,
+                                 per_device_train_batch_size=1,
+                                 per_gpu_eval_batch_size=1,
+                                 weight_decay=0.01,
+                                 logging_steps=10,
+                                 push_to_hub=True,
+                                 evaluation_strategy='steps',
+                                 eval_steps=500,
+                                 save_steps=1e6,
+                                 gradient_accumulation_steps=16,
+                                 remove_unused_columns=False,
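As a rough sketch of what the arguments above imply, the snippet below collects them in a plain dictionary and derives the effective batch size and the approximate number of optimizer steps in the single epoch. The names `TRAINING_ARGS`, `effective_batch_size`, `optimizer_steps_per_epoch`, and `build_training_arguments` are hypothetical, and the training-set size of 14,732 dialogues and the single-device setup are assumptions, not values stated in the README.

```python
# Values copied from the README; everything else in this block is illustrative.
TRAINING_ARGS = dict(
    num_train_epochs=1,
    warmup_steps=500,
    per_device_train_batch_size=1,
    per_gpu_eval_batch_size=1,
    weight_decay=0.01,
    logging_steps=10,
    push_to_hub=True,
    evaluation_strategy="steps",
    eval_steps=500,
    save_steps=1e6,
    gradient_accumulation_steps=16,
    remove_unused_columns=False,
)


def effective_batch_size(args: dict, num_devices: int = 1) -> int:
    """Examples consumed per optimizer update."""
    return (
        args["per_device_train_batch_size"]
        * args["gradient_accumulation_steps"]
        * num_devices
    )


def optimizer_steps_per_epoch(num_examples: int, args: dict, num_devices: int = 1) -> int:
    """Approximate optimizer updates per epoch (a trailing partial batch is ignored)."""
    return max(num_examples // effective_batch_size(args, num_devices), 1)


def build_training_arguments(output_dir: str):
    """Pass the dictionary to transformers.TrainingArguments (requires transformers)."""
    # Lazy import so the arithmetic above runs without the library installed.
    from transformers import TrainingArguments

    return TrainingArguments(output_dir=output_dir, **TRAINING_ARGS)


ASSUMED_TRAIN_EXAMPLES = 14_732  # assumption: size of the SAMSum training split
steps = optimizer_steps_per_epoch(ASSUMED_TRAIN_EXAMPLES, TRAINING_ARGS)
warmup_fraction = TRAINING_ARGS["warmup_steps"] / steps

print(effective_batch_size(TRAINING_ARGS))  # 16
print(steps)                                # 920
print(round(warmup_fraction, 2))            # 0.54
```

Under these assumptions the 500 warmup steps would cover more than half of the run, and `eval_steps=500` would trigger evaluation only once during training. Recent `transformers` releases rename `evaluation_strategy` to `eval_strategy` and deprecate `per_gpu_eval_batch_size` in favor of `per_device_eval_batch_size`, so `build_training_arguments` may need those keys adjusted for the installed version.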
 ## Training procedure