acrowth/touring2

Text2Text Generation

Generated from Trainer

text-generation-inference

acrowth committed on Mar 7, 2023

Commit 057319d · 1 Parent(s): b348f7f

Upload 8 files

Files changed (3)

README.md +16 -13
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.8345
-- Rouge1: 31.1956
-- Rouge2: 3.519
-- Rougel: 30.9876
-- Rougelsum: 31.0581
-- Gen Len: 7.6887
 ## Model description
@@ -41,19 +41,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
-| No log        | 1.0   | 292  | 2.8988          | 30.2013 | 3.6434 | 30.0162 | 30.1004   | 7.6261  |
-| 3.3994        | 2.0   | 584  | 2.8345          | 31.1956 | 3.519  | 30.9876 | 31.0581   | 7.6887  |
 ### Framework versions
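
The removed side above lists `learning_rate: 5e-05`, `lr_scheduler_type: linear`, and a run that ends at step 584. A minimal sketch of what such a schedule would compute, assuming zero warmup steps and a decay to zero at the final step (the card does not state a warmup value, so both are assumptions). `linear_lr` is a hypothetical helper written for illustration, not a library function.

```python
def linear_lr(step, total_steps, base_lr=5e-05, warmup_steps=0):
    """Learning rate at `step` under linear warmup followed by linear decay.

    warmup_steps=0 is an assumption; the model card does not list it.
    """
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Steps logged in the removed training table: 292 (epoch 1) and 584 (epoch 2).
for step in (0, 292, 584):
    print(step, linear_lr(step, total_steps=584))
```

Under these assumptions the rate is halved by the end of epoch 1 and reaches zero at step 584.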

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4042
+- Rouge1: 60.0314
+- Rouge2: 42.51
+- Rougel: 59.8461
+- Rougelsum: 59.6885
+- Gen Len: 9.6526
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 1
+- eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 1.6599        | 1.0   | 1249 | 1.4452          | 52.1368 | 34.9356 | 51.6433 | 51.7033   | 9.4643  |
+| 1.2659        | 2.0   | 2498 | 1.4013          | 53.5023 | 35.6721 | 53.0881 | 53.1954   | 9.6526  |
+| 1.1027        | 3.0   | 3747 | 1.3475          | 59.004  | 41.5484 | 58.8625 | 58.785    | 9.6818  |
+| 0.9453        | 4.0   | 4996 | 1.3966          | 58.5942 | 40.7989 | 58.3943 | 58.3703   | 9.7516  |
+| 0.9083        | 5.0   | 6245 | 1.4042          | 60.0314 | 42.51   | 59.8461 | 59.6885   | 9.6526  |
 ### Framework versions
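
The summary metrics on the added side match the epoch 5 row, yet validation loss is lowest at epoch 3. A short sketch that makes the comparison explicit, with the rows copied from the added training table. The tuple layout and variable names are illustrative only.

```python
# (epoch, training_loss, validation_loss, rouge1), copied from the added table.
rows = [
    (1, 1.6599, 1.4452, 52.1368),
    (2, 1.2659, 1.4013, 53.5023),
    (3, 1.1027, 1.3475, 59.004),
    (4, 0.9453, 1.3966, 58.5942),
    (5, 0.9083, 1.4042, 60.0314),
]

# Epoch with the lowest validation loss.
best_by_loss = min(rows, key=lambda r: r[2])
# Epoch with the highest Rouge1.
best_by_rouge1 = max(rows, key=lambda r: r[3])

print("lowest validation loss: epoch", best_by_loss[0])
print("highest Rouge1: epoch", best_by_rouge1[0])

# Gap between validation and training loss per epoch; a widening gap
# is one common sign of overfitting.
for epoch, train_loss, val_loss, _ in rows:
    print(epoch, round(val_loss - train_loss, 4))
```

The two criteria disagree here: validation loss favors epoch 3, Rouge1 favors epoch 5. Which checkpoint is preferable depends on the metric that matters for the downstream task.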

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:45794850adfd65077d446e9c579bb5a4070772e903e64e45d94282921613bcdd
 size 990408885

 version https://git-lfs.github.com/spec/v1
+oid sha256:1a3d7614a659cb9b4721f780a0d4e85dc5c738df8c7b5e37b49e866737679a8c
 size 990408885

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bf4093346e95dfb31b4e21345154798dde8da1840d9f10078b49861557b6c800
 size 3707

 version https://git-lfs.github.com/spec/v1
+oid sha256:2ec2aacd55a49aa18c35279dc59a78b260dce5a116907e62b4d519e7e1e97e56
 size 3707
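
Both binary files are stored as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. In each diff only the `oid` changes while `size` stays the same. A minimal sketch of parsing a pointer and checking downloaded bytes against it; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if `data` (bytes) has the size and SHA-256 the pointer records."""
    if pointer["hash_algo"] != "sha256" or len(data) != pointer["size"]:
        return False
    return hashlib.sha256(data).hexdigest() == pointer["digest"]


# The added training_args.bin pointer from this commit.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:2ec2aacd55a49aa18c35279dc59a78b260dce5a116907e62b4d519e7e1e97e56
size 3707
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

For a file as large as `pytorch_model.bin`, hashing in chunks with `hashlib.sha256().update(...)` avoids loading all 990408885 bytes into memory at once.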