Shakhovak/flan-t5-xl-absa-multitask-rest

Generated from Trainer

Shakhovak committed on Apr 11

Commit b8d143c • 1 Parent(s): 4f8f3c3

End of training

Files changed (3)

README.md +33 -5
adapter_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ybelkada/flan-t5-xl-sharded-bf16](https://huggingface.co/ybelkada/flan-t5-xl-sharded-bf16) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2781
 ## Model description
@@ -40,15 +40,43 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.2483        | 0.32  | 200  | 0.4278          |
-| 0.4429        | 0.63  | 400  | 0.3025          |
-| 0.3615        | 0.95  | 600  | 0.2781          |
 ### Framework versions

 This model is a fine-tuned version of [ybelkada/flan-t5-xl-sharded-bf16](https://huggingface.co/ybelkada/flan-t5-xl-sharded-bf16) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1206
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.2698        | 0.32  | 200  | 2.8711          |
+| 1.3101        | 0.63  | 400  | 0.4477          |
+| 0.4353        | 0.95  | 600  | 0.2831          |
+| 0.324         | 1.27  | 800  | 0.2462          |
+| 0.2806        | 1.58  | 1000 | 0.2112          |
+| 0.2527        | 1.9   | 1200 | 0.1953          |
+| 0.2256        | 2.22  | 1400 | 0.1847          |
+| 0.2206        | 2.53  | 1600 | 0.1724          |
+| 0.2048        | 2.85  | 1800 | 0.1673          |
+| 0.1872        | 3.16  | 2000 | 0.1638          |
+| 0.177         | 3.48  | 2200 | 0.1607          |
+| 0.1749        | 3.8   | 2400 | 0.1511          |
+| 0.1736        | 4.11  | 2600 | 0.1436          |
+| 0.16          | 4.43  | 2800 | 0.1424          |
+| 0.1553        | 4.75  | 3000 | 0.1444          |
+| 0.1476        | 5.06  | 3200 | 0.1399          |
+| 0.1437        | 5.38  | 3400 | 0.1351          |
+| 0.1459        | 5.7   | 3600 | 0.1309          |
+| 0.1469        | 6.01  | 3800 | 0.1277          |
+| 0.1363        | 6.33  | 4000 | 0.1290          |
+| 0.1393        | 6.65  | 4200 | 0.1278          |
+| 0.1343        | 6.96  | 4400 | 0.1285          |
+| 0.1269        | 7.28  | 4600 | 0.1271          |
+| 0.1362        | 7.59  | 4800 | 0.1206          |
+| 0.1271        | 7.91  | 5000 | 0.1256          |
+| 0.1293        | 8.23  | 5200 | 0.1239          |
+| 0.1189        | 8.54  | 5400 | 0.1227          |
+| 0.1218        | 8.86  | 5600 | 0.1209          |
+| 0.1171        | 9.18  | 5800 | 0.1234          |
+| 0.124         | 9.49  | 6000 | 0.1222          |
+| 0.1218        | 9.81  | 6200 | 0.1206          |
 ### Framework versions
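
The updated training table can be checked programmatically. The sketch below is an illustration only, not part of the commit: it parses the 31 rows from the "+" side of the diff, finds the evaluation step with the lowest validation loss, and estimates the number of optimizer steps per epoch from the last row. The names `TABLE` and `parse_rows` are hypothetical, and the steps-per-epoch figure is an estimate derived from the rounded epoch column.

```python
# Rows copied from the "+" side of the README.md diff (leading "+" removed).
TABLE = """\
| 4.2698 | 0.32 | 200  | 2.8711 |
| 1.3101 | 0.63 | 400  | 0.4477 |
| 0.4353 | 0.95 | 600  | 0.2831 |
| 0.324  | 1.27 | 800  | 0.2462 |
| 0.2806 | 1.58 | 1000 | 0.2112 |
| 0.2527 | 1.9  | 1200 | 0.1953 |
| 0.2256 | 2.22 | 1400 | 0.1847 |
| 0.2206 | 2.53 | 1600 | 0.1724 |
| 0.2048 | 2.85 | 1800 | 0.1673 |
| 0.1872 | 3.16 | 2000 | 0.1638 |
| 0.177  | 3.48 | 2200 | 0.1607 |
| 0.1749 | 3.8  | 2400 | 0.1511 |
| 0.1736 | 4.11 | 2600 | 0.1436 |
| 0.16   | 4.43 | 2800 | 0.1424 |
| 0.1553 | 4.75 | 3000 | 0.1444 |
| 0.1476 | 5.06 | 3200 | 0.1399 |
| 0.1437 | 5.38 | 3400 | 0.1351 |
| 0.1459 | 5.7  | 3600 | 0.1309 |
| 0.1469 | 6.01 | 3800 | 0.1277 |
| 0.1363 | 6.33 | 4000 | 0.1290 |
| 0.1393 | 6.65 | 4200 | 0.1278 |
| 0.1343 | 6.96 | 4400 | 0.1285 |
| 0.1269 | 7.28 | 4600 | 0.1271 |
| 0.1362 | 7.59 | 4800 | 0.1206 |
| 0.1271 | 7.91 | 5000 | 0.1256 |
| 0.1293 | 8.23 | 5200 | 0.1239 |
| 0.1189 | 8.54 | 5400 | 0.1227 |
| 0.1218 | 8.86 | 5600 | 0.1209 |
| 0.1171 | 9.18 | 5800 | 0.1234 |
| 0.124  | 9.49 | 6000 | 0.1222 |
| 0.1218 | 9.81 | 6200 | 0.1206 |
"""


def parse_rows(table):
    """Turn the pipe-delimited table body into a list of dicts."""
    rows = []
    for line in table.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        train_loss, epoch, step, val_loss = cells
        rows.append({
            "train_loss": float(train_loss),
            "epoch": float(epoch),
            "step": int(step),
            "val_loss": float(val_loss),
        })
    return rows


rows = parse_rows(TABLE)

# min() returns the first row among ties, i.e. the earliest step.
best = min(rows, key=lambda r: r["val_loss"])
last = rows[-1]

# Rough estimate only: the epoch column is rounded to two decimals.
steps_per_epoch = last["step"] / last["epoch"]

print(f"lowest validation loss {best['val_loss']} first reached at step {best['step']}")
print(f"final row: step {last['step']}, validation loss {last['val_loss']}")
print(f"approx. steps per epoch: {steps_per_epoch:.0f}")
```

The final row's validation loss equals the `Loss` value reported at the top of the updated README, and the same value first appears at an earlier evaluation step, so validation loss is essentially flat over the last few epochs.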

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:19d0ea9c4a6c29a1a564978be3cb0849cc1755892f74c3e80358da5fd923fc16
 size 37855242

 version https://git-lfs.github.com/spec/v1
+oid sha256:17a66a0ed3d8a8b39cdfc98b26948b9a114b8dae4ee799ec4a6f9dad252da41e
 size 37855242

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8418f134a3051e363af41d1f3cfa63a8c9f99873cee2842aa059d1f0e990955e
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:a65946cfc661dcfe81069010e83b2f90523a4b9fd121756ea63c5a142de616f2
 size 5112
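
The two `.bin` diffs above are Git LFS pointer files rather than the binaries themselves: each holds a spec version, a SHA-256 object ID, and a byte size. As a minimal sketch (the helper name `parse_lfs_pointer` is hypothetical, not part of any library), the following parses the old and new `adapter_model.bin` pointers shown in the diff and confirms that the content hash changed while the file size stayed the same.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by a single space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the adapter_model.bin diff above.
old = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:19d0ea9c4a6c29a1a564978be3cb0849cc1755892f74c3e80358da5fd923fc16
size 37855242
""")
new = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:17a66a0ed3d8a8b39cdfc98b26948b9a114b8dae4ee799ec4a6f9dad252da41e
size 37855242
""")

changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]

print(f"content changed: {changed}, size unchanged: {same_size}")
```

An unchanged size with a different hash suggests the same set of tensors was saved with new values, which fits a retrained adapter, though the pointer files alone cannot confirm this.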