pabRomero/BART-Firefox-Simplification

Text2Text Generation

Transformers

Safetensors

bart

Generated from Trainer

pabRomero committed on Feb 14

Commit 41c702f (verified) · 1 Parent(s): c5fefc4

End of training

Files changed (2)

README.md +23 -18
model.safetensors +1 -1

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1026
+- Loss: 1.8845
 ## Model description
@@ -35,29 +35,34 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-06
-- train_batch_size: 4
-- eval_batch_size: 4
+- learning_rate: 2.5e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1695
-- num_epochs: 10
+- lr_scheduler_warmup_steps: 317
+- num_epochs: 15
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 2.9356        | 1.0   | 1696  | 2.6397          |
-| 2.6065        | 2.0   | 3392  | 2.4075          |
-| 2.4067        | 3.0   | 5088  | 2.2693          |
-| 2.319         | 4.0   | 6784  | 2.2217          |
-| 2.2325        | 5.0   | 8480  | 2.1939          |
-| 2.1475        | 6.0   | 10176 | 2.1746          |
-| 2.0414        | 7.0   | 11872 | 2.1443          |
-| 2.0622        | 8.0   | 13568 | 2.1175          |
-| 1.9606        | 9.0   | 15264 | 2.1103          |
-| 1.9139        | 10.0  | 16960 | 2.1026          |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.9105        | 1.0   | 212  | 2.6066          |
+| 2.5528        | 2.0   | 424  | 2.2820          |
+| 2.2435        | 3.0   | 636  | 2.1249          |
+| 2.0536        | 4.0   | 848  | 2.0488          |
+| 1.8914        | 5.0   | 1060 | 2.0239          |
+| 1.8098        | 6.0   | 1272 | 1.9841          |
+| 1.7267        | 7.0   | 1484 | 1.9589          |
+| 1.6515        | 8.0   | 1696 | 1.9342          |
+| 1.5856        | 9.0   | 1908 | 1.9000          |
+| 1.5154        | 10.0  | 2120 | 1.9103          |
+| 1.457         | 11.0  | 2332 | 1.9085          |
+| 1.4072        | 12.0  | 2544 | 1.8845          |
+| 1.369         | 13.0  | 2756 | 1.8903          |
+| 1.3603        | 14.0  | 2968 | 1.8887          |
+| 1.3215        | 15.0  | 3180 | 1.8869          |
 ### Framework versions
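
Reviewer note: the numbers in the README diff can be cross-checked with a little arithmetic. The snippet below is a minimal sketch, not part of the commit. It assumes a single device and no gradient accumulation, so that one logged step corresponds to one batch of `train_batch_size` examples; the diff does not state either. The names `old_run`, `new_run`, `summarize`, and `new_val_loss` are illustrative only, and every value is copied from the diff.

```python
# Values copied from the hyperparameter lists and results tables in the diff.
old_run = {"batch_size": 4, "steps_per_epoch": 1696, "epochs": 10, "warmup_steps": 1695}
new_run = {"batch_size": 32, "steps_per_epoch": 212, "epochs": 15, "warmup_steps": 317}


def summarize(run):
    """Derive totals from one run (assumes one optimizer step per batch)."""
    total_steps = run["steps_per_epoch"] * run["epochs"]
    return {
        # Upper bound: the last batch of an epoch may be only partly filled.
        "max_train_examples": run["steps_per_epoch"] * run["batch_size"],
        "total_steps": total_steps,
        "warmup_fraction": run["warmup_steps"] / total_steps,
    }


# (epoch, validation loss) pairs from the new run's results table.
new_val_loss = [
    (1, 2.6066), (2, 2.2820), (3, 2.1249), (4, 2.0488), (5, 2.0239),
    (6, 1.9841), (7, 1.9589), (8, 1.9342), (9, 1.9000), (10, 1.9103),
    (11, 1.9085), (12, 1.8845), (13, 1.8903), (14, 1.8887), (15, 1.8869),
]
best_epoch, best_loss = min(new_val_loss, key=lambda pair: pair[1])

old_summary = summarize(old_run)
new_summary = summarize(new_run)
print(old_summary)
print(new_summary)
print(best_epoch, best_loss)
```

Under those assumptions both runs come out at no more than 6784 training examples per epoch, which suggests the training set is unchanged, and both warm up for roughly 10% of their total steps (16960 and 3180). The lowest validation loss of the new run is at epoch 12 (1.8845), which is the value reported under "Loss" rather than the final-epoch value of 1.8869. That would be consistent with the best checkpoint being kept at the end of training, though the diff does not confirm it.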

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d59cd7c33fd4816f359801d4d711b68dc72616349b4e0beeab5db422a530c8ce
+oid sha256:cc2710abc2bfddad13725d0e3a7438e377611e02ddb9550234c1bcf8adce658d
 size 557921848
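
Reviewer note: `model.safetensors` is stored as a Git LFS pointer, so this part of the diff only changes the `oid` line. The size is identical in both versions (557921848 bytes), which fits a retrained model with the same architecture. The snippet below is a hedged sketch of how a downloaded copy could be checked against the new pointer using only the Python standard library. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so a ~558 MB file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the new side of the diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:cc2710abc2bfddad13725d0e3a7438e377611e02ddb9550234c1bcf8adce658d
size 557921848
"""
pointer = parse_lfs_pointer(NEW_POINTER)
```

Calling `matches_pointer("model.safetensors", pointer)` on a local download (the path is an assumption) should return `True` only when the file corresponds to commit 41c702f and not to its parent.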