za17
/

helsinki-biomedical-finetuned

Translation

Transformers

Safetensors

marian

text2text-generation

Generated from Trainer

Model card Files Files and versions Community

za17 commited on Jun 7, 2024

Commit

2167636

1 Parent(s): 360a9b3

Revert to fine-tuned CL model

Browse files

Files changed (3) hide show

README.md +11 -33
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # helsinki-biomedical-finetuned
-This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-es](https://huggingface.co/Helsinki-NLP/opus-mt-en-es) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0247
-- Bleu: 55.6929
 ## Model description
@@ -38,46 +38,24 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1.5e-05
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 4
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: reduce_lr_on_plateau
-- num_epochs: 25
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step  | Validation Loss | Bleu    |
-|:-------------:|:-------:|:-----:|:---------------:|:-------:|
-| 0.0289        | 0.9998  | 3293  | 0.0253          | 52.7060 |
-| 0.0255        | 1.9998  | 6587  | 0.0238          | 53.6061 |
-| 0.0228        | 2.9999  | 9881  | 0.0231          | 54.1920 |
-| 0.0206        | 4.0     | 13175 | 0.0226          | 54.4549 |
-| 0.0192        | 4.9998  | 16468 | 0.0224          | 54.5222 |
-| 0.0176        | 5.9998  | 19762 | 0.0222          | 54.6624 |
-| 0.0167        | 6.9999  | 23056 | 0.0221          | 55.0200 |
-| 0.0154        | 8.0     | 26350 | 0.0223          | 53.3307 |
-| 0.0147        | 8.9998  | 29643 | 0.0223          | 55.2185 |
-| 0.0138        | 9.9998  | 32937 | 0.0224          | 54.9215 |
-| 0.0133        | 10.9999 | 36231 | 0.0225          | 55.3672 |
-| 0.0122        | 12.0    | 39525 | 0.0229          | 55.2831 |
-| 0.0115        | 12.9998 | 42818 | 0.0231          | 55.2310 |
-| 0.0108        | 13.9998 | 46112 | 0.0233          | 55.3215 |
-| 0.0103        | 14.9999 | 49406 | 0.0234          | 55.3170 |
-| 0.0096        | 16.0    | 52700 | 0.0237          | 55.3158 |
-| 0.0089        | 16.9998 | 55993 | 0.0242          | 55.0178 |
-| 0.0084        | 17.9998 | 59287 | 0.0243          | 55.1974 |
-| 0.0072        | 18.9999 | 62581 | 0.0244          | 55.6011 |
-| 0.007         | 20.0    | 65875 | 0.0245          | 55.5510 |
-| 0.0069        | 20.9998 | 69168 | 0.0246          | 55.6178 |
-| 0.0068        | 21.9998 | 72462 | 0.0246          | 55.7191 |
-| 0.0068        | 22.9999 | 75756 | 0.0247          | 55.6917 |
-| 0.0066        | 24.0    | 79050 | 0.0247          | 55.6962 |
-| 0.0067        | 24.9943 | 82325 | 0.0247          | 55.6929 |
 ### Framework versions

 # helsinki-biomedical-finetuned
+This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-es](https://huggingface.co/Helsinki-NLP/opus-mt-en-es) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0875
+- Bleu: 43.9070
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 8e-07
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 4
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Bleu    |
+|:-------------:|:------:|:----:|:---------------:|:-------:|
+| No log        | 0.9987 | 187  | 0.0890          | 43.7030 |
+| No log        | 1.9973 | 374  | 0.0880          | 43.7818 |
+| 0.0947        | 2.9960 | 561  | 0.0875          | 43.9070 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6335ad8280fc181bac5297c034176c06781616753b847f6fa7feebc70ef5d7d5
 size 309965092

 version https://git-lfs.github.com/spec/v1
+oid sha256:8270085986f0253d764e56a4ca7d1dc839d0c9345d7e50b6957bfd72cbce7388
 size 309965092

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d9c23a95b52128c133143d8847bfa4f4a2dfd234abefea64204da3abdee5f725
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:1e747c183dae3cb169560061858f8d235fcb25e6555f2915d4b7763b65e64965
 size 5176