kazandaev
/

opus-mt-ru-en-finetuned

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

kazandaev commited on Feb 24, 2022

Commit

1e5af0b

·

1 Parent(s): 2053776

update model card README.md

Files changed (1) hide show

README.md +21 -14

README.md CHANGED Viewed

@@ -13,11 +13,11 @@ should probably proofread and complete it, then remove this comment. -->
 # opus-mt-ru-en-finetuned
-This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2311
-- Bleu: 35.6405
-- Gen Len: 26.0366
 ## Model description
@@ -36,26 +36,33 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
-- train_batch_size: 85
-- eval_batch_size: 42
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Bleu    | Gen Len |
-|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|
-| 1.2           | 1.0   | 20262 | 1.3793          | 32.0937 | 26.0227 |
-| 1.1325        | 2.0   | 40524 | 1.2856          | 34.3345 | 26.1998 |
-| 1.0781        | 3.0   | 60786 | 1.2311          | 35.6405 | 26.0366 |
 ### Framework versions
 - Transformers 4.16.2
-- Pytorch 1.10.2+cu113
 - Datasets 1.18.3
 - Tokenizers 0.11.0

 # opus-mt-ru-en-finetuned
+This model is a fine-tuned version of [kazandaev/opus-mt-ru-en-finetuned](https://huggingface.co/kazandaev/opus-mt-ru-en-finetuned) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1124
+- Bleu: 39.6748
+- Gen Len: 26.0628
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 49
+- eval_batch_size: 24
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
+| Training Loss | Epoch | Step   | Validation Loss | Bleu    | Gen Len |
+|:-------------:|:-----:|:------:|:---------------:|:-------:|:-------:|
+| 1.172         | 1.0   | 35147  | 1.2856          | 34.1563 | 26.1419 |
+| 1.1403        | 2.0   | 70294  | 1.2595          | 34.8515 | 26.198  |
+| 1.0997        | 3.0   | 105441 | 1.2305          | 35.7998 | 26.115  |
+| 1.0711        | 4.0   | 140588 | 1.2111          | 36.5266 | 26.17   |
+| 1.0392        | 5.0   | 175735 | 1.1953          | 36.9092 | 26.0507 |
+| 1.0109        | 6.0   | 210882 | 1.1662          | 37.7652 | 26.0546 |
+| 0.9878        | 7.0   | 246029 | 1.1542          | 38.4936 | 25.9766 |
+| 0.9573        | 8.0   | 281176 | 1.1298          | 39.06   | 26.1242 |
+| 0.9263        | 9.0   | 316323 | 1.1214          | 39.5778 | 26.0582 |
+| 0.9132        | 10.0  | 351470 | 1.1124          | 39.6748 | 26.0628 |
 ### Framework versions
 - Transformers 4.16.2
+- Pytorch 1.10.0+cu111
 - Datasets 1.18.3
 - Tokenizers 0.11.0