smitmenon
/

e2m_translation_project

@@ -1,5 +1,6 @@
 ---
 library_name: transformers
 tags:
 - generated_from_trainer
 model-index:
@@ -12,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # e2m_translation_project
-This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4409
 ## Model description
@@ -45,8 +46,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.5001        | 1.0   | 625  | 0.4536          |
-| 0.3662        | 2.0   | 1250 | 0.4409          |
 ### Framework versions

 ---
 library_name: transformers
+base_model: facebook/mbart-large-50-one-to-many-mmt
 tags:
 - generated_from_trainer
 model-index:
 # e2m_translation_project
+This model is a fine-tuned version of [facebook/mbart-large-50-one-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-one-to-many-mmt) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3881
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.4269        | 1.0   | 625  | 0.3894          |
+| 0.2929        | 2.0   | 1250 | 0.3881          |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -4,6 +4,7 @@
   "decoder_start_token_id": 2,
   "early_stopping": true,
   "eos_token_id": 2,
   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,

   "decoder_start_token_id": 2,
   "early_stopping": true,
   "eos_token_id": 2,
+  "forced_eos_token_id": 2,
   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,