smitmenon commited on
Commit
2732347
·
verified ·
1 Parent(s): bd9b317

Fine-tuned with denoising, version mbart_denoised_v1

Browse files
Files changed (2) hide show
  1. README.md +5 -5
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  library_name: transformers
3
- base_model: facebook/mbart-large-50-many-to-one-mmt
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # e2m_denoise_project
15
 
16
- This model is a fine-tuned version of [facebook/mbart-large-50-many-to-one-mmt](https://huggingface.co/facebook/mbart-large-50-many-to-one-mmt) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.1636
19
 
20
  ## Model description
21
 
@@ -46,8 +46,8 @@ The following hyperparameters were used during training:
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 2.6348 | 1.0 | 125 | 0.1741 |
50
- | 0.1862 | 2.0 | 250 | 0.1636 |
51
 
52
 
53
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
+ base_model: facebook/mbart-large-50-one-to-many-mmt
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
13
 
14
  # e2m_denoise_project
15
 
16
+ This model is a fine-tuned version of [facebook/mbart-large-50-one-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-one-to-many-mmt) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.3997
19
 
20
  ## Model description
21
 
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 2.8775 | 1.0 | 125 | 0.4854 |
50
+ | 0.4479 | 2.0 | 250 | 0.3997 |
51
 
52
 
53
  ### Framework versions
generation_config.json CHANGED
@@ -2,8 +2,8 @@
2
  "_from_model_config": true,
3
  "bos_token_id": 0,
4
  "decoder_start_token_id": 2,
 
5
  "eos_token_id": 2,
6
- "forced_bos_token_id": 250004,
7
  "forced_eos_token_id": 2,
8
  "max_length": 200,
9
  "num_beams": 5,
 
2
  "_from_model_config": true,
3
  "bos_token_id": 0,
4
  "decoder_start_token_id": 2,
5
+ "early_stopping": true,
6
  "eos_token_id": 2,
 
7
  "forced_eos_token_id": 2,
8
  "max_length": 200,
9
  "num_beams": 5,