kazandaev committed on
Commit
8de3399
1 Parent(s): 84282b7

resetting training

README.md CHANGED
@@ -1,68 +1,33 @@
  ---
  tags:
- - generated_from_trainer
- metrics:
- - bleu
- model-index:
- - name: opus-mt-ru-en-finetuned
-   results: []
+ - translation
+ license: apache-2.0
  ---

- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
+ ### opus-mt-ru-en
+
+ * source languages: ru
+ * target languages: en
+ * OPUS readme: [ru-en](https://github.com/Helsinki-NLP/OPUS-MT-train/blob/master/models/ru-en/README.md)
+
+ * dataset: opus
+ * model: transformer-align
+ * pre-processing: normalization + SentencePiece
+ * download original weights: [opus-2020-02-26.zip](https://object.pouta.csc.fi/OPUS-MT-models/ru-en/opus-2020-02-26.zip)
+ * test set translations: [opus-2020-02-26.test.txt](https://object.pouta.csc.fi/OPUS-MT-models/ru-en/opus-2020-02-26.test.txt)
+ * test set scores: [opus-2020-02-26.eval.txt](https://object.pouta.csc.fi/OPUS-MT-models/ru-en/opus-2020-02-26.eval.txt)
+
+ ## Benchmarks
+
+ | testset | BLEU | chr-F |
+ |-----------------------|-------|-------|
+ | newstest2012.ru.en | 34.8 | 0.603 |
+ | newstest2013.ru.en | 27.9 | 0.545 |
+ | newstest2014-ruen.ru.en | 31.9 | 0.591 |
+ | newstest2015-enru.ru.en | 30.4 | 0.568 |
+ | newstest2016-enru.ru.en | 30.1 | 0.565 |
+ | newstest2017-enru.ru.en | 33.4 | 0.593 |
+ | newstest2018-enru.ru.en | 29.6 | 0.565 |
+ | newstest2019-ruen.ru.en | 31.4 | 0.576 |
+ | Tatoeba.ru.en | 61.1 | 0.736 |

- # opus-mt-ru-en-finetuned
-
- This model was trained from scratch on the None dataset.
- It achieves the following results on the evaluation set:
- - Loss: 1.5751
- - Bleu: 32.4337
- - Gen Len: 26.6242
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 0.0001
- - train_batch_size: 80
- - eval_batch_size: 40
- - seed: 42
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - num_epochs: 10
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
- |:-------------:|:-----:|:------:|:---------------:|:-------:|:-------:|
- | 1.1574 | 1.0 | 21506 | 1.7303 | 27.2765 | 26.7867 |
- | 1.218 | 2.0 | 43012 | 1.7203 | 27.9434 | 26.7402 |
- | 1.1679 | 3.0 | 64518 | 1.6614 | 28.7723 | 27.033 |
- | 1.1122 | 4.0 | 86024 | 1.6420 | 29.5047 | 26.5014 |
- | 1.072 | 5.0 | 107530 | 1.6596 | 29.9396 | 27.11 |
- | 1.0324 | 6.0 | 129036 | 1.6229 | 30.6433 | 26.9464 |
- | 0.9953 | 7.0 | 150542 | 1.6070 | 30.9374 | 26.7292 |
- | 0.9597 | 8.0 | 172048 | 1.5850 | 31.7838 | 26.7111 |
- | 0.9172 | 9.0 | 193554 | 1.5789 | 32.4267 | 26.7353 |
- | 0.8885 | 10.0 | 215060 | 1.5751 | 32.4337 | 26.6242 |
-
-
- ### Framework versions
-
- - Transformers 4.16.2
- - Pytorch 1.10.2+cu113
- - Datasets 1.18.3
- - Tokenizers 0.11.0
config.json CHANGED
@@ -1,5 +1,4 @@
  {
- "_name_or_path": "/home/alexey/work/ML/models/opus-mt-ru-en-finetuned",
  "_num_labels": 3,
  "activation_dropout": 0.0,
  "activation_function": "swish",
@@ -16,7 +15,6 @@
  ],
  "bos_token_id": 0,
  "classif_dropout": 0.0,
- "classifier_dropout": 0.0,
  "d_model": 512,
  "decoder_attention_heads": 8,
  "decoder_ffn_dim": 2048,
@@ -29,7 +27,6 @@
  "encoder_layerdrop": 0.0,
  "encoder_layers": 6,
  "eos_token_id": 0,
- "forced_eos_token_id": 0,
  "id2label": {
  "0": "LABEL_0",
  "1": "LABEL_1",
@@ -49,12 +46,8 @@
  "normalize_embedding": false,
  "num_beams": 6,
  "num_hidden_layers": 6,
- "output_attentions": true,
  "pad_token_id": 62517,
  "scale_embedding": true,
  "static_position_embeddings": true,
- "torch_dtype": "float32",
- "transformers_version": "4.16.2",
- "use_cache": true,
  "vocab_size": 62518
  }
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6616853b83f71d7b6ac76bbc5ef61bed5497678111179ee53558e4c3b314f043
- size 304935301
+ oid sha256:535450eb5613f3cc912f9ca3e54cfef6c14d201b319c24a88faf776a65538b5d
+ size 306991893
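The `pytorch_model.bin` change above touches only a Git LFS pointer, not the weights themselves. As a rough illustration of how the two pointers can be compared, here is a minimal Python sketch. The helper `parse_lfs_pointer` and the `LfsPointer` type are hypothetical names introduced for this example (they are not part of Git LFS or the Hub tooling), and the parser assumes the simple three-line `key value` layout shown in the diff.

```python
from typing import NamedTuple


class LfsPointer(NamedTuple):
    version: str  # spec URL from the "version" line
    oid: str      # hex digest, without the "sha256:" prefix
    size: int     # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unexpected hash algorithm: {algo!r}")
    return LfsPointer(fields["version"], digest, int(fields["size"]))


# Pointer contents copied from the removed and added sides of the diff.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:6616853b83f71d7b6ac76bbc5ef61bed5497678111179ee53558e4c3b314f043
size 304935301
"""
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:535450eb5613f3cc912f9ca3e54cfef6c14d201b319c24a88faf776a65538b5d
size 306991893
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
size_delta = new.size - old.size

print(f"weights replaced: {old.oid != new.oid}")
print(f"size delta: {size_delta} bytes")
```

A differing `oid` means the stored binary was replaced; the size delta only says how much larger or smaller the new file is, not what changed inside it.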
runs/Feb21_00-57-18_delta/1645405129.4565482/events.out.tfevents.1645405129.delta.340777.1 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:4364c2e4db166bde7c28a458d3e91a19e9392b6e2917bcff73e4ef4833b739fd
- size 5119
runs/Feb21_00-57-18_delta/events.out.tfevents.1645405129.delta.340777.0 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:4c176a6e7f2533a675ac090ece183212afc609755820b2cad03babddba275c9f
- size 18328
runs/Feb21_09-23-04_delta/1645435477.6763067/events.out.tfevents.1645435477.delta.384317.1 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:8154ab446a19e588adcebc42e2e7dad877ffd92fb63875728f22078201d7e30d
- size 5122
runs/Feb21_09-23-04_delta/1645460644.374451/events.out.tfevents.1645460644.delta.384317.3 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:cc6f672b5b37002189ca0e37b6efb7011d7c99de158c6fb44dc8c17036c85c77
- size 5122
runs/Feb21_09-23-04_delta/events.out.tfevents.1645435477.delta.384317.0 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:09e73c04c291fea60b9b159bd8ba665e30de4af1a67ae917f1b93259e900800b
- size 77118
runs/Feb21_09-23-04_delta/events.out.tfevents.1645460644.delta.384317.2 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:f0756b43f43aba45498b982653574f397e4619532373531a23cb4085af97d0a7
- size 28039
runs/Feb21_19-30-06_delta/1645471936.3118231/events.out.tfevents.1645471936.delta.532773.1 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:fa203a95d2dddb8dc38f89d6b7f509b6bc424f3a265d8b5c2e47e4a27c48ceb1
- size 5122
runs/Feb21_19-30-06_delta/events.out.tfevents.1645471936.delta.532773.0 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:baf1be96a0a9080ed2613e42ca3dae651583f5528801d97d5592fb661bb626f2
- size 13684
runs/Feb21_20-31-25_delta/1645475580.3563583/events.out.tfevents.1645475580.delta.550101.1 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:7ed2d35914d7833a7b749c807f5c43d705e0e4d74bb3b7afc4b93bdebf4870cd
- size 5122
runs/Feb21_20-31-25_delta/events.out.tfevents.1645475580.delta.550101.0 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:e67086f40b65b535ecebf41c1fb83b62f4b012dab21801758cb0566466b3a503
- size 131773
runs/Feb22_17-25-42_delta/1645550835.5555484/events.out.tfevents.1645550835.delta.890584.1 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:2ea23ea0da0c0949532935dc472a7d79ab4175b5c972c52094707d78808c2820
- size 5122
runs/Feb22_17-25-42_delta/events.out.tfevents.1645550835.delta.890584.0 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:27dd7a7a70d4ca32e6959f4d1ff27d750f20331431a7cae0c3a23a74e7b5bd69
- size 10587
tokenizer_config.json CHANGED
@@ -1 +1 @@
- {"source_lang": "ru", "target_lang": "en", "unk_token": "<unk>", "eos_token": "</s>", "pad_token": "<pad>", "model_max_length": 512, "sp_model_kwargs": {}, "special_tokens_map_file": null, "tokenizer_file": null, "name_or_path": "/home/alexey/work/ML/models/opus-mt-ru-en-finetuned", "tokenizer_class": "MarianTokenizer"}
+ {"target_lang": "en", "source_lang": "ru"}
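Because `tokenizer_config.json` is a single-line JSON object, the one-line diff above hides which keys actually changed. The following Python sketch, using only the standard `json` module, compares the two versions key by key. The variable names are illustrative, and the two JSON strings are copied from the removed and added lines of the diff.

```python
import json

# Old and new file contents, copied verbatim from the diff.
OLD_JSON = r'''{"source_lang": "ru", "target_lang": "en", "unk_token": "<unk>", "eos_token": "</s>", "pad_token": "<pad>", "model_max_length": 512, "sp_model_kwargs": {}, "special_tokens_map_file": null, "tokenizer_file": null, "name_or_path": "/home/alexey/work/ML/models/opus-mt-ru-en-finetuned", "tokenizer_class": "MarianTokenizer"}'''
NEW_JSON = r'''{"target_lang": "en", "source_lang": "ru"}'''

old_cfg = json.loads(OLD_JSON)
new_cfg = json.loads(NEW_JSON)

# Keys present on only one side, and keys whose values differ.
removed = sorted(old_cfg.keys() - new_cfg.keys())
added = sorted(new_cfg.keys() - old_cfg.keys())
changed = sorted(
    key for key in old_cfg.keys() & new_cfg.keys() if old_cfg[key] != new_cfg[key]
)

print("removed:", removed)
print("added:  ", added)
print("changed:", changed)
```

Comparing parsed objects rather than raw text ignores key order, so the swapped positions of `source_lang` and `target_lang` do not register as a change.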
vocab.json CHANGED
The diff for this file is too large to render. See raw diff