phucnn commited on
Commit
2ea33be
·
verified ·
1 Parent(s): b884daf

End of training

Browse files
README.md CHANGED
@@ -1,4 +1,7 @@
1
  ---
 
 
 
2
  license: apache-2.0
3
  base_model: google/mt5-xl
4
  tags:
@@ -17,9 +20,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google/mt5-xl](https://huggingface.co/google/mt5-xl) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.8710
21
- - Bleu: 8.1488
22
- - Gen Len: 18.9785
23
 
24
  ## Model description
25
 
 
1
  ---
2
+ language:
3
+ - vie
4
+ - lao
5
  license: apache-2.0
6
  base_model: google/mt5-xl
7
  tags:
 
20
 
21
  This model is a fine-tuned version of [google/mt5-xl](https://huggingface.co/google/mt5-xl) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.8711
24
+ - Bleu: 19.5743
25
+ - Gen Len: 41.77
26
 
27
  ## Model description
28
 
all_results.json ADDED
@@ -0,0 +1,22 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 3.0,
3
+ "eval_bleu": 19.5743,
4
+ "eval_gen_len": 41.77,
5
+ "eval_loss": 0.8710537552833557,
6
+ "eval_runtime": 1747.0033,
7
+ "eval_samples": 1996,
8
+ "eval_samples_per_second": 1.143,
9
+ "eval_steps_per_second": 0.571,
10
+ "predict_bleu": 19.5743,
11
+ "predict_gen_len": 41.77,
12
+ "predict_loss": 0.8710537552833557,
13
+ "predict_runtime": 1745.3443,
14
+ "predict_samples": 1996,
15
+ "predict_samples_per_second": 1.144,
16
+ "predict_steps_per_second": 0.572,
17
+ "train_loss": 0.3135226284782316,
18
+ "train_runtime": 125809.459,
19
+ "train_samples": 143836,
20
+ "train_samples_per_second": 3.43,
21
+ "train_steps_per_second": 0.857
22
+ }
eval_results.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 3.0,
3
+ "eval_bleu": 19.5743,
4
+ "eval_gen_len": 41.77,
5
+ "eval_loss": 0.8710537552833557,
6
+ "eval_runtime": 1747.0033,
7
+ "eval_samples": 1996,
8
+ "eval_samples_per_second": 1.143,
9
+ "eval_steps_per_second": 0.571
10
+ }
generated_predictions.txt ADDED
The diff for this file is too large to render. See raw diff
 
predict_results.json ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "predict_bleu": 19.5743,
3
+ "predict_gen_len": 41.77,
4
+ "predict_loss": 0.8710537552833557,
5
+ "predict_runtime": 1745.3443,
6
+ "predict_samples": 1996,
7
+ "predict_samples_per_second": 1.144,
8
+ "predict_steps_per_second": 0.572
9
+ }
train_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 3.0,
3
+ "train_loss": 0.3135226284782316,
4
+ "train_runtime": 125809.459,
5
+ "train_samples": 143836,
6
+ "train_samples_per_second": 3.43,
7
+ "train_steps_per_second": 0.857
8
+ }
trainer_state.json ADDED
The diff for this file is too large to render. See raw diff