End of training

Files changed (7) hide show

README.md CHANGED Viewed

@@ -1,4 +1,7 @@
 ---
 license: apache-2.0
 base_model: google/mt5-xl
 tags:
@@ -17,9 +20,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-xl](https://huggingface.co/google/mt5-xl) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8710
-- Bleu: 8.1488
-- Gen Len: 18.9785
 ## Model description

 ---
+language:
+- vie
+- lao
 license: apache-2.0
 base_model: google/mt5-xl
 tags:
 This model is a fine-tuned version of [google/mt5-xl](https://huggingface.co/google/mt5-xl) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8711
+- Bleu: 19.5743
+- Gen Len: 41.77
 ## Model description

all_results.json ADDED Viewed

+{
+    "epoch": 3.0,
+    "eval_bleu": 19.5743,
+    "eval_gen_len": 41.77,
+    "eval_loss": 0.8710537552833557,
+    "eval_runtime": 1747.0033,
+    "eval_samples": 1996,
+    "eval_samples_per_second": 1.143,
+    "eval_steps_per_second": 0.571,
+    "predict_bleu": 19.5743,
+    "predict_gen_len": 41.77,
+    "predict_loss": 0.8710537552833557,
+    "predict_runtime": 1745.3443,
+    "predict_samples": 1996,
+    "predict_samples_per_second": 1.144,
+    "predict_steps_per_second": 0.572,
+    "train_loss": 0.3135226284782316,
+    "train_runtime": 125809.459,
+    "train_samples": 143836,
+    "train_samples_per_second": 3.43,
+    "train_steps_per_second": 0.857
+}

eval_results.json ADDED Viewed

+{
+    "epoch": 3.0,
+    "eval_bleu": 19.5743,
+    "eval_gen_len": 41.77,
+    "eval_loss": 0.8710537552833557,
+    "eval_runtime": 1747.0033,
+    "eval_samples": 1996,
+    "eval_samples_per_second": 1.143,
+    "eval_steps_per_second": 0.571
+}

generated_predictions.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

predict_results.json ADDED Viewed

+{
+    "predict_bleu": 19.5743,
+    "predict_gen_len": 41.77,
+    "predict_loss": 0.8710537552833557,
+    "predict_runtime": 1745.3443,
+    "predict_samples": 1996,
+    "predict_samples_per_second": 1.144,
+    "predict_steps_per_second": 0.572
+}

train_results.json ADDED Viewed

+{
+    "epoch": 3.0,
+    "train_loss": 0.3135226284782316,
+    "train_runtime": 125809.459,
+    "train_samples": 143836,
+    "train_samples_per_second": 3.43,
+    "train_steps_per_second": 0.857
+}

trainer_state.json ADDED Viewed

The diff for this file is too large to render. See raw diff