End of training

Browse files

Files changed (5) hide show

README.md +9 -24
generation_config.json +5 -3
model.safetensors +1 -1
runs/Apr22_02-42-29_591f46b8fae4/events.out.tfevents.1713753750.591f46b8fae4.34.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,10 +1,8 @@
 ---
-license: apache-2.0
-base_model: t5-small
 tags:
 - generated_from_trainer
-metrics:
-- bleu
 model-index:
 - name: my_awesome_english_to_nepali_tst
   results: []
@@ -15,11 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 # my_awesome_english_to_nepali_tst
-This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.7758
-- Bleu: 4.076
-- Gen Len: 17.595
 ## Model description
@@ -39,28 +33,19 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
-| No log        | 1.0   | 32   | 1.8790          | 3.86   | 17.665  |
-| No log        | 2.0   | 64   | 1.8311          | 4.0878 | 17.645  |
-| No log        | 3.0   | 96   | 1.8105          | 4.0976 | 17.615  |
-| No log        | 4.0   | 128  | 1.7988          | 4.1081 | 17.615  |
-| No log        | 5.0   | 160  | 1.7911          | 4.057  | 17.625  |
-| No log        | 6.0   | 192  | 1.7854          | 4.0552 | 17.61   |
-| No log        | 7.0   | 224  | 1.7812          | 4.0714 | 17.61   |
-| No log        | 8.0   | 256  | 1.7780          | 4.085  | 17.595  |
-| No log        | 9.0   | 288  | 1.7764          | 4.076  | 17.595  |
-| No log        | 10.0  | 320  | 1.7758          | 4.076  | 17.595  |
 ### Framework versions

 ---
+license: cc-by-nc-4.0
+base_model: facebook/nllb-200-distilled-600M
 tags:
 - generated_from_trainer
 model-index:
 - name: my_awesome_english_to_nepali_tst
   results: []
 # my_awesome_english_to_nepali_tst
+This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| No log        | 1.0   | 125  | 2.1541          | 12.7749 | 31.255  |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -1,6 +1,8 @@
 {
-  "decoder_start_token_id": 0,
-  "eos_token_id": 1,
-  "pad_token_id": 0,
   "transformers_version": "4.39.3"
 }

 {
+  "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "eos_token_id": 2,
+  "max_length": 200,
+  "pad_token_id": 1,
   "transformers_version": "4.39.3"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c98fe7c19a834953e8206991a1a233d1599764cd4f11af64d075dcc0283da995
 size 2460354912

 version https://git-lfs.github.com/spec/v1
+oid sha256:cdb873ee2af19aa2e4f629d9d082ea0c3d4b95c4bc39f0356be6937615dc0804
 size 2460354912

runs/Apr22_02-42-29_591f46b8fae4/events.out.tfevents.1713753750.591f46b8fae4.34.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:19f6d25499df667314c54576ce92192700dff0530ee5e4a32fe886a4d9173134
+size 5755

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:69c22a12518114c75f17acc2b45072f3897c76e2488f8ace4713c650f60f89e2
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:b11e9307188b59281bb3a2b59afb97f2ded6e2011eb8e6d8e92616a2a8812368
 size 5112