second_model

Files changed (4)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1972
 ## Model description
@@ -35,8 +35,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -44,23 +44,23 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 5.257         | 1.0   | 505  | 1.4156          |
-| 0.4709        | 2.0   | 1010 | 0.2085          |
-| 0.1929        | 3.0   | 1515 | 0.1996          |
-| 0.1762        | 4.0   | 2020 | 0.1973          |
-| 0.1649        | 5.0   | 2525 | 0.1966          |
-| 0.158         | 6.0   | 3030 | 0.1964          |
-| 0.1511        | 7.0   | 3535 | 0.1964          |
-| 0.1456        | 8.0   | 4040 | 0.1968          |
-| 0.144         | 9.0   | 4545 | 0.1971          |
-| 0.1421        | 10.0  | 5050 | 0.1972          |
 ### Framework versions
 - Transformers 4.33.0
 - Pytorch 2.2.1
-- Datasets 2.19.1
 - Tokenizers 0.13.3

 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2003
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 0.4367        | 1.0   | 1009  | 0.2132          |
+| 0.1949        | 2.0   | 2018  | 0.1991          |
+| 0.1744        | 3.0   | 3027  | 0.1956          |
+| 0.1593        | 4.0   | 4036  | 0.1949          |
+| 0.1527        | 5.0   | 5045  | 0.1957          |
+| 0.1419        | 6.0   | 6054  | 0.1962          |
+| 0.1338        | 7.0   | 7063  | 0.1982          |
+| 0.1276        | 8.0   | 8072  | 0.1988          |
+| 0.1239        | 9.0   | 9081  | 0.1996          |
+| 0.1215        | 10.0  | 10090 | 0.2003          |
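The two results tables can be compared by looking for the epoch with the lowest validation loss in each run. The sketch below copies the validation-loss columns from both sides of this diff; the variable and function names are illustrative and are not part of the repository.

```python
# Validation loss per epoch, copied from the two tables in this diff.
# "old" is the removed (-) table, "new" is the added (+) table.
old_val_loss = [1.4156, 0.2085, 0.1996, 0.1973, 0.1966,
                0.1964, 0.1964, 0.1968, 0.1971, 0.1972]
new_val_loss = [0.2132, 0.1991, 0.1956, 0.1949, 0.1957,
                0.1962, 0.1982, 0.1988, 0.1996, 0.2003]


def summarize(val_loss):
    """Return (best_epoch, best_loss, final_loss); epochs are 1-based."""
    best_loss = min(val_loss)
    best_epoch = val_loss.index(best_loss) + 1  # first epoch reaching the minimum
    return best_epoch, best_loss, val_loss[-1]


old_summary = summarize(old_val_loss)
new_summary = summarize(new_val_loss)

for name, (epoch, best, final) in (("old", old_summary), ("new", new_summary)):
    print(f"{name}: best epoch {epoch} (loss {best:.4f}), "
          f"final loss {final:.4f}, final minus best {final - best:.4f}")
```

In both runs the final-epoch loss reported in the model card (0.1972 and 0.2003) is slightly higher than the best value seen during training, while the training loss keeps falling. That pattern is consistent with mild overfitting in the later epochs, although the tables alone do not establish the cause.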
 ### Framework versions
 - Transformers 4.33.0
 - Pytorch 2.2.1
+- Datasets 2.19.2
 - Tokenizers 0.13.3
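The step counts in the two tables are consistent with the batch-size change from 16 to 8. As a rough check, the sketch below bounds the training-set size from the steps per epoch. It assumes a single device, no gradient accumulation, a final partial batch that is kept rather than dropped, and the same training set in both runs; none of these is confirmed by the diff.

```python
def sample_range(steps_per_epoch: int, batch_size: int) -> range:
    """Dataset sizes N for which ceil(N / batch_size) == steps_per_epoch."""
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return range(low, high + 1)


# Steps per epoch are read from the first row of each table.
old = sample_range(steps_per_epoch=505, batch_size=16)
new = sample_range(steps_per_epoch=1009, batch_size=8)

# Sizes compatible with both runs under the assumptions stated above.
low = max(old.start, new.start)
high = min(old.stop, new.stop) - 1

print(f"old run allows {old.start}..{old.stop - 1} samples")
print(f"new run allows {new.start}..{new.stop - 1} samples")
print(f"both runs allow {low}..{high} samples")
```

Under those assumptions the two runs agree on a training set of roughly 8,070 examples, so the doubled step count reflects the smaller batch size rather than more data.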

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1f9a6f06f616805e1f2b1a19642542d1dcbbe55d6b3c73034ddeb903991273fa
 size 2460469182

 version https://git-lfs.github.com/spec/v1
+oid sha256:2da0108c07e8d27da4a915f5eb671c132b138706d83ade7c14377c44aed209e7
 size 2460469182

tokenizer.json CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8ac789ad7dabea44d41537822d48c516ba358374c51813e2cba78c006e150c94
-size 17331224

 version https://git-lfs.github.com/spec/v1
+oid sha256:a5acc0bc2a48b6c16ee9854bd75bdb10bf95cadf1ededf6dce3af0a340b33a34
+size 17331489

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3662193fa00dbc6aa189b2f93558da4ab4c2ba363a6122bae5d620f089e67252
 size 4536

 version https://git-lfs.github.com/spec/v1
+oid sha256:118b6f76cd6ec212cb1be95c35eabfaa8009ff76d3a77dff02401643e55bb83e
 size 4536
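The three binary files above are stored as Git LFS pointers, so the diff shows only a `version` line, a SHA-256 `oid`, and a byte `size` for each. A minimal sketch of reading those fields follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any library, and the pointer text is the new `tokenizer.json` entry copied from this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New tokenizer.json pointer, copied from the diff above.
new_tokenizer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:a5acc0bc2a48b6c16ee9854bd75bdb10bf95cadf1ededf6dce3af0a340b33a34
size 17331489
""")

old_tokenizer_size = 17331224  # from the removed (-) side of the diff
size_delta = new_tokenizer["size"] - old_tokenizer_size

print(new_tokenizer["algo"], new_tokenizer["digest"][:12], new_tokenizer["size"])
print(f"tokenizer.json size change: {size_delta:+d} bytes")
```

Only `tokenizer.json` changed in size. `pytorch_model.bin` and `training_args.bin` kept the same byte counts (2460469182 and 4536), and only their hashes differ, which is what one would expect when weights are retrained with the same architecture.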