madatnlp
/

mbart

Transformers

TensorFlow

mbart

text2text-generation

generated_from_keras_callback

Model card Files Files and versions Community

madatnlp commited on May 26, 2022

Commit

a87282b

1 Parent(s): 045a288

add model

Browse files

Files changed (2) hide show

README.md +39 -47
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -13,9 +13,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 1.7384
-- Validation Loss: 1.6859
-- Epoch: 43
 ## Model description
@@ -41,50 +41,42 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 12.5168    | 17.3038         | 0     |
-| 13.4675    | 12.0726         | 1     |
-| 6.7752     | 4.7717          | 2     |
-| 4.2218     | 3.7952          | 3     |
-| 3.6988     | 3.5533          | 4     |
-| 3.5477     | 3.4101          | 5     |
-| 3.4128     | 3.3961          | 6     |
-| 3.3888     | 3.3118          | 7     |
-| 3.3311     | 3.2826          | 8     |
-| 3.2614     | 3.2270          | 9     |
-| 3.2088     | 3.2100          | 10    |
-| 3.1062     | 3.0641          | 11    |
-| 3.0133     | 2.9382          | 12    |
-| 2.9223     | 2.8687          | 13    |
-| 2.8273     | 2.7063          | 14    |
-| 2.7913     | 2.8055          | 15    |
-| 2.6951     | 2.7133          | 16    |
-| 2.6631     | 2.5467          | 17    |
-| 2.5664     | 2.5094          | 18    |
-| 2.5150     | 2.4914          | 19    |
-| 2.4462     | 2.3662          | 20    |
-| 2.4081     | 2.3500          | 21    |
-| 2.3605     | 2.2733          | 22    |
-| 2.3164     | 2.2821          | 23    |
-| 2.3014     | 2.2191          | 24    |
-| 2.2103     | 2.0691          | 25    |
-| 2.1883     | 2.1237          | 26    |
-| 2.1814     | 2.1356          | 27    |
-| 2.1292     | 2.0151          | 28    |
-| 2.1046     | 1.9359          | 29    |
-| 2.0542     | 2.0341          | 30    |
-| 2.0365     | 1.9446          | 31    |
-| 1.9829     | 1.8999          | 32    |
-| 1.9801     | 1.8124          | 33    |
-| 1.9168     | 1.7824          | 34    |
-| 1.9209     | 1.8382          | 35    |
-| 1.8850     | 1.8215          | 36    |
-| 1.8748     | 1.7645          | 37    |
-| 1.8346     | 1.6847          | 38    |
-| 1.8076     | 1.7017          | 39    |
-| 1.8732     | 1.8328          | 40    |
-| 1.8164     | 1.7083          | 41    |
-| 1.7784     | 1.7930          | 42    |
-| 1.7384     | 1.6859          | 43    |
 ### Framework versions

 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.5342
+- Validation Loss: 0.5633
+- Epoch: 35
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 4.5626     | 3.7843          | 0     |
+| 2.5836     | 1.9212          | 1     |
+| 1.6546     | 1.2552          | 2     |
+| 1.2499     | 1.0248          | 3     |
+| 1.0088     | 0.8457          | 4     |
+| 0.9100     | 0.7958          | 5     |
+| 0.8290     | 0.8421          | 6     |
+| 0.7999     | 0.7625          | 7     |
+| 0.7633     | 0.7202          | 8     |
+| 0.7439     | 0.7100          | 9     |
+| 0.7182     | 0.6787          | 10    |
+| 0.7092     | 0.6877          | 11    |
+| 0.6823     | 0.6684          | 12    |
+| 0.6738     | 0.6712          | 13    |
+| 0.6603     | 0.6858          | 14    |
+| 0.6462     | 0.6268          | 15    |
+| 0.6373     | 0.6208          | 16    |
+| 0.6424     | 0.6735          | 17    |
+| 0.6259     | 0.6423          | 18    |
+| 0.6249     | 0.6069          | 19    |
+| 0.6148     | 0.6510          | 20    |
+| 0.6063     | 0.6207          | 21    |
+| 0.5987     | 0.5977          | 22    |
+| 0.5917     | 0.6019          | 23    |
+| 0.5800     | 0.5828          | 24    |
+| 0.5779     | 0.5505          | 25    |
+| 0.5765     | 0.5887          | 26    |
+| 0.5667     | 0.5989          | 27    |
+| 0.5623     | 0.5859          | 28    |
+| 0.5564     | 0.5907          | 29    |
+| 0.5523     | 0.5928          | 30    |
+| 0.5478     | 0.5624          | 31    |
+| 0.5472     | 0.5563          | 32    |
+| 0.5462     | 0.5953          | 33    |
+| 0.5324     | 0.5593          | 34    |
+| 0.5342     | 0.5633          | 35    |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d890c143526f0a4a71d390c345c5c53f903e705421c77321e0a90e0e9293d31f
 size 2445079280

 version https://git-lfs.github.com/spec/v1
+oid sha256:1b1c2daabd53d121f49bcb1caf09aa2588149a8d7ccc7b42fb4dbf18b34b7fb9
 size 2445079280