madatnlp
/

ke-t5-scratch

text2text-generation

generated_from_keras_callback

Model card Files Files and versions Community

madatnlp commited on May 8, 2022

Commit

f4b9258

·

1 Parent(s): 65052c8

End of training

Files changed (2) hide show

README.md +34 -3
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -13,9 +13,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [madatnlp/ke-t5-math-py](https://huggingface.co/madatnlp/ke-t5-math-py) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 4.0241
-- Validation Loss: 1.8638
-- Epoch: 0
 ## Model description
@@ -42,6 +42,37 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 4.0241     | 1.8638          | 0     |
 ### Framework versions

 This model is a fine-tuned version of [madatnlp/ke-t5-math-py](https://huggingface.co/madatnlp/ke-t5-math-py) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.1796
+- Validation Loss: 0.7374
+- Epoch: 31
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 4.0241     | 1.8638          | 0     |
+| 1.9923     | 1.5399          | 1     |
+| 1.5972     | 1.2482          | 2     |
+| 1.4318     | 1.1416          | 3     |
+| 1.2938     | 1.0979          | 4     |
+| 1.1901     | 1.0205          | 5     |
+| 1.1353     | 0.9546          | 6     |
+| 1.0667     | 0.9118          | 7     |
+| 1.0332     | 0.9256          | 8     |
+| 0.9526     | 0.8136          | 9     |
+| 0.8977     | 0.8164          | 10    |
+| 0.8556     | 0.7733          | 11    |
+| 0.7916     | 0.7578          | 12    |
+| 0.7446     | 0.7172          | 13    |
+| 0.7004     | 0.6962          | 14    |
+| 0.6532     | 0.6611          | 15    |
+| 0.6171     | 0.6897          | 16    |
+| 0.5716     | 0.7145          | 17    |
+| 0.5272     | 0.6789          | 18    |
+| 0.4969     | 0.6006          | 19    |
+| 0.4483     | 0.6339          | 20    |
+| 0.4104     | 0.5614          | 21    |
+| 0.3901     | 0.6786          | 22    |
+| 0.3567     | 0.6617          | 23    |
+| 0.3142     | 0.6668          | 24    |
+| 0.3042     | 0.6578          | 25    |
+| 0.2796     | 0.6587          | 26    |
+| 0.2529     | 0.6606          | 27    |
+| 0.2303     | 0.7759          | 28    |
+| 0.2047     | 0.6297          | 29    |
+| 0.2037     | 0.7410          | 30    |
+| 0.1796     | 0.7374          | 31    |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5c4105be7715abcca03d2798999b5d6b8b1dc03aac4364b4419bc58db8e8f3ed
 size 831509840

 version https://git-lfs.github.com/spec/v1
+oid sha256:7616dfbe90c2db4ded5a526008bb3442fa9e2a7166fb7d25b70cf66d252c4e47
 size 831509840