node-py
/

my_awesome_eli5_clm-model

generated_from_keras_callback

Model card Files Files and versions Community

node-py commited on Sep 12, 2024

Commit

cae3119

·

1 Parent(s): 21a5acd

Training in progress epoch 4

Files changed (2) hide show

README.md +4 -3
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -16,9 +16,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 1.9794
-- Validation Loss: 1.9025
-- Epoch: 3
 ## Model description
@@ -48,6 +48,7 @@ The following hyperparameters were used during training:
 | 2.0425     | 1.9519          | 1     |
 | 2.0092     | 1.9229          | 2     |
 | 1.9794     | 1.9025          | 3     |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 1.9528
+- Validation Loss: 1.8787
+- Epoch: 4
 ## Model description
 | 2.0425     | 1.9519          | 1     |
 | 2.0092     | 1.9229          | 2     |
 | 1.9794     | 1.9025          | 3     |
+| 1.9528     | 1.8787          | 4     |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3ba52bd188fc0cb2ee757ec2fd66432ada8f0f089b3036881abdbc2bc6cd47ff
 size 327745472

 version https://git-lfs.github.com/spec/v1
+oid sha256:932d881cfbe03ef1b6aab5c210f736f6c76ac925560997c84027c343cbd72588
 size 327745472