node-py/my_awesome_eli5_clm-model

generated_from_keras_callback

node-py committed on Sep 12, 2024

Commit d8f0fdb · 1 Parent(s): 2f98a86

Training in progress epoch 0

Files changed (2)

README.md +4 -13
tf_model.h5 +1 -1

README.md CHANGED

@@ -16,9 +16,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 1.8440
-- Validation Loss: 1.7976
-- Epoch: 9
+- Train Loss: 1.8274
+- Validation Loss: 1.7835
+- Epoch: 0
 ## Model description
@@ -44,16 +44,7 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 2.0816     | 1.9831          | 0     |
-| 2.0425     | 1.9519          | 1     |
-| 2.0092     | 1.9229          | 2     |
-| 1.9794     | 1.9025          | 3     |
-| 1.9528     | 1.8787          | 4     |
-| 1.9269     | 1.8583          | 5     |
-| 1.9049     | 1.8418          | 6     |
-| 1.8832     | 1.8287          | 7     |
-| 1.8640     | 1.8091          | 8     |
-| 1.8440     | 1.7976          | 9     |
+| 1.8274     | 1.7835          | 0     |
 ### Framework versions
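
The validation losses in this README diff can be put on a more interpretable scale by converting them to perplexity. This is a minimal sketch, assuming the Keras callback reports mean per-token cross-entropy in nats; the commit itself does not state this, and the `perplexity` helper is a hypothetical name used only for illustration.

```python
import math


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


# Values copied from the diff: the removed summary line (epoch 9)
# and the added summary line (epoch 0 of this commit).
val_loss_removed = 1.7976
val_loss_added = 1.7835

ppl_removed = perplexity(val_loss_removed)
ppl_added = perplexity(val_loss_added)

print(f"validation perplexity: {ppl_removed:.2f} -> {ppl_added:.2f}")
```

If the assumption about the loss holds, a lower value here means the model is, on average, less uncertain about each next token on the evaluation set.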

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f058058bb694e0f105eb612fd5fd88998b655627940ae3101d4503fcefa67b3b
 size 327745472
+oid sha256:cb15b83510bcaa57e272e7f1b7d1069ab390b67819bb5e62d5826737079fcbba
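
The `tf_model.h5` diff touches only a Git LFS pointer file: the weights themselves live in LFS storage, and the pointer records their SHA-256 digest and byte size. The sketch below shows one way to parse such a pointer and check a downloaded file against it. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers written for illustration, not part of any Git LFS or Hugging Face tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if data has the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# The pointer content added by this commit.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:cb15b83510bcaa57e272e7f1b7d1069ab390b67819bb5e62d5826737079fcbba\n"
    "size 327745472\n"
)
```

Because the size is unchanged at 327745472 bytes and only the digest differs, the diff is consistent with the same architecture being saved with updated weights. For a file of this size, hashing in chunks rather than reading it fully into memory would be the more practical choice.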