Training completed for 6@lr:1e-05!

Files changed (4)

README.md CHANGED

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ml6team/gpt2-small-german-finetune-oscar](https://huggingface.co/ml6team/gpt2-small-german-finetune-oscar) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.6147
 ## Model description
@@ -33,7 +33,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -45,12 +45,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.1666        | 1.0   | 210  | 4.2622          |
-| 3.1732        | 2.0   | 420  | 4.3527          |
-| 2.3188        | 3.0   | 630  | 4.6424          |
-| 1.7614        | 4.0   | 840  | 5.0776          |
-| 1.5433        | 5.0   | 1050 | 5.4150          |
-| 1.5366        | 6.0   | 1260 | 5.6147          |
 ### Framework versions

 This model is a fine-tuned version of [ml6team/gpt2-small-german-finetune-oscar](https://huggingface.co/ml6team/gpt2-small-german-finetune-oscar) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.8729
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.7388        | 1.0   | 210  | 5.7127          |
+| 0.6451        | 2.0   | 420  | 5.7653          |
+| 0.3591        | 3.0   | 630  | 5.8154          |
+| 0.3574        | 4.0   | 840  | 5.8520          |
+| 0.6094        | 5.0   | 1050 | 5.8755          |
+| 1.5115        | 6.0   | 1260 | 5.8729          |
 ### Framework versions
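
The two results tables in this README diff can be compared with a short script. This is a minimal sketch, assuming the reported losses are mean per-token cross-entropy in nats, so that perplexity is `exp(loss)`. The names `previous_run`, `current_run`, `best_epoch`, and `perplexity` are illustrative and not part of the repository.

```python
import math

# Rows copied from the diff: (training_loss, epoch, step, validation_loss).
previous_run = [  # removed rows, learning_rate 0.0005
    (4.1666, 1.0, 210, 4.2622),
    (3.1732, 2.0, 420, 4.3527),
    (2.3188, 3.0, 630, 4.6424),
    (1.7614, 4.0, 840, 5.0776),
    (1.5433, 5.0, 1050, 5.4150),
    (1.5366, 6.0, 1260, 5.6147),
]
current_run = [  # added rows, learning_rate 1e-05
    (0.7388, 1.0, 210, 5.7127),
    (0.6451, 2.0, 420, 5.7653),
    (0.3591, 3.0, 630, 5.8154),
    (0.3574, 4.0, 840, 5.8520),
    (0.6094, 5.0, 1050, 5.8755),
    (1.5115, 6.0, 1260, 5.8729),
]


def best_epoch(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def perplexity(loss):
    """Convert a cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


for name, rows in (("previous", previous_run), ("current", current_run)):
    _, epoch, step, val_loss = best_epoch(rows)
    final_val_loss = rows[-1][3]
    print(
        f"{name}: best epoch {epoch:.0f} (step {step}), "
        f"val loss {val_loss:.4f}, perplexity {perplexity(val_loss):.1f}, "
        f"final val loss {final_val_loss:.4f}"
    )
```

In both runs the lowest validation loss falls on epoch 1 and the loss generally rises afterwards, which is the pattern one would usually check for overfitting. The tables alone do not say why.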

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:49682c03f9efc7c52c2a3efe958c07931fb70c71a2b9028e522a1791547167dc
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:416af33197a2f83d8806112c98ddc85a9180a653004657dc3fe6c88f4fd53e54
 size 497774208
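
The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, the object id, and the size in bytes. The following is a rough sketch of how the old and new `model.safetensors` pointers could be compared. `parse_lfs_pointer` is a hypothetical helper written for this illustration, not a Git LFS or Hugging Face API.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dict (illustrative helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the diff of model.safetensors.
old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:49682c03f9efc7c52c2a3efe958c07931fb70c71a2b9028e522a1791547167dc
size 497774208
""")
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:416af33197a2f83d8806112c98ddc85a9180a653004657dc3fe6c88f4fd53e54
size 497774208
""")

same_size = old_pointer["size"] == new_pointer["size"]
same_content = old_pointer["digest"] == new_pointer["digest"]
print(f"same size: {same_size}, same content hash: {same_content}")
```

An unchanged size with a different digest is what one would expect when the weights are updated but the architecture and dtype stay the same. The pointer itself only shows that the stored bytes differ.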

runs/Feb21_22-21-16_7ae86bb689ec/events.out.tfevents.1708554091.7ae86bb689ec.16399.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:6a3018add7fa54f9858a1d4c6b926101bc4af892c09c3ee3f272c6afd339d631
+size 272305

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f9a08032f2eed92242bd9d915afa7987f98694e86b7a6256de5f9e4bec26ce79
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:d0a4bc9829a46c86ff34fac6676549ece25761ead7414de38b0e579b833d5365
 size 4920