Training completed for 6@lr:0.0005!

Files changed (5) hide show

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ml6team/gpt2-small-german-finetune-oscar](https://huggingface.co/ml6team/gpt2-small-german-finetune-oscar) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.4805
 ## Model description
@@ -39,15 +39,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.4964        | 1.0   | 210  | 4.2323          |
-| 3.4962        | 2.0   | 420  | 4.2861          |
-| 2.5871        | 3.0   | 630  | 4.4805          |
 ### Framework versions

 This model is a fine-tuned version of [ml6team/gpt2-small-german-finetune-oscar](https://huggingface.co/ml6team/gpt2-small-german-finetune-oscar) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.6147
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 6
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.1666        | 1.0   | 210  | 4.2622          |
+| 3.1732        | 2.0   | 420  | 4.3527          |
+| 2.3188        | 3.0   | 630  | 4.6424          |
+| 1.7614        | 4.0   | 840  | 5.0776          |
+| 1.5433        | 5.0   | 1050 | 5.4150          |
+| 1.5366        | 6.0   | 1260 | 5.6147          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7c5bece9bd08a867f6a81e4507b0866f7415859a2b38cd1c279969a77bfe2158
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:49682c03f9efc7c52c2a3efe958c07931fb70c71a2b9028e522a1791547167dc
 size 497774208

runs/Feb21_21-52-49_7ae86bb689ec/events.out.tfevents.1708552382.7ae86bb689ec.14183.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:812d72101d76cffd7212788015bac2016d435a9bd436323a84a78717fbcc1a7c
+size 83229

runs/Feb21_21-59-25_7ae86bb689ec/events.out.tfevents.1708552781.7ae86bb689ec.16399.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a82fde0dcc43cc8258221c6f754eae4bdb2672dbbef5becf3409156b2649a6e0
+size 272278

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cf84a81cd823c6c6144fcbfa203d262f2691ac3d4fc6b7445c646cb19ca3fd90
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:f9a08032f2eed92242bd9d915afa7987f98694e86b7a6256de5f9e4bec26ce79
 size 4920