twosegfaults/GPT2.3

Files changed (5) hide show

README.md CHANGED Viewed

@@ -1,6 +1,5 @@
 ---
-license: mit
-base_model: GG2k4/GPT2
 tags:
 - generated_from_trainer
 model-index:
@@ -13,7 +12,7 @@ should probably proofread and complete it, then remove this comment. -->
 # results
-This model is a fine-tuned version of [GG2k4/GPT2](https://huggingface.co/GG2k4/GPT2) on an unknown dataset.
 ## Model description
@@ -38,7 +37,7 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
@@ -48,5 +47,4 @@ The following hyperparameters were used during training:
 - Transformers 4.41.2
 - Pytorch 2.3.0+cu121
-- Datasets 2.20.0
 - Tokenizers 0.19.1

 ---
+base_model: GG2k4/GPT2.1
 tags:
 - generated_from_trainer
 model-index:
 # results
+This model is a fine-tuned version of [GG2k4/GPT2.1](https://huggingface.co/GG2k4/GPT2.1) on an unknown dataset.
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 - Transformers 4.41.2
 - Pytorch 2.3.0+cu121
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "GG2k4/GPT2",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"

 {
+  "_name_or_path": "GG2k4/GPT2.1",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0d6942b9653278fb49b757420b567cf26ea282c60969417c8654da3f747614ad
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:ccb4547c7b422a4f28720b78e59c9fa1b5f183e014a56bf95c1f97c31dc8dd2b
 size 497774208

runs/Jul15_12-19-30_a57dcd05bc52/events.out.tfevents.1721045971.a57dcd05bc52.958.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0bfd6d5441c1c1ae3efcac6f7ccb14a108881cd4d52e0028a2294b636fc24ce6
+size 6249

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4764ca00979c3f7279a32a022125b4310493bdb94bad0d51fde24971da72ba74
-size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:c052fda78b20ffa4e6de5ee86fea32989fa4ade34e7ca77c793492496ea966eb
+size 5112