End of training

Browse files

Files changed (4) hide show

README.md +8 -32
model.safetensors +1 -1
runs/Jun04_12-47-51_5f46b0e4b719/events.out.tfevents.1717505271.5f46b0e4b719.25.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 ---
 license: mit
 tags:
 - generated_from_trainer
-base_model: Aravindan/gpt2out
 model-index:
 - name: gpt2coder-8epochs
   results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Aravindan/gpt2out](https://huggingface.co/Aravindan/gpt2out) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9270
 ## Model description
@@ -38,42 +38,18 @@ The following hyperparameters were used during training:
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 10
-- total_train_batch_size: 80
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 25
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss |
-|:-------------:|:-------:|:----:|:---------------:|
-| No log        | 0.9810  | 31   | 3.2508          |
-| No log        | 1.9937  | 63   | 2.6920          |
-| No log        | 2.9747  | 94   | 2.3769          |
-| No log        | 3.9873  | 126  | 2.1444          |
-| No log        | 5.0     | 158  | 1.9673          |
-| No log        | 5.9810  | 189  | 1.8320          |
-| No log        | 6.9937  | 221  | 1.7097          |
-| No log        | 7.9747  | 252  | 1.6159          |
-| No log        | 8.9873  | 284  | 1.5231          |
-| No log        | 10.0    | 316  | 1.4535          |
-| No log        | 10.9810 | 347  | 1.3788          |
-| No log        | 11.9937 | 379  | 1.3109          |
-| No log        | 12.9747 | 410  | 1.2496          |
-| No log        | 13.9873 | 442  | 1.1989          |
-| No log        | 14.9810 | 465  | 1.1647          |
-| No log        | 15.9937 | 497  | 1.1208          |
-| 1.3856        | 16.9747 | 528  | 1.0841          |
-| 1.3856        | 17.9873 | 560  | 1.0464          |
-| 1.3856        | 19.0    | 592  | 1.0180          |
-| 1.3856        | 19.9810 | 623  | 0.9928          |
-| 1.3856        | 20.9937 | 655  | 0.9689          |
-| 1.3856        | 21.9747 | 686  | 0.9517          |
-| 1.3856        | 22.9873 | 718  | 0.9390          |
-| 1.3856        | 24.0    | 750  | 0.9298          |
-| 1.3856        | 24.7911 | 775  | 0.9270          |
 ### Framework versions

 ---
 license: mit
+base_model: Aravindan/gpt2out
 tags:
 - generated_from_trainer
 model-index:
 - name: gpt2coder-8epochs
   results: []
 This model is a fine-tuned version of [Aravindan/gpt2out](https://huggingface.co/Aravindan/gpt2out) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0309
 ## Model description
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 16
+- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 1
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 2.2389        | 0.9998 | 1739 | 2.0309          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d29ced89a6423e2315431bff8b01bf9452b60607d050fb307ccb939435a8470b
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:db5ed9c70628f43009980c4fd88939d66c16dec74f5d068c1f6ff9a974b1e663
 size 497774208

runs/Jun04_12-47-51_5f46b0e4b719/events.out.tfevents.1717505271.5f46b0e4b719.25.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:079de7dd737ab6143197e7840833df31196b7468c162fa593d659f7067cbadde
+size 6353

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:08ead8923aaaf05f683a0f272f9e8c5877c5d68113c2f6f34fbc69f833fffcc0
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:58c41c320c81744d1d2af38cb685c1228f0782b20aeb38f5c3a9cd0b7042c804
 size 5112