esekeroglu commited on
Commit
70a992a
·
verified ·
1 Parent(s): e69bb84

Training in progress, epoch 1

Browse files
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.6346
22
- - F1: 0.8985
23
 
24
  ## Model description
25
 
@@ -39,22 +39,24 @@ More information needed
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 7e-05
42
- - train_batch_size: 32
43
- - eval_batch_size: 16
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
- - num_epochs: 5
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|
53
- | No log | 1.0 | 29 | 0.7395 | 0.8302 |
54
- | No log | 2.0 | 58 | 0.5699 | 0.8987 |
55
- | No log | 3.0 | 87 | 0.6828 | 0.8906 |
56
- | 0.6144 | 4.0 | 116 | 0.6353 | 0.8770 |
57
- | 0.6144 | 5.0 | 145 | 0.6346 | 0.8985 |
 
 
58
 
59
 
60
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.3117
22
+ - F1: 0.9342
23
 
24
  ## Model description
25
 
 
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 7e-05
42
+ - train_batch_size: 64
43
+ - eval_batch_size: 32
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 7
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|
53
+ | No log | 1.0 | 15 | 0.6936 | 0.8593 |
54
+ | No log | 2.0 | 30 | 0.4958 | 0.8719 |
55
+ | No log | 3.0 | 45 | 0.3710 | 0.9093 |
56
+ | No log | 4.0 | 60 | 0.3575 | 0.8912 |
57
+ | No log | 5.0 | 75 | 0.3046 | 0.9366 |
58
+ | No log | 6.0 | 90 | 0.3042 | 0.9398 |
59
+ | 0.4101 | 7.0 | 105 | 0.3117 | 0.9342 |
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d6912ac04bf0717425b8dd1a9c9c8791d65600d6bae93eb4f9cfaac7f425ec66
3
  size 598513616
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9bcc69a58bc0f8dc252e62192669c691e76e143e54fdcb904b9bbf26d9605d16
3
  size 598513616
runs/Jan29_17-13-28_EmreSekeroglu/events.out.tfevents.1738160009.EmreSekeroglu.19740.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f1279d9b744dbcb7c06dd141fa6cc2a876ee184b6229046d8e5544719e62e0de
3
- size 9307
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bfe1e0603ea7daa9d04a69e83adf01f261350d908aca2af7c987c8c98ab396f6
3
+ size 9966
runs/Jan30_09-16-10_EmreSekeroglu/events.out.tfevents.1738217783.EmreSekeroglu.13376.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7848524ca237acc3b0eb8cad993638acb80619144fca9edd3224c42c8880e39a
3
+ size 7545
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8e95e19a817d3011d4bb3203400350348cf0a4860fc1b98057aa7ae985734388
3
  size 5432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a6873c8859468c9d230069d4c512c274f8f1fe691d2d3a85503b5f41eb253b02
3
  size 5432