Commit d3bb2e8 (verified) by Nared45 · 1 parent: 3dd1bc3

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.6586
+ - Loss: 0.4176
 
  ## Model description
 
@@ -34,8 +34,8 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 0.0003
- - train_batch_size: 8
+ - learning_rate: 1e-05
+ - train_batch_size: 16
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -47,16 +47,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.6934 | 1.0 | 198 | 0.7427 |
- | 0.6614 | 2.0 | 396 | 0.6910 |
- | 0.6085 | 3.0 | 594 | 0.6586 |
- | 0.7075 | 4.0 | 792 | 0.6990 |
- | 0.6757 | 5.0 | 990 | 0.6594 |
- | 0.6255 | 6.0 | 1188 | 0.6685 |
- | 0.6672 | 7.0 | 1386 | 0.6613 |
- | 0.63 | 8.0 | 1584 | 0.6626 |
- | 0.6864 | 9.0 | 1782 | 0.6603 |
- | 0.5947 | 10.0 | 1980 | 0.6610 |
+ | 0.6749 | 1.0 | 99 | 0.6492 |
+ | 0.5996 | 2.0 | 198 | 0.5918 |
+ | 0.6368 | 3.0 | 297 | 0.5676 |
+ | 0.5453 | 4.0 | 396 | 0.4176 |
+ | 0.2549 | 5.0 | 495 | 0.4746 |
+ | 0.2518 | 6.0 | 594 | 0.5545 |
+ | 0.1891 | 7.0 | 693 | 0.7260 |
+ | 0.1042 | 8.0 | 792 | 0.9964 |
+ | 0.056 | 9.0 | 891 | 1.0677 |
+ | 0.0109 | 10.0 | 990 | 0.9208 |
 
 
  ### Framework versions
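
Review note on the README.md change: in both runs the reported evaluation loss equals the lowest validation loss in the table (0.6586 at epoch 3 before, 0.4176 at epoch 4 after). In the new run validation loss rises from epoch 5 onward while training loss keeps falling, which is the usual overfitting pattern. The diff does not say how the reported checkpoint was chosen, so the best-epoch selection below is only a sketch. The step counts also bound the training-set size, assuming a single device, no gradient accumulation, and a kept final partial batch (none of which the diff confirms). The helper names `best_epoch` and `dataset_size_range` are hypothetical.

```python
# (epoch, step, training_loss, validation_loss), copied from the updated table.
NEW_RUN = [
    (1, 99, 0.6749, 0.6492),
    (2, 198, 0.5996, 0.5918),
    (3, 297, 0.6368, 0.5676),
    (4, 396, 0.5453, 0.4176),
    (5, 495, 0.2549, 0.4746),
    (6, 594, 0.2518, 0.5545),
    (7, 693, 0.1891, 0.7260),
    (8, 792, 0.1042, 0.9964),
    (9, 891, 0.0560, 1.0677),
    (10, 990, 0.0109, 0.9208),
]


def best_epoch(rows):
    """Return the table row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def dataset_size_range(steps_per_epoch, batch_size):
    """Inclusive range of sizes N with ceil(N / batch_size) == steps_per_epoch.

    Assumes one optimizer step per batch and a kept final partial batch.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


best = best_epoch(NEW_RUN)
print(f"best epoch: {best[0]} (validation loss {best[3]:.4f})")

# Old run: 198 steps per epoch at batch size 8.
# New run: 99 steps per epoch at batch size 16.
old_low, old_high = dataset_size_range(198, 8)
new_low, new_high = dataset_size_range(99, 16)
size_low, size_high = max(old_low, new_low), min(old_high, new_high)
print(f"training-set sizes consistent with both runs: {size_low}..{size_high}")
```

Under those assumptions, halving the steps per epoch (198 to 99) is exactly what doubling train_batch_size from 8 to 16 predicts, so the two runs appear to use the same training set.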
logs/events.out.tfevents.1711732343.5979db352813.13693.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b199691895a7d6a3c42d3b3eacd0947eb6499b51825f848b0f5fcaf3c66677e6
+ size 28508
logs/events.out.tfevents.1711732603.5979db352813.13693.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c08724bdcb7a2d2c95db8a274b630fd43df7affb67c2fb9b7020d46832eab733
+ size 311
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fe642c7e676fb0ee571b36a028808397c8224e7412d247181e78d48f80fe2b65
+ oid sha256:3c1494b7fb61adb8f60cfe1a9d616fcfeb4154581a1fbffae8cc7d5d70c2ff96
  size 498612824
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2a70241133a4a3fed4f6c481e6ec1b38605b75fe49a89780ccda47d81f6ac8ed
+ oid sha256:a553c2bae29a897b3a6b6b63e5c542a9686b5d7278e0a49d1192c67086309da3
  size 4920
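
Review note on the binary files: the diffs for model.safetensors, training_args.bin, and the log files show Git LFS pointer files, not the binaries themselves. Each pointer is three `key value` lines (`version`, `oid`, `size`). For model.safetensors and training_args.bin the `oid` changes while `size` stays the same, which is consistent with new contents of identical length. A minimal sketch of reading such a pointer, using the new model.safetensors pointer from this commit as input; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by a single space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# New pointer for model.safetensors, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3c1494b7fb61adb8f60cfe1a9d616fcfeb4154581a1fbffae8cc7d5d70c2ff96
size 498612824
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```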