laurafcamargos committed on
Commit
6bc2dd4
verified
1 Parent(s): 2d560cc

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 7.1301
+ - Loss: 8.5370
 
  ## Model description
 
@@ -46,56 +46,56 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | No log | 1.0 | 9 | 5.7169 |
- | No log | 2.0 | 18 | 4.9843 |
- | No log | 3.0 | 27 | 4.2994 |
- | No log | 4.0 | 36 | 3.6410 |
- | No log | 5.0 | 45 | 3.4156 |
- | No log | 6.0 | 54 | 3.3735 |
- | No log | 7.0 | 63 | 3.4014 |
- | No log | 8.0 | 72 | 3.4605 |
- | No log | 9.0 | 81 | 3.6386 |
- | No log | 10.0 | 90 | 3.8670 |
- | No log | 11.0 | 99 | 3.9164 |
- | No log | 12.0 | 108 | 4.0859 |
- | No log | 13.0 | 117 | 4.3789 |
- | No log | 14.0 | 126 | 4.6237 |
- | No log | 15.0 | 135 | 4.3232 |
- | No log | 16.0 | 144 | 4.8507 |
- | No log | 17.0 | 153 | 4.9674 |
- | No log | 18.0 | 162 | 4.8131 |
- | No log | 19.0 | 171 | 4.9395 |
- | No log | 20.0 | 180 | 5.3052 |
- | No log | 21.0 | 189 | 5.2734 |
- | No log | 22.0 | 198 | 5.5004 |
- | No log | 23.0 | 207 | 5.7227 |
- | No log | 24.0 | 216 | 5.7561 |
- | No log | 25.0 | 225 | 5.9641 |
- | No log | 26.0 | 234 | 5.8868 |
- | No log | 27.0 | 243 | 6.2444 |
- | No log | 28.0 | 252 | 6.3476 |
- | No log | 29.0 | 261 | 6.3710 |
- | No log | 30.0 | 270 | 6.1785 |
- | No log | 31.0 | 279 | 6.5052 |
- | No log | 32.0 | 288 | 6.5157 |
- | No log | 33.0 | 297 | 6.6968 |
- | No log | 34.0 | 306 | 6.8228 |
- | No log | 35.0 | 315 | 6.7054 |
- | No log | 36.0 | 324 | 6.8514 |
- | No log | 37.0 | 333 | 6.7913 |
- | No log | 38.0 | 342 | 6.6872 |
- | No log | 39.0 | 351 | 7.0705 |
- | No log | 40.0 | 360 | 7.1790 |
- | No log | 41.0 | 369 | 7.0094 |
- | No log | 42.0 | 378 | 7.0502 |
- | No log | 43.0 | 387 | 7.3836 |
- | No log | 44.0 | 396 | 7.3730 |
- | No log | 45.0 | 405 | 7.0853 |
- | No log | 46.0 | 414 | 7.0111 |
- | No log | 47.0 | 423 | 6.9992 |
- | No log | 48.0 | 432 | 7.1171 |
- | No log | 49.0 | 441 | 7.1296 |
- | No log | 50.0 | 450 | 7.1301 |
+ | No log | 1.0 | 9 | 5.7747 |
+ | No log | 2.0 | 18 | 5.1320 |
+ | No log | 3.0 | 27 | 4.4223 |
+ | No log | 4.0 | 36 | 3.7926 |
+ | No log | 5.0 | 45 | 3.5728 |
+ | No log | 6.0 | 54 | 3.4798 |
+ | No log | 7.0 | 63 | 3.5940 |
+ | No log | 8.0 | 72 | 3.6602 |
+ | No log | 9.0 | 81 | 3.8127 |
+ | No log | 10.0 | 90 | 4.0669 |
+ | No log | 11.0 | 99 | 4.4318 |
+ | No log | 12.0 | 108 | 4.8726 |
+ | No log | 13.0 | 117 | 5.0592 |
+ | No log | 14.0 | 126 | 5.2350 |
+ | No log | 15.0 | 135 | 5.4262 |
+ | No log | 16.0 | 144 | 5.5565 |
+ | No log | 17.0 | 153 | 5.7865 |
+ | No log | 18.0 | 162 | 5.8720 |
+ | No log | 19.0 | 171 | 5.9391 |
+ | No log | 20.0 | 180 | 6.1645 |
+ | No log | 21.0 | 189 | 6.4897 |
+ | No log | 22.0 | 198 | 6.5788 |
+ | No log | 23.0 | 207 | 6.5259 |
+ | No log | 24.0 | 216 | 6.8982 |
+ | No log | 25.0 | 225 | 6.6006 |
+ | No log | 26.0 | 234 | 6.7825 |
+ | No log | 27.0 | 243 | 6.9124 |
+ | No log | 28.0 | 252 | 7.1512 |
+ | No log | 29.0 | 261 | 7.0198 |
+ | No log | 30.0 | 270 | 7.2059 |
+ | No log | 31.0 | 279 | 7.5268 |
+ | No log | 32.0 | 288 | 7.5621 |
+ | No log | 33.0 | 297 | 7.5901 |
+ | No log | 34.0 | 306 | 7.8031 |
+ | No log | 35.0 | 315 | 8.0067 |
+ | No log | 36.0 | 324 | 8.0148 |
+ | No log | 37.0 | 333 | 8.0671 |
+ | No log | 38.0 | 342 | 8.0369 |
+ | No log | 39.0 | 351 | 8.1114 |
+ | No log | 40.0 | 360 | 8.3032 |
+ | No log | 41.0 | 369 | 8.5288 |
+ | No log | 42.0 | 378 | 8.3833 |
+ | No log | 43.0 | 387 | 8.2010 |
+ | No log | 44.0 | 396 | 8.4152 |
+ | No log | 45.0 | 405 | 8.5713 |
+ | No log | 46.0 | 414 | 8.4443 |
+ | No log | 47.0 | 423 | 8.3390 |
+ | No log | 48.0 | 432 | 8.4887 |
+ | No log | 49.0 | 441 | 8.5506 |
+ | No log | 50.0 | 450 | 8.5370 |
 
 
  ### Framework versions
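Note on the README.md change: in the new table the validation loss bottoms out at epoch 6 (3.4798) and climbs to 8.5370 by epoch 50, which is the value reported as the evaluation loss. A minimal sketch, assuming the table is available as Markdown text, of locating the row with the lowest validation loss. `best_epoch` is a hypothetical helper for illustration, not part of the training script behind this commit.

```python
def best_epoch(table_md: str) -> tuple[float, float]:
    """Return (epoch, validation_loss) for the row with the lowest validation loss.

    Assumes the four-column layout used in the README:
    Training Loss | Epoch | Step | Validation Loss
    """
    rows = []
    for line in table_md.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 4:
            continue
        try:
            epoch, loss = float(cells[1]), float(cells[3])
        except ValueError:
            continue  # header row or alignment row
        rows.append((epoch, loss))
    if not rows:
        raise ValueError("no data rows found")
    return min(rows, key=lambda row: row[1])


# A few rows copied from the new side of the diff.
sample = """| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| No log | 5.0 | 45 | 3.5728 |
| No log | 6.0 | 54 | 3.4798 |
| No log | 7.0 | 63 | 3.5940 |
| No log | 50.0 | 450 | 8.5370 |"""

print(best_epoch(sample))
```

In a `transformers` `Trainer` setup, `load_best_model_at_end=True` together with `metric_for_best_model="eval_loss"` would keep the checkpoint from such an epoch; whether this run used those options is not visible in the diff.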
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:75fdef8acdba16ef25a676aa9a3a3168bd87864e1425df007dd873855b22b6b1
+ oid sha256:458ec18ab5f7ca2a5e8024ec7ce650fd217085d9599024f029dfed8f51e5b31b
  size 265470032
runs/Mar11_18-38-47_0d15f6d702d9/events.out.tfevents.1710182327.0d15f6d702d9.655.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:afc4a51c060d604d42a8a560656cfa79f0b6c5c152fd7aea5e4089af44d62a14
+ size 18298
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:81acf28093226711fa6917f5e6d27ae76f2dd4cc2f1ef439a9cd8596cb720985
+ oid sha256:3df055e62dcaa104f7ddc77ef6374e340215ebc940e29d6042a94e46bd7ca9e3
  size 4920
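
Note on the LFS entries: the binary files in this commit are stored as Git LFS pointer files, each recording the spec version, a SHA-256 object ID, and the size in bytes. A minimal sketch, assuming the pointer text and the downloaded file contents are both at hand, of checking a blob against its pointer. `parse_pointer` and `matches_pointer` are hypothetical names for illustration, not part of any Git LFS tooling.

```python
import hashlib


def parse_pointer(text: str) -> dict[str, str]:
    """Split a Git LFS pointer file into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(pointer_text: str, data: bytes) -> bool:
    """Check that data has the size and SHA-256 digest the pointer records."""
    fields = parse_pointer(pointer_text)
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    return (
        int(fields["size"]) == len(data)
        and hashlib.sha256(data).hexdigest() == digest
    )


# Pointer for training_args.bin, copied from the new side of the diff.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:3df055e62dcaa104f7ddc77ef6374e340215ebc940e29d6042a94e46bd7ca9e3
size 4920"""

print(parse_pointer(pointer)["size"])
```

Comparing the size first is cheap and catches truncated downloads before the digest is consulted.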