Training in progress, step 2862

Files changed (5) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3426
-- Accuracy: 0.9481
 ## Model description
@@ -50,21 +50,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 3.8162        | 1.0   | 318  | 2.8321          | 0.7381   |
-| 2.1655        | 2.0   | 636  | 1.4157          | 0.8658   |
-| 1.0801        | 3.0   | 954  | 0.7461          | 0.9142   |
-| 0.568         | 4.0   | 1272 | 0.4911          | 0.9319   |
-| 0.3528        | 5.0   | 1590 | 0.3975          | 0.9397   |
-| 0.2606        | 6.0   | 1908 | 0.3712          | 0.9403   |
-| 0.2195        | 7.0   | 2226 | 0.3510          | 0.9471   |
-| 0.1971        | 8.0   | 2544 | 0.3467          | 0.9468   |
-| 0.1862        | 9.0   | 2862 | 0.3450          | 0.9468   |
-| 0.181         | 10.0  | 3180 | 0.3426          | 0.9481   |
 ### Framework versions
-- Transformers 4.47.0
-- Pytorch 2.5.1
 - Datasets 3.2.0
 - Tokenizers 0.21.0

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3223
+- Accuracy: 0.9461
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 2.8424        | 1.0   | 318  | 2.0795          | 0.7271   |
+| 1.6103        | 2.0   | 636  | 1.0650          | 0.8577   |
+| 0.8466        | 3.0   | 954  | 0.6074          | 0.9135   |
+| 0.4999        | 4.0   | 1272 | 0.4376          | 0.9310   |
+| 0.3539        | 5.0   | 1590 | 0.3770          | 0.9397   |
+| 0.2899        | 6.0   | 1908 | 0.3515          | 0.9419   |
+| 0.2589        | 7.0   | 2226 | 0.3353          | 0.9448   |
+| 0.2418        | 8.0   | 2544 | 0.3276          | 0.9458   |
+| 0.2319        | 9.0   | 2862 | 0.3234          | 0.9458   |
+| 0.2284        | 10.0  | 3180 | 0.3223          | 0.9461   |
 ### Framework versions
+- Transformers 4.47.1
+- Pytorch 2.5.1+cu124
 - Datasets 3.2.0
 - Tokenizers 0.21.0

config.json CHANGED Viewed

@@ -326,6 +326,6 @@
   "sinusoidal_pos_embds": false,
   "tie_weights_": true,
   "torch_dtype": "float32",
-  "transformers_version": "4.47.0",
   "vocab_size": 30522
 }

   "sinusoidal_pos_embds": false,
   "tie_weights_": true,
   "torch_dtype": "float32",
+  "transformers_version": "4.47.1",
   "vocab_size": 30522
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f4240220d56196762389df661777764ee3c6043cd6f318ecf708e8d3b1562e00
 size 268290900

 version https://git-lfs.github.com/spec/v1
+oid sha256:794fad7b47b0fe92673e092e17f8aea9beceeccdfe377212201cac5f6df3aeac
 size 268290900

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e1e18ba03519659b51c75020bdadad361164ce1bc7484d24ad53d560a985d230
 size 5432

 version https://git-lfs.github.com/spec/v1
+oid sha256:2a726f8484aefbc7550edb1167a34362de730f8d4644e045d57fe4d3ea85e937
 size 5432