Training in progress epoch 17

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,9 +13,9 @@ probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 1.2292
-- Validation Loss: 1.0824
-- Epoch: 16
 ## Model description
@@ -42,6 +42,7 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 1.2292     | 1.0824          | 16    |
 ### Framework versions

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 1.2273
+- Validation Loss: 1.0844
+- Epoch: 17
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 1.2292     | 1.0824          | 16    |
+| 1.2273     | 1.0844          | 17    |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:783af7d97a235cb94cb3870acbc91992f3222e81e4568b9fc8a286c5fcc12e4b
 size 373902664

 version https://git-lfs.github.com/spec/v1
+oid sha256:81611ffd21b0a3c789e7a7e5486124f175c91a294d2622ee7eba0e4b2fe731ec
 size 373902664

tokenizer.json CHANGED Viewed

@@ -2,14 +2,12 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 128,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
-    "strategy": {
-      "Fixed": 128
-    },
     "direction": "Right",
     "pad_to_multiple_of": null,
     "pad_id": 0,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 512,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
+    "strategy": "BatchLongest",
     "direction": "Right",
     "pad_to_multiple_of": null,
     "pad_id": 0,

training_args.json CHANGED Viewed

	@@ -1 +1 @@
1	- {"last_epoch": 15}


1	+ {"last_epoch": 16}