Training in progress epoch 0

Browse files

Files changed (5) hide show

README.md +9 -11
tf_model.h5 +1 -1
tokenizer.json +16 -2
train/events.out.tfevents.1693462412.c2ae3b5698e2.634.0.v2 +3 -0
validation/events.out.tfevents.1693463490.c2ae3b5698e2.634.1.v2 +3 -0

README.md CHANGED Viewed

@@ -15,13 +15,13 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.5191
-- Train End Logits Accuracy: 0.8441
-- Train Start Logits Accuracy: 0.8831
-- Validation Loss: 0.4803
-- Validation End Logits Accuracy: 0.8492
-- Validation Start Logits Accuracy: 0.9059
-- Epoch: 2
 ## Model description
@@ -40,16 +40,14 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 1359, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Train End Logits Accuracy | Train Start Logits Accuracy | Validation Loss | Validation End Logits Accuracy | Validation Start Logits Accuracy | Epoch |
 |:----------:|:-------------------------:|:---------------------------:|:---------------:|:------------------------------:|:--------------------------------:|:-----:|
-| 1.5356     | 0.6202                    | 0.6584                      | 0.5280          | 0.8441                         | 0.8892                           | 0     |
-| 0.7416     | 0.7874                    | 0.8262                      | 0.4745          | 0.8595                         | 0.8982                           | 1     |
-| 0.5191     | 0.8441                    | 0.8831                      | 0.4803          | 0.8492                         | 0.9059                           | 2     |
 ### Framework versions

 This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 1.4994
+- Train End Logits Accuracy: 0.6497
+- Train Start Logits Accuracy: 0.6777
+- Validation Loss: 0.4953
+- Validation End Logits Accuracy: 0.8479
+- Validation Start Logits Accuracy: 0.8982
+- Epoch: 0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 2412, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Train End Logits Accuracy | Train Start Logits Accuracy | Validation Loss | Validation End Logits Accuracy | Validation Start Logits Accuracy | Epoch |
 |:----------:|:-------------------------:|:---------------------------:|:---------------:|:------------------------------:|:--------------------------------:|:-----:|
+| 1.4994     | 0.6497                    | 0.6777                      | 0.4953          | 0.8479                         | 0.8982                           | 0     |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b876a8347bdcc74c2236424bec34a870196e8104a06d6465e26e5c2505d8fb12
 size 709326800

 version https://git-lfs.github.com/spec/v1
+oid sha256:14fd3a01f3eb689b2e64d16dff701c04195aeb529a96cefd40a7ae9814809f6b
 size 709326800

tokenizer.json CHANGED Viewed

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 384,
+    "strategy": "OnlySecond",
+    "stride": 128
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 384
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 0,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,

train/events.out.tfevents.1693462412.c2ae3b5698e2.634.0.v2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8e6c7ef04d22d4275370d799876db5638d3ce22ebf71d030acd3e8898cd9169b
+size 2645337

validation/events.out.tfevents.1693463490.c2ae3b5698e2.634.1.v2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:395bba08ab2dd8f1a555771b30664ced947948cac28ce737c18115fa262cf078
+size 604