gokuls
/

hBERTv2_new_pretrain_cola

@@ -1,6 +1,4 @@
 ---
-language:
-- en
 tags:
 - generated_from_trainer
 datasets:
@@ -14,7 +12,7 @@ model-index:
       name: Text Classification
       type: text-classification
     dataset:
-      name: GLUE COLA
       type: glue
       config: cola
       split: validation
@@ -30,9 +28,9 @@ should probably proofread and complete it, then remove this comment. -->
 # hBERTv2_new_pretrain_cola
-This model is a fine-tuned version of [gokuls/bert_12_layer_model_v2_complete_training_new](https://huggingface.co/gokuls/bert_12_layer_model_v2_complete_training_new) on the GLUE COLA dataset.
 It achieves the following results on the evaluation set:
-- Loss: 9.8472
 - Matthews Correlation: 0.0
 ## Model description
@@ -52,7 +50,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
@@ -65,12 +63,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
 |:-------------:|:-----:|:----:|:---------------:|:--------------------:|
-| 225.85        | 1.0   | 67   | 9.8472          | 0.0                  |
-| 322.5451      | 2.0   | 134  | 176.9664        | 0.0                  |
-| 88.5896       | 3.0   | 201  | 127.8706        | 0.0                  |
-| 121.8729      | 4.0   | 268  | 106.4564        | 0.0                  |
-| 99.3696       | 5.0   | 335  | 85.7181         | 0.0                  |
-| 87.2883       | 6.0   | 402  | 79.4784         | 0.0                  |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 datasets:
       name: Text Classification
       type: text-classification
     dataset:
+      name: glue
       type: glue
       config: cola
       split: validation
 # hBERTv2_new_pretrain_cola
+This model is a fine-tuned version of [gokuls/bert_12_layer_model_v2_complete_training_new](https://huggingface.co/gokuls/bert_12_layer_model_v2_complete_training_new) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6200
 - Matthews Correlation: 0.0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
 |:-------------:|:-----:|:----:|:---------------:|:--------------------:|
+| 0.6294        | 1.0   | 67   | 0.6236          | 0.0                  |
+| 0.6169        | 2.0   | 134  | 0.6312          | 0.0                  |
+| 0.6115        | 3.0   | 201  | 0.6173          | 0.0                  |
+| 0.6372        | 4.0   | 268  | 0.6201          | 0.0                  |
+| 0.6087        | 5.0   | 335  | 0.6217          | 0.0                  |
+| 0.6086        | 6.0   | 402  | 0.6248          | 0.0                  |
+| 0.6113        | 7.0   | 469  | 0.6283          | 0.0                  |
+| 0.6109        | 8.0   | 536  | 0.6200          | 0.0                  |
 ### Framework versions