End of training

Files changed (4)

README.md +53 -53
model.safetensors +1 -1
runs/Jan17_17-22-11_38251c6f7091/events.out.tfevents.1705512141.38251c6f7091.441.0 +3 -0
training_args.bin +2 -2

README.md CHANGED

@@ -4,10 +4,10 @@ base_model: distilbert-base-uncased
 tags:
 - generated_from_trainer
 metrics:
-- accuracy
 - precision
 - recall
 - f1
 model-index:
 - name: DIALOGUE_overfit_check
   results: []
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2184
-- Accuracy: 0.9737
 - Precision: 0.9762
 - Recall: 0.9737
 - F1: 0.9736
 ## Model description
@@ -53,56 +53,56 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 1.0414        | 0.62  | 30   | 0.5042          | 0.9211   | 0.9327    | 0.9211 | 0.9204 |
-| 0.3868        | 1.25  | 60   | 0.1559          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.1218        | 1.88  | 90   | 0.1743          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0363        | 2.5   | 120  | 0.1189          | 0.9474   | 0.9524    | 0.9474 | 0.9472 |
-| 0.0127        | 3.12  | 150  | 0.1455          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0077        | 3.75  | 180  | 0.1457          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.005         | 4.38  | 210  | 0.1587          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0039        | 5.0   | 240  | 0.1620          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0031        | 5.62  | 270  | 0.1667          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0026        | 6.25  | 300  | 0.1696          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0022        | 6.88  | 330  | 0.1768          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0019        | 7.5   | 360  | 0.1802          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0016        | 8.12  | 390  | 0.1811          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0015        | 8.75  | 420  | 0.1834          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0013        | 9.38  | 450  | 0.1872          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0012        | 10.0  | 480  | 0.1890          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0011        | 10.62 | 510  | 0.1924          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.001         | 11.25 | 540  | 0.1940          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0009        | 11.88 | 570  | 0.1967          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0008        | 12.5  | 600  | 0.1982          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0008        | 13.12 | 630  | 0.1995          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0008        | 13.75 | 660  | 0.2009          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0007        | 14.38 | 690  | 0.2023          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0007        | 15.0  | 720  | 0.2039          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0006        | 15.62 | 750  | 0.2049          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0006        | 16.25 | 780  | 0.2064          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0006        | 16.88 | 810  | 0.2075          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0005        | 17.5  | 840  | 0.2087          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0005        | 18.12 | 870  | 0.2101          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0005        | 18.75 | 900  | 0.2110          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0005        | 19.38 | 930  | 0.2116          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0005        | 20.0  | 960  | 0.2122          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 20.62 | 990  | 0.2130          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 21.25 | 1020 | 0.2138          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 21.88 | 1050 | 0.2143          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 22.5  | 1080 | 0.2146          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 23.12 | 1110 | 0.2152          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 23.75 | 1140 | 0.2158          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 24.38 | 1170 | 0.2162          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 25.0  | 1200 | 0.2167          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 25.62 | 1230 | 0.2170          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 26.25 | 1260 | 0.2174          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0003        | 26.88 | 1290 | 0.2177          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0003        | 27.5  | 1320 | 0.2179          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0003        | 28.12 | 1350 | 0.2181          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0003        | 28.75 | 1380 | 0.2183          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0003        | 29.38 | 1410 | 0.2183          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
-| 0.0004        | 30.0  | 1440 | 0.2184          | 0.9737   | 0.9762    | 0.9737 | 0.9736 |
 ### Framework versions

 tags:
 - generated_from_trainer
 metrics:
 - precision
 - recall
 - f1
+- accuracy
 model-index:
 - name: DIALOGUE_overfit_check
   results: []
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1840
 - Precision: 0.9762
 - Recall: 0.9737
 - F1: 0.9736
+- Accuracy: 0.9737
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 1.0266        | 0.62  | 30   | 0.5087          | 1.0       | 1.0    | 1.0    | 1.0      |
+| 0.4009        | 1.25  | 60   | 0.1389          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.1301        | 1.88  | 90   | 0.1436          | 0.9637    | 0.9605 | 0.9604 | 0.9605   |
+| 0.0342        | 2.5   | 120  | 0.1055          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0288        | 3.12  | 150  | 0.1395          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0099        | 3.75  | 180  | 0.1259          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0057        | 4.38  | 210  | 0.1315          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0042        | 5.0   | 240  | 0.1338          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0033        | 5.62  | 270  | 0.1373          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0027        | 6.25  | 300  | 0.1403          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0024        | 6.88  | 330  | 0.1457          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.002         | 7.5   | 360  | 0.1483          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0017        | 8.12  | 390  | 0.1483          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0016        | 8.75  | 420  | 0.1503          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0014        | 9.38  | 450  | 0.1535          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0013        | 10.0  | 480  | 0.1546          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0012        | 10.62 | 510  | 0.1576          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0011        | 11.25 | 540  | 0.1593          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.001         | 11.88 | 570  | 0.1672          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0009        | 12.5  | 600  | 0.1686          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0008        | 13.12 | 630  | 0.1696          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0008        | 13.75 | 660  | 0.1696          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0007        | 14.38 | 690  | 0.1702          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0007        | 15.0  | 720  | 0.1711          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0006        | 15.62 | 750  | 0.1716          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0006        | 16.25 | 780  | 0.1726          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0006        | 16.88 | 810  | 0.1731          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0006        | 17.5  | 840  | 0.1744          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0006        | 18.12 | 870  | 0.1762          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0005        | 18.75 | 900  | 0.1773          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0005        | 19.38 | 930  | 0.1777          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0005        | 20.0  | 960  | 0.1781          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0005        | 20.62 | 990  | 0.1785          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 21.25 | 1020 | 0.1795          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 21.88 | 1050 | 0.1801          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 22.5  | 1080 | 0.1805          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 23.12 | 1110 | 0.1812          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 23.75 | 1140 | 0.1818          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 24.38 | 1170 | 0.1821          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 25.0  | 1200 | 0.1824          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 25.62 | 1230 | 0.1827          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 26.25 | 1260 | 0.1831          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 26.88 | 1290 | 0.1833          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 27.5  | 1320 | 0.1836          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 28.12 | 1350 | 0.1838          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 28.75 | 1380 | 0.1839          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 29.38 | 1410 | 0.1840          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
+| 0.0004        | 30.0  | 1440 | 0.1840          | 0.9762    | 0.9737 | 0.9736 | 0.9737   |
 ### Framework versions
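
In every row of the training results table above, Recall and Accuracy are identical, which is the pattern support-weighted averaging produces. The card does not state which averaging mode was used, so the following is only a sketch under that assumption. The function name `weighted_metrics` and the toy labels are hypothetical and are not taken from the training script.

```python
from collections import Counter


def weighted_metrics(y_true, y_pred):
    """Support-weighted precision, recall and F1, plus plain accuracy.

    Assumption: weighted averaging (each class weighted by its share of
    the true labels). The model card does not confirm this.
    """
    n = len(y_true)
    support = Counter(y_true)  # number of true examples per class
    precision = recall = f1 = 0.0
    for label, count in support.items():
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        predicted = sum(1 for p in y_pred if p == label)
        p_c = tp / predicted if predicted else 0.0
        r_c = tp / count
        f_c = 2 * p_c * r_c / (p_c + r_c) if (p_c + r_c) else 0.0
        weight = count / n
        precision += weight * p_c
        recall += weight * r_c
        f1 += weight * f_c
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


# Hypothetical toy labels, not data from this run.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
scores = weighted_metrics(y_true, y_pred)
```

Weighted recall sums `(count / n) * (tp / count)` over classes, which collapses to total correct predictions divided by `n`, so it always equals accuracy. Precision and F1 are free to differ, as they do in the table.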

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d8b7b048e6173bffa0feccfe2ca99bd9e8085335bd4fd6bced3f3594aec77210
 size 267838720

 version https://git-lfs.github.com/spec/v1
+oid sha256:0762e4c56734675118584bda80674b880a16381e8a3363eb60e041a924f0c132
 size 267838720

runs/Jan17_17-22-11_38251c6f7091/events.out.tfevents.1705512141.38251c6f7091.441.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e1e03de68573c10a0a4286f98bba917b30e825d3700bef2da79d0f8eb01aae87
+size 34963

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8e206c762958ccd14dd8e02e0b778f3afab4476f3dd156847c274978d4a937d3
-size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:01d55654f54e1149e5d0ae0246749a4a5b172c104ba2d02df269e8fe382c803b
+size 4664
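
The three binary files in this commit appear only as Git LFS pointer files: a `version` line, an `oid sha256:` digest, and a `size` in bytes. As a minimal sketch of how a downloaded file could be checked against such a pointer, the helpers below parse the pointer text and compare size and SHA-256. The function names are hypothetical, and the example reuses the pointer of the added TensorBoard event file.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the raw bytes have the size and SHA-256 the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError("unsupported oid algorithm: " + pointer["algo"])
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer of the added events.out.tfevents file, copied from the diff above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:e1e03de68573c10a0a4286f98bba917b30e825d3700bef2da79d0f8eb01aae87
size 34963
"""
pointer = parse_lfs_pointer(pointer_text)
```

For a file as large as `model.safetensors` (267838720 bytes), it would be more practical to feed the hash in chunks with `hashlib.sha256().update(...)` than to hold the whole file in memory.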