JT000 committed on
Commit f2dee42 · verified · 1 Parent(s): a7e146a

End of training

Files changed (2)
  1. README.md +41 -41
  2. model.safetensors +1 -1
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [uer/gpt2-chinese-cluecorpussmall](https://huggingface.co/uer/gpt2-chinese-cluecorpussmall) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.1152
+ - Loss: 0.1071
 
  ## Model description
 
@@ -47,46 +47,46 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | No log | 1.0 | 13 | 0.7769 |
- | No log | 2.0 | 26 | 0.7101 |
- | No log | 3.0 | 39 | 0.6086 |
- | No log | 4.0 | 52 | 0.4720 |
- | No log | 5.0 | 65 | 0.3012 |
- | No log | 6.0 | 78 | 0.1616 |
- | No log | 7.0 | 91 | 0.1318 |
- | No log | 8.0 | 104 | 0.1268 |
- | No log | 9.0 | 117 | 0.1236 |
- | No log | 10.0 | 130 | 0.1225 |
- | No log | 11.0 | 143 | 0.1218 |
- | No log | 12.0 | 156 | 0.1216 |
- | No log | 13.0 | 169 | 0.1178 |
- | No log | 14.0 | 182 | 0.1169 |
- | No log | 15.0 | 195 | 0.1161 |
- | No log | 16.0 | 208 | 0.1137 |
- | No log | 17.0 | 221 | 0.1138 |
- | No log | 18.0 | 234 | 0.1155 |
- | No log | 19.0 | 247 | 0.1094 |
- | No log | 20.0 | 260 | 0.1100 |
- | No log | 21.0 | 273 | 0.1067 |
- | No log | 22.0 | 286 | 0.1117 |
- | No log | 23.0 | 299 | 0.1089 |
- | No log | 24.0 | 312 | 0.1060 |
- | No log | 25.0 | 325 | 0.1090 |
- | No log | 26.0 | 338 | 0.1057 |
- | No log | 27.0 | 351 | 0.1055 |
- | No log | 28.0 | 364 | 0.1087 |
- | No log | 29.0 | 377 | 0.1112 |
- | No log | 30.0 | 390 | 0.1074 |
- | No log | 31.0 | 403 | 0.1108 |
- | No log | 32.0 | 416 | 0.1160 |
- | No log | 33.0 | 429 | 0.1172 |
- | No log | 34.0 | 442 | 0.1125 |
- | No log | 35.0 | 455 | 0.1157 |
- | No log | 36.0 | 468 | 0.1180 |
- | No log | 37.0 | 481 | 0.1157 |
- | No log | 38.0 | 494 | 0.1137 |
- | 0.1477 | 39.0 | 507 | 0.1139 |
- | 0.1477 | 40.0 | 520 | 0.1152 |
+ | No log | 1.0 | 13 | 0.6871 |
+ | No log | 2.0 | 26 | 0.6221 |
+ | No log | 3.0 | 39 | 0.5198 |
+ | No log | 4.0 | 52 | 0.3920 |
+ | No log | 5.0 | 65 | 0.2557 |
+ | No log | 6.0 | 78 | 0.1539 |
+ | No log | 7.0 | 91 | 0.1292 |
+ | No log | 8.0 | 104 | 0.1262 |
+ | No log | 9.0 | 117 | 0.1223 |
+ | No log | 10.0 | 130 | 0.1229 |
+ | No log | 11.0 | 143 | 0.1222 |
+ | No log | 12.0 | 156 | 0.1201 |
+ | No log | 13.0 | 169 | 0.1208 |
+ | No log | 14.0 | 182 | 0.1196 |
+ | No log | 15.0 | 195 | 0.1153 |
+ | No log | 16.0 | 208 | 0.1145 |
+ | No log | 17.0 | 221 | 0.1107 |
+ | No log | 18.0 | 234 | 0.1181 |
+ | No log | 19.0 | 247 | 0.1049 |
+ | No log | 20.0 | 260 | 0.1058 |
+ | No log | 21.0 | 273 | 0.1050 |
+ | No log | 22.0 | 286 | 0.1043 |
+ | No log | 23.0 | 299 | 0.1011 |
+ | No log | 24.0 | 312 | 0.1020 |
+ | No log | 25.0 | 325 | 0.1011 |
+ | No log | 26.0 | 338 | 0.1024 |
+ | No log | 27.0 | 351 | 0.1005 |
+ | No log | 28.0 | 364 | 0.0998 |
+ | No log | 29.0 | 377 | 0.1002 |
+ | No log | 30.0 | 390 | 0.0986 |
+ | No log | 31.0 | 403 | 0.1000 |
+ | No log | 32.0 | 416 | 0.1027 |
+ | No log | 33.0 | 429 | 0.1035 |
+ | No log | 34.0 | 442 | 0.1053 |
+ | No log | 35.0 | 455 | 0.1083 |
+ | No log | 36.0 | 468 | 0.1068 |
+ | No log | 37.0 | 481 | 0.1071 |
+ | No log | 38.0 | 494 | 0.1052 |
+ | 0.1393 | 39.0 | 507 | 0.1115 |
+ | 0.1393 | 40.0 | 520 | 0.1071 |
 
 
  ### Framework versions
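
Note on the updated results table: the reported evaluation loss (0.1071) is the value at the final epoch, not the lowest value in the run. A minimal sketch for locating the lowest validation loss, with the values copied from the `+` rows above. The names `val_loss`, `STEPS_PER_EPOCH`, and `best_epoch` are illustrative and do not come from the training script.

```python
# Validation losses from the updated table, epochs 1 through 40.
val_loss = [
    0.6871, 0.6221, 0.5198, 0.3920, 0.2557, 0.1539, 0.1292, 0.1262, 0.1223, 0.1229,
    0.1222, 0.1201, 0.1208, 0.1196, 0.1153, 0.1145, 0.1107, 0.1181, 0.1049, 0.1058,
    0.1050, 0.1043, 0.1011, 0.1020, 0.1011, 0.1024, 0.1005, 0.0998, 0.1002, 0.0986,
    0.1000, 0.1027, 0.1035, 0.1053, 0.1083, 0.1068, 0.1071, 0.1052, 0.1115, 0.1071,
]

# The Step column advances by 13 per epoch in the table.
STEPS_PER_EPOCH = 13


def best_epoch(losses):
    """Return (1-based epoch, loss) for the lowest validation loss."""
    idx = min(range(len(losses)), key=losses.__getitem__)
    return idx + 1, losses[idx]


epoch, loss = best_epoch(val_loss)
step = epoch * STEPS_PER_EPOCH
final_loss = val_loss[-1]
print(f"lowest validation loss {loss} at epoch {epoch} (step {step}); final epoch: {final_loss}")
```

Whether the lower-loss checkpoint was saved or would generalize better is not stated in the commit; the sketch only reads the table.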
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:db073445aad30537b1b2c297d8d94fbee10098c5858deeea338f2292f53ee852
+ oid sha256:b6b3db0f9dc4702adf35f8336f69e7481e07b146b1aba998de9083ed1bfb04bd
  size 408366800
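
The `model.safetensors` entry is a Git LFS pointer: it records the SHA-256 digest and byte size of the real file, so only the `oid` line changes when the weights are retrained at the same size. A minimal sketch, assuming the weights have been downloaded locally, of checking a file against the new pointer. The helper names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical, not part of any Git LFS tooling.

```python
import hashlib

# New pointer contents from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b6b3db0f9dc4702adf35f8336f69e7481e07b146b1aba998de9083ed1bfb04bd
size 408366800
"""


def parse_lfs_pointer(text):
    """Split 'key value' lines of a pointer file into algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Stream the file in chunks and compare its size and digest with the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
# Example call (the local path is an assumption):
# file_matches_pointer("model.safetensors", pointer)
```

Streaming in chunks keeps memory use flat for a file of roughly 408 MB.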