baohuynhbk14 commited on
Commit
2432df7
·
verified ·
1 Parent(s): 97a60a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +291 -0
README.md CHANGED
@@ -24,6 +24,297 @@ The model will give softmax outputs for three labels.
24
  1 -> Positive
25
  2 -> Neutral
26
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ## Usage (HuggingFace Transformers)
28
 
29
  Install `transformers` package:
 
24
  1 -> Positive
25
  2 -> Neutral
26
  ```
27
+
28
+ ## Dataset
29
+
30
+ <table border="2">
31
+ <tr align="center">
32
+ <th rowspan="2">Dataset</th>
33
+ <th colspan="3">Train</th>
34
+ <th colspan="3">Test</th>
35
+ <th colspan="3">Val</th>
36
+ </tr>
37
+ <tr align="center">
38
+ <th>Neg</th>
39
+ <th>Pos</th>
40
+ <th>Neu</th>
41
+ <th>Neg</th>
42
+ <th>Pos</th>
43
+ <th>Neu</th>
44
+ <th>Neg</th>
45
+ <th>Pos</th>
46
+ <th>Neu</th>
47
+ </tr>
48
+ <tr align="center">
49
+ <td align="left">All</td>
50
+ <td>1000</td>
51
+ <td>2000</td>
52
+ <td>3000</td>
53
+ <td>1000</td>
54
+ <td>2000</td>
55
+ <td>3000</td>
56
+ <td>1000</td>
57
+ <td>2000</td>
58
+ <td>3000</td>
59
+ </tr>
60
+ <tr align="center">
61
+ <td align="left">SA-VLSP2016</td>
62
+ <td>1000</td>
63
+ <td>2000</td>
64
+ <td>3000</td>
65
+ <td>1000</td>
66
+ <td>2000</td>
67
+ <td>3000</td>
68
+ <td>1000</td>
69
+ <td>2000</td>
70
+ <td>3000</td>
71
+ </tr>
72
+ <tr align="center">
73
+ <td align="left">UIT-VSFC </td>
74
+ <td>1000</td>
75
+ <td>2000</td>
76
+ <td>3000</td>
77
+ <td>1000</td>
78
+ <td>2000</td>
79
+ <td>3000</td>
80
+ <td>1000</td>
81
+ <td>2000</td>
82
+ <td>3000</td>
83
+ </tr>
84
+ <tr align="center">
85
+ <td align="left">UIT-VSMEC</td>
86
+ <td>1000</td>
87
+ <td>2000</td>
88
+ <td>3000</td>
89
+ <td>1000</td>
90
+ <td>2000</td>
91
+ <td>3000</td>
92
+ <td>1000</td>
93
+ <td>2000</td>
94
+ <td>3000</td>
95
+ </tr>
96
+ <tr align="center">
97
+ <td align="left">AIVIVN-2019</td>
98
+ <td>1000</td>
99
+ <td>2000</td>
100
+ <td>3000</td>
101
+ <td>1000</td>
102
+ <td>2000</td>
103
+ <td>3000</td>
104
+ <td>1000</td>
105
+ <td>2000</td>
106
+ <td>3000</td>
107
+ </tr>
108
+ <tr align="center">
109
+ <td align="left">UIT-ViCTSD</td>
110
+ <td>1000</td>
111
+ <td>2000</td>
112
+ <td>3000</td>
113
+ <td>1000</td>
114
+ <td>2000</td>
115
+ <td>3000</td>
116
+ <td>1000</td>
117
+ <td>2000</td>
118
+ <td>3000</td>
119
+ </tr>
120
+ <tr align="center">
121
+ <td align="left">UIT-ViHSD</td>
122
+ <td>1000</td>
123
+ <td>2000</td>
124
+ <td>3000</td>
125
+ <td>1000</td>
126
+ <td>2000</td>
127
+ <td>3000</td>
128
+ <td>1000</td>
129
+ <td>2000</td>
130
+ <td>3000</td>
131
+ </tr>
132
+ <tr align="center">
133
+ <td align="left">UIT-ViSFD</td>
134
+ <td>1000</td>
135
+ <td>2000</td>
136
+ <td>3000</td>
137
+ <td>1000</td>
138
+ <td>2000</td>
139
+ <td>3000</td>
140
+ <td>1000</td>
141
+ <td>2000</td>
142
+ <td>3000</td>
143
+ </tr>
144
+ <tr align="center">
145
+ <td align="left">UIT-ViOCD</td>
146
+ <td>1000</td>
147
+ <td>2000</td>
148
+ <td>3000</td>
149
+ <td>1000</td>
150
+ <td>2000</td>
151
+ <td>3000</td>
152
+ <td>1000</td>
153
+ <td>2000</td>
154
+ <td>3000</td>
155
+ </tr>
156
+ <tr align="center">
157
+ <td align="left">Ecommerce-reviews</td>
158
+ <td>1000</td>
159
+ <td>2000</td>
160
+ <td>3000</td>
161
+ <td>1000</td>
162
+ <td>2000</td>
163
+ <td>3000</td>
164
+ <td>1000</td>
165
+ <td>2000</td>
166
+ <td>3000</td>
167
+ </tr>
168
+ <tr align="center">
169
+ <td align="left">VOZ-HSD</td>
170
+ <td>1000</td>
171
+ <td>2000</td>
172
+ <td>3000</td>
173
+ <td>1000</td>
174
+ <td>2000</td>
175
+ <td>3000</td>
176
+ <td>1000</td>
177
+ <td>2000</td>
178
+ <td>3000</td>
179
+ </tr>
180
+ <tr align="center">
181
+ <td align="left">Vietnamese-amazon-polarity</td>
182
+ <td>1000</td>
183
+ <td>2000</td>
184
+ <td>3000</td>
185
+ <td>1000</td>
186
+ <td>2000</td>
187
+ <td>3000</td>
188
+ <td>1000</td>
189
+ <td>2000</td>
190
+ <td>3000</td>
191
+ </tr>
192
+ </table>
193
+
194
+ ## Evaluation
195
+ <table>
196
+ <tr align="center">
197
+ <td rowspan=2><b>Model</td>
198
+ <td rowspan=2><b>Avg MF1</td>
199
+ <td colspan=3><b>Emotion Recognition</td>
200
+ <td colspan=3><b>Hate Speech Detection</td>
201
+ <td colspan=3><b>Spam Reviews Detection</td>
202
+ <td colspan=3><b>Hate Speech Spans Detection</td>
203
+ </tr>
204
+ <tr align="center">
205
+ <td><b>Acc</td>
206
+ <td><b>WF1</td>
207
+ <td><b>MF1</td>
208
+ <td><b>Acc</td>
209
+ <td><b>WF1</td>
210
+ <td><b>MF1</td>
211
+ <td><b>Acc</td>
212
+ <td><b>WF1</td>
213
+ <td><b>MF1</td>
214
+ <td><b>Acc</td>
215
+ <td><b>WF1</td>
216
+ <td><b>MF1</td>
217
+ </tr>
218
+ <tr align="center">
219
+ <td align="left">viBERT</td>
220
+ <td>78.16</td>
221
+ <td>61.91</td>
222
+ <td>61.98</td>
223
+ <td>59.7</td>
224
+ <td>85.34</td>
225
+ <td>85.01</td>
226
+ <td>62.07</td>
227
+ <td>89.93</td>
228
+ <td>89.79</td>
229
+ <td>76.8</td>
230
+ <td>90.42</td>
231
+ <td>90.45</td>
232
+ <td>84.55</td>
233
+ </tr>
234
+ <tr align="center">
235
+ <td align="left">vELECTRA</td>
236
+ <td>79.23</td>
237
+ <td>64.79</td>
238
+ <td>64.71</td>
239
+ <td>61.95</td>
240
+ <td>86.96</td>
241
+ <td>86.37</td>
242
+ <td>63.95</td>
243
+ <td>89.83</td>
244
+ <td>89.68</td>
245
+ <td>76.23</td>
246
+ <td>90.59</td>
247
+ <td>90.58</td>
248
+ <td>85.12</td>
249
+ </tr>
250
+ <tr align="center">
251
+ <td align="left">PhoBERT-Base </td>
252
+ <td>79.3</td>
253
+ <td>63.49</td>
254
+ <td>63.36</td>
255
+ <td>61.41</td>
256
+ <td>87.12</td>
257
+ <td>86.81</td>
258
+ <td>65.01</td>
259
+ <td>89.83</td>
260
+ <td>89.75</td>
261
+ <td>76.18</td>
262
+ <td>91.32</td>
263
+ <td>91.38</td>
264
+ <td>85.92</td>
265
+ </tr>
266
+ <tr align="center">
267
+ <td align="left">PhoBERT-Large</td>
268
+ <td>79.82</td>
269
+ <td>64.71</td>
270
+ <td>64.66</td>
271
+ <td>62.55</td>
272
+ <td>87.32</td>
273
+ <td>86.98</td>
274
+ <td>65.14</td>
275
+ <td>90.12</td>
276
+ <td>90.03</td>
277
+ <td>76.88</td>
278
+ <td>91.44</td>
279
+ <td>91.46</td>
280
+ <td>86.56</td>
281
+ </tr>
282
+ <tr align="center">
283
+ <td align="left">ViSoBERT</td>
284
+ <td>81.58</td>
285
+ <td>68.1</td>
286
+ <td>68.37</td>
287
+ <td>65.88</td>
288
+ <td>88.51</td>
289
+ <td>88.31</td>
290
+ <td>68.77</td>
291
+ <td>90.99</td>
292
+ <td><b>90.92</td>
293
+ <td><b>79.06</td>
294
+ <td>91.62</td>
295
+ <td>91.57</td>
296
+ <td>86.8</td>
297
+ </tr>
298
+ <tr align="center">
299
+ <td align="left">visobert-14gb-corpus</td>
300
+ <td><b>82.2</td>
301
+ <td><b>68.69</td>
302
+ <td><b>68.75</td>
303
+ <td><b>66.03</td>
304
+ <td><b>88.79</td>
305
+ <td><b>88.6</td>
306
+ <td><b>69.57</td>
307
+ <td><b>91.02</td>
308
+ <td>90.88</td>
309
+ <td>77.13</td>
310
+ <td><b>93.69</td>
311
+ <td><b>93.63</td>
312
+ <td><b>89.66</td>
313
+ </tr>
314
+ </div>
315
+ </table>
316
+
317
+
318
  ## Usage (HuggingFace Transformers)
319
 
320
  Install `transformers` package: