ggml files of bge-base-en
You can use this ggml for https://github.com/skeskinen/bert.cpp
bge-base-en
Data Type |
STSBenchmark |
eval time |
EmotionClassification |
eval time |
f32 |
0.8630 |
39.56 |
0.5533 |
69.55 |
f16 |
0.8630 |
32.95 |
0.5533 |
55.75 |
q4_0 |
0.8627 |
27.23 |
0.5540 |
73.29 |
q4_1 |
0.8654 |
29.78 |
0.5508 |
69.81 |
all-MiniLM-L12-v2
Data Type |
STSBenchmark |
eval time |
EmotionClassification |
eval time |
f32 |
0.8306 |
13.36 |
0.4117 |
21.23 |
f16 |
0.8306 |
11.51 |
0.4119 |
20.08 |
q4_0 |
0.8310 |
11.27 |
0.4183 |
20.81 |
q4_1 |
0.8325 |
12.37 |
0.4093 |
19.38 |
all-MiniLM-L6-v2
Data Type |
STSBenchmark |
eval time |
EmotionClassification |
eval time |
f32 |
0.8201 |
6.83 |
0.4082 |
11.34 |
f16 |
0.8201 |
6.17 |
0.4085 |
10.28 |
q4_0 |
0.8175 |
5.45 |
0.3911 |
10.63 |
q4_1 |
0.8223 |
6.79 |
0.4027 |
11.41 |
bert-base-uncased
Data Type |
STSBenchmark |
eval time |
EmotionClassification |
eval time |
f32 |
0.4738 |
52.38 |
0.3361 |
88.56 |
f16 |
0.4739 |
33.24 |
0.3361 |
55.86 |
q4_0 |
0.4940 |
33.93 |
0.3375 |
57.82 |
q4_1 |
0.4612 |
36.86 |
0.3318 |
59.63 |