Watt-Tool-8B-GGUF / scores /Watt-Tool-8B-iq3_m.mmlu
eaddario's picture
Generate Perplexity, KLD, ARC, HellaSwag, MMLU, Truthful QA and WinoGrande scores
6583e65 verified
common_init_from_params: setting dry_penalty_last_n to ctx_size = 768
common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
system_info: n_threads = 6 (n_threads_batch = 6) / 12 | Metal : EMBED_LIBRARY = 1 | CPU : NEON = 1 | ARM_FMA = 1 | FP16_VA = 1 | DOTPROD = 1 | LLAMAFILE = 1 | ACCELERATE = 1 | AARCH64_REPACK = 1 |
multiple_choice_score: there are 1548 tasks in prompt
multiple_choice_score: selecting 750 random tasks from 1548 tasks available
multiple_choice_score: preparing task data...done
multiple_choice_score : calculating TruthfulQA score over 750 tasks.
task acc_norm
1 100.00000000
2 50.00000000
3 33.33333333
4 50.00000000
5 40.00000000
6 33.33333333
7 42.85714286
8 50.00000000
9 44.44444444
10 50.00000000
11 45.45454545
12 41.66666667
13 38.46153846
14 35.71428571
15 40.00000000
16 43.75000000
17 47.05882353
18 44.44444444
19 47.36842105
20 50.00000000
21 52.38095238
22 50.00000000
23 47.82608696
24 50.00000000
25 52.00000000
26 50.00000000
27 51.85185185
28 53.57142857
29 55.17241379
30 56.66666667
31 58.06451613
32 59.37500000
33 57.57575758
34 58.82352941
35 60.00000000
36 58.33333333
37 56.75675676
38 55.26315789
39 53.84615385
40 52.50000000
41 51.21951220
42 50.00000000
43 51.16279070
44 50.00000000
45 51.11111111
46 50.00000000
47 48.93617021
48 50.00000000
49 48.97959184
50 48.00000000
51 47.05882353
52 46.15384615
53 45.28301887
54 46.29629630
55 47.27272727
56 48.21428571
57 49.12280702
58 50.00000000
59 49.15254237
60 48.33333333
61 49.18032787
62 48.38709677
63 47.61904762
64 46.87500000
65 46.15384615
66 45.45454545
67 44.77611940
68 44.11764706
69 43.47826087
70 44.28571429
71 45.07042254
72 44.44444444
73 43.83561644
74 44.59459459
75 45.33333333
76 46.05263158
77 46.75324675
78 47.43589744
79 46.83544304
80 47.50000000
81 48.14814815
82 47.56097561
83 48.19277108
84 48.80952381
85 48.23529412
86 47.67441860
87 48.27586207
88 48.86363636
89 48.31460674
90 47.77777778
91 47.25274725
92 46.73913043
93 46.23655914
94 45.74468085
95 45.26315789
96 45.83333333
97 45.36082474
98 44.89795918
99 44.44444444
100 44.00000000
101 44.55445545
102 44.11764706
103 43.68932039
104 44.23076923
105 44.76190476
106 45.28301887
107 44.85981308
108 44.44444444
109 44.03669725
110 43.63636364
111 44.14414414
112 44.64285714
113 44.24778761
114 44.73684211
115 45.21739130
116 44.82758621
117 44.44444444
118 44.06779661
119 44.53781513
120 44.16666667
121 44.62809917
122 44.26229508
123 43.90243902
124 43.54838710
125 44.00000000
126 43.65079365
127 43.30708661
128 42.96875000
129 42.63565891
130 42.30769231
131 42.74809160
132 43.18181818
133 42.85714286
134 42.53731343
135 42.22222222
136 42.64705882
137 42.33576642
138 42.02898551
139 42.44604317
140 42.85714286
141 43.26241135
142 43.66197183
143 44.05594406
144 43.75000000
145 44.13793103
146 44.52054795
147 44.21768707
148 44.59459459
149 44.29530201
150 44.00000000
151 43.70860927
152 44.07894737
153 44.44444444
154 44.15584416
155 43.87096774
156 43.58974359
157 43.94904459
158 44.30379747
159 44.02515723
160 44.37500000
161 44.09937888
162 43.82716049
163 43.55828221
164 43.29268293
165 43.63636364
166 43.37349398
167 43.71257485
168 43.45238095
169 43.19526627
170 42.94117647
171 42.69005848
172 42.44186047
173 42.19653179
174 41.95402299
175 41.71428571
176 42.04545455
177 41.80790960
178 41.57303371
179 41.89944134
180 41.66666667
181 41.43646409
182 41.75824176
183 42.07650273
184 41.84782609
185 42.16216216
186 42.47311828
187 42.24598930
188 42.02127660
189 42.32804233
190 42.10526316
191 42.40837696
192 42.70833333
193 42.48704663
194 42.26804124
195 42.56410256
196 42.85714286
197 42.63959391
198 42.92929293
199 42.71356784
200 42.50000000
201 42.28855721
202 42.07920792
203 41.87192118
204 41.66666667
205 41.95121951
206 41.74757282
207 41.54589372
208 41.34615385
209 41.14832536
210 40.95238095
211 41.23222749
212 41.03773585
213 40.84507042
214 40.65420561
215 40.93023256
216 41.20370370
217 41.01382488
218 41.28440367
219 41.55251142
220 41.81818182
221 41.62895928
222 41.44144144
223 41.70403587
224 41.51785714
225 41.77777778
226 41.59292035
227 41.40969163
228 41.66666667
229 41.48471616
230 41.30434783
231 41.12554113
232 41.37931034
233 41.20171674
234 41.45299145
235 41.27659574
236 41.10169492
237 40.92827004
238 40.75630252
239 40.58577406
240 40.83333333
241 40.66390041
242 40.90909091
243 41.15226337
244 40.98360656
245 41.22448980
246 41.05691057
247 41.29554656
248 41.12903226
249 40.96385542
250 40.80000000
251 41.03585657
252 40.87301587
253 40.71146245
254 40.55118110
255 40.39215686
256 40.62500000
257 40.85603113
258 40.69767442
259 40.54054054
260 40.76923077
261 40.61302682
262 40.45801527
263 40.30418251
264 40.15151515
265 40.37735849
266 40.60150376
267 40.44943820
268 40.29850746
269 40.52044610
270 40.74074074
271 40.59040590
272 40.44117647
273 40.29304029
274 40.14598540
275 40.00000000
276 40.21739130
277 40.43321300
278 40.28776978
279 40.14336918
280 40.00000000
281 40.21352313
282 40.07092199
283 39.92932862
284 39.78873239
285 40.00000000
286 39.86013986
287 40.06968641
288 39.93055556
289 39.79238754
290 40.00000000
291 40.20618557
292 40.41095890
293 40.61433447
294 40.47619048
295 40.33898305
296 40.20270270
297 40.06734007
298 40.26845638
299 40.46822742
300 40.66666667
301 40.53156146
302 40.72847682
303 40.59405941
304 40.46052632
305 40.65573770
306 40.52287582
307 40.71661238
308 40.58441558
309 40.45307443
310 40.32258065
311 40.51446945
312 40.70512821
313 40.89456869
314 40.76433121
315 40.63492063
316 40.50632911
317 40.69400631
318 40.56603774
319 40.75235110
320 40.62500000
321 40.49844237
322 40.68322981
323 40.55727554
324 40.43209877
325 40.61538462
326 40.49079755
327 40.67278287
328 40.54878049
329 40.42553191
330 40.30303030
331 40.18126888
332 40.06024096
333 40.24024024
334 40.11976048
335 40.29850746
336 40.17857143
337 40.05934718
338 39.94082840
339 39.82300885
340 40.00000000
341 40.17595308
342 40.35087719
343 40.23323615
344 40.11627907
345 40.00000000
346 40.17341040
347 40.34582133
348 40.22988506
349 40.11461318
350 40.28571429
351 40.17094017
352 40.34090909
353 40.22662890
354 40.11299435
355 40.00000000
356 40.16853933
357 40.33613445
358 40.22346369
359 40.11142061
360 40.27777778
361 40.44321330
362 40.60773481
363 40.49586777
364 40.38461538
365 40.27397260
366 40.43715847
367 40.32697548
368 40.21739130
369 40.10840108
370 40.27027027
371 40.16172507
372 40.05376344
373 39.94638070
374 39.83957219
375 40.00000000
376 39.89361702
377 40.05305040
378 40.21164021
379 40.10554090
380 40.00000000
381 40.15748031
382 40.31413613
383 40.46997389
384 40.36458333
385 40.51948052
386 40.41450777
387 40.31007752
388 40.20618557
389 40.35989717
390 40.51282051
391 40.66496164
392 40.81632653
393 40.71246819
394 40.60913706
395 40.75949367
396 40.90909091
397 41.05793451
398 40.95477387
399 41.10275689
400 41.00000000
401 41.14713217
402 41.04477612
403 40.94292804
404 40.84158416
405 40.98765432
406 40.88669951
407 40.78624079
408 40.68627451
409 40.83129584
410 40.73170732
411 40.63260341
412 40.53398058
413 40.43583535
414 40.57971014
415 40.48192771
416 40.62500000
417 40.76738609
418 40.66985646
419 40.81145585
420 40.71428571
421 40.85510689
422 40.75829384
423 40.66193853
424 40.80188679
425 40.94117647
426 41.07981221
427 40.98360656
428 40.88785047
429 41.02564103
430 40.93023256
431 41.06728538
432 41.20370370
433 41.10854503
434 41.24423963
435 41.14942529
436 41.05504587
437 40.96109840
438 41.09589041
439 41.23006834
440 41.13636364
441 41.04308390
442 40.95022624
443 40.85778781
444 40.76576577
445 40.67415730
446 40.80717489
447 40.71588367
448 40.62500000
449 40.53452116
450 40.66666667
451 40.79822616
452 40.70796460
453 40.83885210
454 40.74889868
455 40.65934066
456 40.78947368
457 40.91903720
458 40.82969432
459 40.74074074
460 40.86956522
461 40.78091106
462 40.69264069
463 40.82073434
464 40.94827586
465 40.86021505
466 40.77253219
467 40.89935760
468 40.81196581
469 40.93816631
470 40.85106383
471 40.76433121
472 40.67796610
473 40.80338266
474 40.71729958
475 40.63157895
476 40.75630252
477 40.67085954
478 40.79497908
479 40.91858038
480 41.04166667
481 40.95634096
482 40.87136929
483 40.78674948
484 40.70247934
485 40.61855670
486 40.53497942
487 40.65708419
488 40.57377049
489 40.49079755
490 40.61224490
491 40.52953157
492 40.44715447
493 40.36511156
494 40.48582996
495 40.60606061
496 40.72580645
497 40.64386318
498 40.56224900
499 40.68136273
500 40.80000000
501 40.71856287
502 40.63745020
503 40.55666004
504 40.47619048
505 40.39603960
506 40.51383399
507 40.43392505
508 40.55118110
509 40.47151277
510 40.58823529
511 40.50880626
512 40.42968750
513 40.35087719
514 40.27237354
515 40.38834951
516 40.50387597
517 40.61895551
518 40.73359073
519 40.65510597
520 40.76923077
521 40.69097889
522 40.80459770
523 40.72657744
524 40.64885496
525 40.57142857
526 40.49429658
527 40.41745731
528 40.53030303
529 40.64272212
530 40.75471698
531 40.67796610
532 40.60150376
533 40.52532833
534 40.63670412
535 40.74766355
536 40.67164179
537 40.78212291
538 40.89219331
539 40.81632653
540 40.74074074
541 40.66543438
542 40.77490775
543 40.69981584
544 40.80882353
545 40.73394495
546 40.65934066
547 40.58500914
548 40.69343066
549 40.61930783
550 40.54545455
551 40.65335753
552 40.57971014
553 40.50632911
554 40.43321300
555 40.36036036
556 40.46762590
557 40.57450628
558 40.68100358
559 40.60822898
560 40.53571429
561 40.46345811
562 40.56939502
563 40.67495560
564 40.60283688
565 40.70796460
566 40.63604240
567 40.56437390
568 40.66901408
569 40.59753954
570 40.52631579
571 40.45534151
572 40.55944056
573 40.48865620
574 40.41811847
575 40.34782609
576 40.45138889
577 40.55459272
578 40.48442907
579 40.41450777
580 40.51724138
581 40.44750430
582 40.37800687
583 40.48027444
584 40.58219178
585 40.68376068
586 40.78498294
587 40.71550256
588 40.64625850
589 40.57724958
590 40.50847458
591 40.43993232
592 40.37162162
593 40.47217538
594 40.40404040
595 40.33613445
596 40.43624161
597 40.36850921
598 40.30100334
599 40.40066778
600 40.33333333
601 40.26622296
602 40.19933555
603 40.29850746
604 40.23178808
605 40.16528926
606 40.09900990
607 40.19769357
608 40.13157895
609 40.06568144
610 40.00000000
611 40.09819967
612 40.03267974
613 40.13050571
614 40.06514658
615 40.00000000
616 40.09740260
617 40.19448947
618 40.12944984
619 40.06462036
620 40.00000000
621 39.93558776
622 39.87138264
623 39.80738363
624 39.74358974
625 39.68000000
626 39.77635783
627 39.71291866
628 39.64968153
629 39.74562798
630 39.84126984
631 39.93660856
632 40.03164557
633 39.96840442
634 39.90536278
635 39.84251969
636 39.77987421
637 39.71742543
638 39.65517241
639 39.74960876
640 39.84375000
641 39.78159126
642 39.71962617
643 39.65785381
644 39.75155280
645 39.68992248
646 39.62848297
647 39.56723338
648 39.50617284
649 39.44530046
650 39.38461538
651 39.32411674
652 39.26380368
653 39.20367534
654 39.29663609
655 39.23664122
656 39.17682927
657 39.11719939
658 39.20972644
659 39.15022762
660 39.09090909
661 39.03177005
662 38.97280967
663 38.91402715
664 38.85542169
665 38.79699248
666 38.73873874
667 38.83058471
668 38.92215569
669 39.01345291
670 39.10447761
671 39.04619970
672 38.98809524
673 38.93016345
674 38.87240356
675 38.81481481
676 38.75739645
677 38.70014771
678 38.79056047
679 38.73343152
680 38.82352941
681 38.91336270
682 38.85630499
683 38.79941435
684 38.88888889
685 38.83211679
686 38.92128280
687 38.86462882
688 38.80813953
689 38.75181422
690 38.69565217
691 38.78437048
692 38.72832370
693 38.67243867
694 38.76080692
695 38.70503597
696 38.79310345
697 38.73744620
698 38.82521490
699 38.91273247
700 38.85714286
701 38.94436519
702 39.03133903
703 39.11806543
704 39.20454545
705 39.14893617
706 39.09348442
707 39.03818953
708 38.98305085
709 38.92806770
710 38.87323944
711 38.95921238
712 39.04494382
713 39.13043478
714 39.07563025
715 39.02097902
716 38.96648045
717 38.91213389
718 38.85793872
719 38.94297636
720 38.88888889
721 38.83495146
722 38.78116343
723 38.86583679
724 38.81215470
725 38.89655172
726 38.98071625
727 38.92709766
728 38.87362637
729 38.82030178
730 38.76712329
731 38.71409029
732 38.66120219
733 38.74488404
734 38.82833787
735 38.77551020
736 38.72282609
737 38.80597015
738 38.88888889
739 38.97158322
740 38.91891892
741 38.86639676
742 38.94878706
743 39.03095559
744 39.11290323
745 39.06040268
746 39.00804290
747 38.95582329
748 38.90374332
749 38.85180240
750 38.93333333
Final result: 38.9333 +/- 1.7816
Random chance: 25.0000 +/- 1.5822