yonatanbitton committed on
Commit
549fe0b
1 Parent(s): aaa843d

Upload visitbench_leaderboard_Single~Image_Oct282023.tsv

visitbench_leaderboard_Single~Image_Oct282023.tsv ADDED
@@ -0,0 +1,17 @@
+ Category	Model	Elo	# Matches	Win vs. Reference (w/ # ratings)
+ Single Image	human_verified_reference	1361	6030	---
+ Single Image	llava-a1-predictions	1206	724	30.15% (n=136)
+ Single Image	llava13b_output	1091	5474	18.53% (n=475)
+ Single Image	lynx(7B)_v2 prediction	1078	708	15.15% (n=132)
+ Single Image	mPLUG-Owl prediction	1076	5465	16.04% (n=480)
+ Single Image	LlamaAdapter-v2 prediction	1055	5485	14.14% (n=488)
+ Single Image	idefics9b_prediction	1030	842	9.72% (n=144)
+ Single Image	Lynx(8B) predictions	1012	827	11.43% (n=140)
+ Single Image	instruct_blip_output	995	5505	14.12% (n=503)
+ Single Image	otter	970	5495	7.01% (n=499)
+ Single Image	visual_gpt_davinci003_output	937	5486	1.57% (n=510)
+ Single Image	Octopus V2 prediction	936	820	8.90% (n=146)
+ Single Image	MiniGPT-4 prediction	899	5473	3.36% (n=506)
+ Single Image	openflamingo	831	5490	2.95% (n=509)
+ Single Image	panda_gpt_13b_output	767	5480	2.70% (n=519)
+ Single Image	mmgpt_output	757	5504	0.19% (n=527)
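
The added file can be read with a few lines of standard-library Python. The sketch below is not part of the commit. It assumes the file is tab-separated (as the `.tsv` extension suggests) and that the last column is either `---` or has the form `18.53% (n=475)`. The names `parse_leaderboard`, `WIN_RE`, and `SAMPLE` are hypothetical, and `SAMPLE` holds only the header and two rows copied from the diff above.

```python
import csv
import io
import re

# Header plus two rows copied from the diff; tab separation is an assumption.
SAMPLE = (
    "Category\tModel\tElo\t# Matches\tWin vs. Reference (w/ # ratings)\n"
    "Single Image\thuman_verified_reference\t1361\t6030\t---\n"
    "Single Image\tllava13b_output\t1091\t5474\t18.53% (n=475)\n"
)

# Matches values such as "18.53% (n=475)"; the reference row's "---" does not match.
WIN_RE = re.compile(r"([\d.]+)% \(n=(\d+)\)")


def parse_leaderboard(text):
    """Return one dict per leaderboard row, with numeric fields converted."""
    rows = []
    for rec in csv.DictReader(io.StringIO(text), delimiter="\t"):
        m = WIN_RE.fullmatch(rec["Win vs. Reference (w/ # ratings)"].strip())
        rows.append({
            "model": rec["Model"],
            "elo": int(rec["Elo"]),
            "matches": int(rec["# Matches"]),
            # Fraction of rated comparisons won against the reference, if reported.
            "win_rate": float(m.group(1)) / 100 if m else None,
            "n_ratings": int(m.group(2)) if m else None,
        })
    return rows


rows = parse_leaderboard(SAMPLE)
```

To use it on the full file, pass the file's text instead of `SAMPLE`. Rows without a reported win rate, such as the reference row, get `None` in both derived fields.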