File size: 2,336 Bytes
ef0afd2
 
 
 
 
 
 
54b78b0
ef0afd2
f3299e2
 
 
29546d2
ef0afd2
475a193
f1f440b
475a193
abe6e5a
955867a
abe6e5a
29546d2
 
53b5225
a40a23b
d4abd07
7c302d5
 
 
 
 
a89e0a7
 
 
163a300
a89e0a7
7c302d5
de666be
 
9d07f2a
 
 
 
 
 
 
09ee56a
9d07f2a
 
7c302d5
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
title: README
emoji: 🌍
colorFrom: blue
colorTo: gray
sdk: static
pinned: false
short_description: NeurIPS'24 Datasets&Benchmarks
---
# WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

**NeurIPS'24 Datasets&Benchmarks** [**πŸ€— Demo**](https://huggingface.co/spaces/WildVision/vision-arena) | [**πŸ€— Paper**](https://huggingface.co/papers/2406.11069) | [**πŸ“– arXiv**](https://arxiv.org/abs/2406.11069) | [**GitHub**](https://github.com/orgs/WildVision-AI/repositories)


We are a team from AI2, UCSB, UWaterloo, UPenn, NTU, UWM, and UCSC, and we are working on benchmarking vision language models. 

Team Member: Yujie Lu, Dongfu Jiang, Xingyu Fu, Hui Chen, Yingzi Ma, Jing Gu, Michael Saxon

Advisor: Bill Yuchen Lin, Wenhu Chen, Chaowei Xiao, Yejin Choi, Miguel Eckstein, William Yang Wang

Compare VLMs at [WildVision-Arena](https://huggingface.co/spaces/WildVision/vision-arena) and [WildVision-Bench](https://huggingface.co/datasets/WildVision/wildvision-arena-data).

**More chat and vote data will be updated reguarly. Eval script is released here [WildVision-Bench](https://github.com/WildVision-AI/WildVision-Bench)**

Contact: Bill Yuchen Lin ([email protected]) and Yujie Lu ([email protected]) 


Citation: If you found this huggingface space useful, please consider cite us:

```
@article{lu2024wildvision,
  title={WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences},
  author={Lu, Yujie and Jiang, Dongfu and Chen, Wenhu and Wang, William Yang and Choi, Yejin and Lin, Bill Yuchen},
  publisher={NeurIPS},
  year={2024}
}
@misc{yujie2024wildvisionarena,
    title={WildVision Arena: Benchmarking Multimodal LLMs in the Wild},
    url={https://huggingface.co/spaces/WildVision/vision-arena/},
    author={Lu, Yujie and Jiang, Dongfu and Chen, Hui and Ma, Yingzi and Gu, Jing and Xiao, Chaowei and Chen, Wenhu and Wang, William and Choi, Yejin and Lin, Bill Yuchen},
    year={2024}
}
@misc{yujie2024wildvisionv2,
    title={WildVision Data and Model},
    url={https://huggingface.co/WildVision},
    author={Lu, Yujie* and Jiang, Dongfu* and Chen, Hui* and Fu, Xingyu and Ma, Yingzi and Gu, Jing and Saxon, Michael and Xiao, Chaowei and Chen, Wenhu and Choi, Yejin and Lin, Bill Yuchen and Eckstein, Miguel and Wang, William},
    year={2024}
}
```