Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MixEval
community
https://mixeval.github.io/
NiJinjie
Psycoy
Activity Feed
Follow
12
AI & ML interests
LLM & LMM evaluation
Recent Activity
Solaris99
authored
a paper
about 1 month ago
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Solaris99
authored
a paper
about 1 month ago
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
Solaris99
authored
a paper
about 1 month ago
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
View all activity
Team members
7
MixEval
's models
None public yet