Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MixEval

community
https://mixeval.github.io/
NiJinjie
Psycoy
Activity Feed

AI & ML interests

LLM & LMM evaluation

Recent Activity

Solaris99  authored a paper about 1 month ago
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Solaris99  authored a paper about 1 month ago
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
Solaris99  authored a paper about 1 month ago
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
View all activity

Jinjie Ni's profile picture Fuzhao Xue's profile picture Xiang Yue's profile picture Deepanway's profile picture Bo Li's profile picture David Junhao ZHANG's profile picture Yifan Song's profile picture

MixEval 's models

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs