We introduce the Open-LLM-Leaderboard to track various LLMs' performance on open-style questions and reflect their true capabilities. You can use OSQ-bench questions and prompts to evaluate your models automatically with an LLM-based evaluator.
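
The sketch below is a minimal illustration of this evaluation loop, not the repository's own script: a candidate model answers each open-style question, and a stronger LLM acts as the judge. The benchmark file name `osq_bench.json`, its schema, the judge prompt wording, and the model names are all assumptions chosen for the example.

```python
# Minimal sketch of LLM-based evaluation on open-style questions.
# File name, schema, judge prompt, and model names are placeholders.
import json
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

JUDGE_PROMPT = (
    "You are grading an answer to an open-style question.\n"
    "Question: {question}\n"
    "Reference answer: {reference}\n"
    "Candidate answer: {candidate}\n"
    "Reply with a single word: CORRECT or INCORRECT."
)

def answer(question: str, model: str = "gpt-4o-mini") -> str:
    """Ask the model under evaluation to answer an open-style question."""
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": question}],
    )
    return resp.choices[0].message.content.strip()

def judge(question: str, reference: str, candidate: str,
          judge_model: str = "gpt-4o") -> bool:
    """Use a stronger LLM as the evaluator of the candidate answer."""
    prompt = JUDGE_PROMPT.format(
        question=question, reference=reference, candidate=candidate
    )
    resp = client.chat.completions.create(
        model=judge_model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content.strip().upper().startswith("CORRECT")

if __name__ == "__main__":
    # Hypothetical schema: a list of {"question": ..., "answer": ...} records.
    with open("osq_bench.json") as f:
        bench = json.load(f)
    correct = sum(
        judge(item["question"], item["answer"], answer(item["question"]))
        for item in bench
    )
    print(f"Accuracy: {correct / len(bench):.3f}")
```

In practice the judge prompt, grading scale, and choice of evaluator model would follow the benchmark's own configuration rather than the placeholders used here.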