OpenCompass

community

https://opencompass.org.cn/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Mor-Li authored a paper 4 days ago

Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Mor-Li authored a paper 4 days ago

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Sudanl updated a collection 4 days ago

CompassVerifier

View all activity

Organization Card

Community About org cards

OpenCompass Website ^HOT OpenCompass Toolkit ^{TRY IT OUT}

👋 join us on Discord and WeChat

follow us on Github

OpenCompass is a platform focused on evaluation of AGI, include Large Language Model and Multi-modality Model. We aim to:

develop high-quality libraries to reduce the difficulties in evaluation
provide convincing leaderboards for improving the understanding of the large models
create powerful toolchains targeting a variety of abilities and tasks
build solid benchmarks to support the large model research

Collections 3

View 3 collections

spaces 17

RISEBench Gallery

A Gallery of Generation Results on RISEBench

Open LMM Spatial Leaderboard

A Leaderboard for LMM spatial understanding capabilities

Open LMM Subjective Leaderboard

VLMEvalKit Subjectivce Benchmark Results

CompassAcademic Leaderboard Full Version

Compass Academic Leaderboard Full Version

Open LMM Reasoning Leaderboard

A Leaderboard that demonstrates LMM reasoning capabilities

Compass Academic Leaderboard

Compass Academic Leaderboard

models 13

opencompass/CompassJudger-2-7B-Instruct

Text Ranking • 8B • Updated 20 days ago • 264 • 2

opencompass/CompassJudger-2-32B-Instruct

Text Ranking • 33B • Updated 20 days ago • 117 • 2

opencompass/CompassVerifier-32B

33B • Updated Jul 11 • 14 • 5

opencompass/CompassVerifier-7B

8B • Updated Jul 11 • 77 • 4

opencompass/CompassVerifier-3B

3B • Updated Jul 11 • 68 • 2

opencompass/anah-7b

Text Classification • 8B • Updated Mar 8 • 3

opencompass/anah-20b

Text Classification • 20B • Updated Mar 8 • 4

opencompass/anah-v2

Text Classification • 8B • Updated Mar 8 • 5 • 4

opencompass/CompassJudger-1-14B-Instruct

Text Generation • 15B • Updated Oct 30, 2024 • 6 • 2

opencompass/CompassJudger-1-32B-Instruct

Text Generation • 33B • Updated Oct 30, 2024 • 15 • 17

datasets 14

opencompass/LiveMathBench

Viewer • Updated 6 days ago • 483 • 1.25k • 9

opencompass/CodeForce_SAGA

Viewer • Updated 10 days ago • 5.57k • 238 • 1

opencompass/CodeCompass

Updated 10 days ago • 290 • 1

opencompass/VerifierBench

Viewer • Updated about 1 month ago • 2.82k • 243 • 1

opencompass/NeedleBench

Viewer • Updated May 12 • 6.8k • 8.36k • 5

opencompass/compass_academic_predictions

Viewer • Updated Apr 7 • 4.42M • 65

opencompass/Creation-MMBench

Viewer • Updated Mar 19 • 765 • 106 • 2

opencompass/anah

Viewer • Updated Mar 13 • 783 • 82 • 3

opencompass/AIME2025

Viewer • Updated Feb 25 • 30 • 7.33k • 26

opencompass/mmmlu_lite

Viewer • Updated Nov 1, 2024 • 20k • 47 • 2

View 14 datasets