Spaces: LLM360/k2-eval-gallery
k2-eval-gallery/eval-results
3 contributors
History: 4 commits
Latest commit: mylibrar, "Upload other metrics" (c33c8c6), 12 months ago
Name             Last commit message                                Updated
arc_challenge    Upload other metrics                               12 months ago
arc_easy         Upload other metrics                               12 months ago
bbh_cot_fewshot  Upload other metrics                               12 months ago
crowspairs       Upload 3 metrics                                   12 months ago
gsm8k            Upload other metrics                               12 months ago
hellaswag        Upload other metrics                               12 months ago
humaneval        Upload results for 3 metrics                       12 months ago
logiqa2          Upload other metrics                               12 months ago
mathqa           Upload other metrics                               12 months ago
mbpp             Upload results for 3 metrics                       12 months ago
medmcqa          Upload other metrics                               12 months ago
medqa            Upload more metrics and fix some issues in app.py  12 months ago
mmlu             Upload other metrics                               12 months ago
openbookqa5      Upload 3 metrics                                   12 months ago
piqa5            Upload more metrics and fix some issues in app.py  12 months ago
pubmedqa         Upload more metrics and fix some issues in app.py  12 months ago
race             Upload other metrics                               12 months ago
toxigen          Upload results for 3 metrics                       12 months ago
toxigen2         Upload more metrics and fix some issues in app.py  12 months ago
truthfulqa_mc2   Upload other metrics                               12 months ago
winogrande5      Upload 3 metrics                                   12 months ago