OpenBuddy
/

openbuddy-falcon-180b-v13-preview0

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

openbuddy-falcon-180b-v13-preview0 / README.md

lihongze8's picture

Adding Evaluation Results

6d75a80 about 1 year ago

|

634 Bytes

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	65.85
ARC (25-shot)	65.1
HellaSwag (10-shot)	86.19
MMLU (5-shot)	64.6
TruthfulQA (0-shot)	54.97
Winogrande (5-shot)	82.64
GSM8K (5-shot)	41.62