Leaderboard is of very limited use without more 0-shot, instruction prompted datasets

#27

by JulesGM - opened May 25, 2023

May 25, 2023

Most of the use of LLM nowadays is with zero shot & prompting, yet there is just one fairly specific dataset evaluating this.

I think it would be important to add more zero-shotted, instruction prompted datasets as this is how the models will be used a large fraction of the time.

clefourrier

Open LLM Leaderboard org Jul 13, 2023

•

edited Jul 13, 2023

Hi! We tried to select a good range of evaluation tasks based on what is used in the litterature to compare models :)
We might add more 0-shot evaluations in the future!

clefourrier changed discussion status to closed Jul 21, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment