Zelensky-78B gpqa_diamond_zeroshot

#2
by SkyMind - opened

Since you mentioned 'gpqa_diamond_zeroshot on LM_Eval harness,' what did the final model score, and how long did that benchmark take to run?
