Zelensky-78B gpqa_diamond_zeroshot

by SkyMind - opened Jan 26

Jan 26

Since you mentioned 'gpqa_diamond_zeroshot on LM_Eval harness,' what did the final model score, and how long did that benchmark take to run?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment