Independent evaluation results

#9
by yaronr - opened

Dear Qwen team,

I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.
The results demonstrate impressive performance for the model across multiple categories compared with other models.
I hope you find this useful.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment