Evaluation

by owao - opened 17 days ago

owao

17 days ago

Hey! Really great initiative :)

I find the idea of having a handy small model for simple QA really attractable but I was surprised no evaluation results are presented.
While I obviously get the point of not sharing your personal benchmark dataset, why not evaluate the models on SimpleQA for example?

McaTech

Owner 17 days ago

Hi! I am still working on it. Maybe next week I will present the benchmark results with new model updates. I still upgrading the model for QA.

owao

14 days ago

Great to hear ;) thanks for your reply

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment