R1 distill to Mistral Small?
#99
by nfunctor
Thanks a lot for your great work! Your distill models are pretty nice, and I wondered if you would consider making a distill of mistralai/Mistral-Small-24B-Instruct-2501 (also available as a base model). With its Apache license and strong performance for a 24B model, it's a very attractive target, and such a model could handle long-context generation on a single 24 GB GPU when quantised. Thanks!
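For a rough sense of feasibility: 24B parameters in 4-bit NF4 is about 12-13 GB of weights, which leaves headroom for activations and a long KV cache on a 24 GB card. A minimal loading sketch using `transformers` with `bitsandbytes` quantisation; note the repo id is hypothetical, since no such distill exists yet:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Hypothetical repo id: no R1 distill of Mistral Small exists yet.
model_id = "deepseek-ai/DeepSeek-R1-Distill-Mistral-Small-24B"

# 4-bit NF4 quantisation: roughly 12-13 GB of weights for a 24B model,
# leaving room on a 24 GB card for activations and the KV cache.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # place the whole model on the single GPU
)

prompt = "Explain the Banach fixed-point theorem step by step."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```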
Would be interested too.
Really hoping to see this distillation! 🙏
Being able to run a powerful R1-distilled model on a single 24GB card would be incredible. Would love to experiment with this - please consider it!
Absolutely needed