Is this a distilled model?

#1
by sergeantson - opened

Is this model distilled from R1?

No, it is a fine-tuned model trained with GRPO to gain reasoning capability.
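
For readers unfamiliar with the term: GRPO (Group Relative Policy Optimization) scores several sampled completions for the same prompt and normalizes each reward against the group's mean and standard deviation, so no teacher model's outputs are needed. Below is a minimal sketch of just that normalization step, not this model's actual training code. The function name, the reward values, and the choice of population standard deviation with a small `eps` are illustrative assumptions.

```python
import statistics


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of completions for the same prompt.

    Hypothetical helper for illustration; a real trainer also applies a
    clipped policy-gradient loss and a KL penalty, which are omitted here.
    """
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # assumption: population std
    # eps avoids division by zero when every completion gets the same reward
    return [(r - mean) / (std + eps) for r in rewards]


# Made-up rewards for four completions sampled from one prompt
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Completions that score above the group average get a positive advantage and are reinforced; those below get a negative one.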

umarigan changed discussion status to closed
