Not trained with test-time compute reinforcement learning? I.e., no thinking support?

#13
by imoc - opened

No thinking capability is mentioned in the description or in the chat template. I'm worried this might be a drawback, as TTC (test-time compute) bumps the performance of a same-size LLM up by almost one level. (But if the model is really strong and flexible enough, this can be compensated for a little by modifying the system prompt.)
