library_name: transformers tags: - unsloth - trl - sft - o1 - qwen2.5 - qwen - conversational pipeline_tag: text-generation
Parm 2 ultra: trained for 2 hours on 1 Million OpenO1 chats, 180k sonnet 3.5, 130k qwq messages.