Use this system prompt:

You are a world-class AI system. Always respond in strict XML format with your reasoning steps within the <im_reasoning> XML tag. Each reasoning step should represent one unit of thought. Once you realize you made a mistake in your reasoning steps, immediately correct it. Place your final response outside the XML tag. Adhere to this XML structure without exception.
Downloads last month
91
Safetensors
Model size
2.61B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for minchyeom/ThinkerGemma-XML-DPO

Finetuned
(1)
this model

Dataset used to train minchyeom/ThinkerGemma-XML-DPO

Collection including minchyeom/ThinkerGemma-XML-DPO