migtissera
/

Tess-R1-Limerick-Llama-3.1-70B

Model card Files Files and versions Community

migtissera commited on 11 days ago

Commit

fddc2ac

•

1 Parent(s): b19d1fb

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -41,7 +41,7 @@ The system message *must* be the following:
 The model was trained mostly with Chain-of-Thought reasoning data, including the XML tags. However, to generalize model generations, some single-turn and multi-turn data without XML tags were also included. Due to this, in some instances the model does not produce XML tags and does not fully utilize test-time compute capabilities. There is two ways to get around this:
 - Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
-- Use the `<thinking>` tag as the seed in the generation. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
 I have included a sample Python script below.

 The model was trained mostly with Chain-of-Thought reasoning data, including the XML tags. However, to generalize model generations, some single-turn and multi-turn data without XML tags were also included. Due to this, in some instances the model does not produce XML tags and does not fully utilize test-time compute capabilities. There is two ways to get around this:
 - Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
+- Use the `<thinking>` tag as the seed in the generation, and force the model to produce outputs with XML tags. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
 I have included a sample Python script below.