migtissera
commited on
Commit
•
fddc2ac
1
Parent(s):
b19d1fb
Update README.md
Browse files
README.md
CHANGED
@@ -41,7 +41,7 @@ The system message *must* be the following:
|
|
41 |
The model was trained mostly with Chain-of-Thought reasoning data, including the XML tags. However, to generalize model generations, some single-turn and multi-turn data without XML tags were also included. Due to this, in some instances the model does not produce XML tags and does not fully utilize test-time compute capabilities. There is two ways to get around this:
|
42 |
|
43 |
- Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
|
44 |
-
- Use the `<thinking>` tag as the seed in the generation. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
|
45 |
|
46 |
I have included a sample Python script below.
|
47 |
|
|
|
41 |
The model was trained mostly with Chain-of-Thought reasoning data, including the XML tags. However, to generalize model generations, some single-turn and multi-turn data without XML tags were also included. Due to this, in some instances the model does not produce XML tags and does not fully utilize test-time compute capabilities. There is two ways to get around this:
|
42 |
|
43 |
- Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
|
44 |
+
- Use the `<thinking>` tag as the seed in the generation, and force the model to produce outputs with XML tags. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
|
45 |
|
46 |
I have included a sample Python script below.
|
47 |
|