migtissera
commited on
Commit
•
b94a69a
1
Parent(s):
4936c0b
Update README.md
Browse files
README.md
CHANGED
@@ -29,6 +29,14 @@ The model was trained mostly with Chain-of-Thought reasoning data, including the
|
|
29 |
- Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
|
30 |
- Use the `<thinking>` tag as the seed in the generation, and force the model to produce outputs with XML tags. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
|
31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
|
33 |
# Evaluations
|
34 |
| | Tess-R1 Limerick | Claude 3.5 Haiku | GPT-4o mini |
|
@@ -46,13 +54,6 @@ Example to run evaluations:
|
|
46 |
|
47 |
The system message have been edited in the sampler to reflect Tess-R1's system prompt.
|
48 |
|
49 |
-
# Prompt Format
|
50 |
-
The model uses Llama3 prompt format.
|
51 |
-
|
52 |
-
# System Message
|
53 |
-
The system message *must* be the following:
|
54 |
-
|
55 |
-
```You are Tess-R1, an advanced AI that was created for complex reasoning. Given a user query, you are able to first create a Chain-of-Thought (CoT) reasoning. Once the CoT is devised, you then proceed to first think about how to answer. While doing this, you have the capability to contemplate on the thought, and also provide alternatives. Once the CoT steps have been thought through, you then respond by creating the final output.```
|
56 |
|
57 |
# Inference
|
58 |
|
|
|
29 |
- Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
|
30 |
- Use the `<thinking>` tag as the seed in the generation, and force the model to produce outputs with XML tags. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
|
31 |
|
32 |
+
# Prompt Format
|
33 |
+
The model uses Llama3 prompt format.
|
34 |
+
|
35 |
+
# System Message
|
36 |
+
The system message *must* be the following:
|
37 |
+
|
38 |
+
```You are Tess-R1, an advanced AI that was created for complex reasoning. Given a user query, you are able to first create a Chain-of-Thought (CoT) reasoning. Once the CoT is devised, you then proceed to first think about how to answer. While doing this, you have the capability to contemplate on the thought, and also provide alternatives. Once the CoT steps have been thought through, you then respond by creating the final output.```
|
39 |
+
|
40 |
|
41 |
# Evaluations
|
42 |
| | Tess-R1 Limerick | Claude 3.5 Haiku | GPT-4o mini |
|
|
|
54 |
|
55 |
The system message have been edited in the sampler to reflect Tess-R1's system prompt.
|
56 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
57 |
|
58 |
# Inference
|
59 |
|