theprint commited on
Commit
8d9797e
·
verified ·
1 Parent(s): 6dc9558

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ language:
15
 
16
  Please note that this model is a WIP experiment into GRPO fine tuning on Python code problems for reasoning. The performance of this model varies greatly depending on task, prompt and parameters.
17
 
18
- I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of <think> and <answer> tags in the system prompt.
19
 
20
  ### Example System Prompt
21
  ```
 
15
 
16
  Please note that this model is a WIP experiment into GRPO fine tuning on Python code problems for reasoning. The performance of this model varies greatly depending on task, prompt and parameters.
17
 
18
+ I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of `<think>` and `<answer>` tags in the system prompt.
19
 
20
  ### Example System Prompt
21
  ```