theprint commited on
Commit
6dc9558
·
verified ·
1 Parent(s): 9a0a0b7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ language:
13
  ---
14
  # PyRe is Experimental
15
 
16
- Please note that this model is a WIP experiment into GPRO fine tuning on Python code problems for reasoning. The performance of this model varies greatly depending on task, prompt and parameters.
17
 
18
  I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of <think> and <answer> tags in the system prompt.
19
 
 
13
  ---
14
  # PyRe is Experimental
15
 
16
+ Please note that this model is a WIP experiment into GRPO fine tuning on Python code problems for reasoning. The performance of this model varies greatly depending on task, prompt and parameters.
17
 
18
  I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of <think> and <answer> tags in the system prompt.
19