theprint commited on
Commit
df0fcf3
·
verified ·
1 Parent(s): 40c721a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md CHANGED
@@ -11,6 +11,19 @@ license: apache-2.0
11
  language:
12
  - en
13
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
  # Uploaded model
16
 
 
11
  language:
12
  - en
13
  ---
14
+ # PyRe is Experimental
15
+
16
+ Please note that this model is a WIP experiment into GRPO fine tuning on Python code problems for reasoning. The performance of this model varies greatly depending on task, prompt and parameters.
17
+
18
+ I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of `<think>` and `<answer>` tags in the system prompt.
19
+
20
+ This GGUF is based on [PyRe-Llama8.1-8B](https://huggingface.co/theprint/PyRe-Llama8.1-8B).
21
+
22
+ ### Example System Prompt
23
+ ```
24
+ Think through complex problems carefully, before giving the user your final answer. Use <think> and </think> to encapsulate your thoughts.
25
+ ```
26
+
27
 
28
  # Uploaded model
29