theprint/PyRe-Llama8.1-8B

theprint committed 8 days ago

Commit 8d9797e · verified · 1 parent: 6dc9558

Update README.md

Files changed (1)

README.md +1 -1


@@ -15,7 +15,7 @@ language:
 Please note that this model is a WIP experiment into GRPO fine tuning on Python code problems for reasoning. The performance of this model varies greatly depending on task, prompt and parameters.
-I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of <think> and <answer> tags in the system prompt.
+I recommend a very low temperature, like 0.1. You may also see more consistent results by encouraging the use of `<think>` and `<answer>` tags in the system prompt.
 ### Example System Prompt
 ```