Update README.md
README.md
CHANGED
@@ -10,14 +10,42 @@ tags:
 license: apache-2.0
 language:
 - en
+datasets:
+- qingy2024/NuminaMath-282k-GRPO
 ---
 
-#
+# Step Recap 24B Instruct
 
-
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/mistral-small-24b-instruct-2501-unsloth-bnb-4bit
+This model is created with GRPO (QLoRA) from Mistral Small 24B Instruct to follow the thinking pattern of step, recap, answer.
 
-
+**Example: Which is bigger? 9.9 or 9.11?**
 
-
+```xml
+<step>
+To determine which number is bigger between 9.11 and 9.9, we can compare them directly.
+
+</step>
+
+<recap>
+I've identified the numbers to compare: 9.11 and 9.9. The next step is to compare these numbers to determine which is larger.
+
+</recap>
+
+<step>
+Comparing the two numbers:
+- 9.11 has a tenths digit of 1 and a hundredths digit of 1.
+- 9.9 has a tenths digit of 9 and no hundredths digit (which is equivalent to 0 hundredths).
+
+Since 9 (tenths digit of 9.9) is greater than 1 (tenths digit of 9.11), 9.9 is larger than 9.11.
+
+</step>
+
+<recap>
+I have compared the numbers 9.11 and 9.9 and determined that 9.9 is the larger number. There are no further steps needed to solve this problem.
+
+</recap>
+
+<answer>9.9</answer>
+```
+
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
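The step, recap, answer pattern shown in the README example is regular enough to parse mechanically. The following is a minimal sketch of how a consumer might split a response into its tagged blocks and pull out the final answer. It assumes the model always emits well-formed, non-nested `<step>`, `<recap>`, and `<answer>` tags as in the example; the names `Block`, `parse_output`, `final_answer`, and `SAMPLE` are hypothetical and not part of the model or any library.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Block:
    tag: str   # "step", "recap", or "answer"
    text: str  # block content with surrounding whitespace stripped


# Matches one <tag>...</tag> pair; the backreference \1 requires the
# closing tag to match the opening one. DOTALL lets "." span newlines.
TAG_PATTERN = re.compile(r"<(step|recap|answer)>(.*?)</\1>", re.DOTALL)


def parse_output(output: str) -> List[Block]:
    """Return the tagged blocks of a response in the order they appear."""
    return [
        Block(tag=m.group(1), text=m.group(2).strip())
        for m in TAG_PATTERN.finditer(output)
    ]


def final_answer(output: str) -> Optional[str]:
    """Return the text of the last <answer> block, or None if there is none."""
    answers = [b.text for b in parse_output(output) if b.tag == "answer"]
    return answers[-1] if answers else None


# Shortened, illustrative response in the same shape as the README example.
SAMPLE = """<step>
Compare the tenths digits of 9.11 and 9.9.

</step>

<recap>
9 is greater than 1, so 9.9 is larger.

</recap>

<answer>9.9</answer>
"""

if __name__ == "__main__":
    for block in parse_output(SAMPLE):
        print(f"[{block.tag}] {block.text}")
    print("final:", final_answer(SAMPLE))
```

A regular expression is enough here because the tags in the example do not nest; if real outputs turn out to be malformed or truncated, `final_answer` returns `None` rather than raising, so callers can decide how to handle that case.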