qingy2024 committed on
Commit e10582c · verified · 1 Parent(s): 45cc78a

Update README.md

Files changed (1)
  1. README.md +34 -6
README.md CHANGED
@@ -10,14 +10,42 @@ tags:
  license: apache-2.0
  language:
  - en
+ datasets:
+ - qingy2024/NuminaMath-282k-GRPO
  ---
 
- # Uploaded model
+ # Step Recap 24B Instruct
 
- - **Developed by:** qingy2024
- - **License:** apache-2.0
- - **Finetuned from model :** unsloth/mistral-small-24b-instruct-2501-unsloth-bnb-4bit
+ This model was fine-tuned from Mistral Small 24B Instruct with GRPO (QLoRA) to follow a step, recap, answer thinking pattern.
 
- This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+ **Example: Which is bigger? 9.9 or 9.11?**
 
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+ ```xml
+ <step>
+ To determine which number is bigger between 9.11 and 9.9, we can compare them directly.
+
+ </step>
+
+ <recap>
+ I've identified the numbers to compare: 9.11 and 9.9. The next step is to compare these numbers to determine which is larger.
+
+ </recap>
+
+ <step>
+ Comparing the two numbers:
+ - 9.11 has a tenths digit of 1 and a hundredths digit of 1.
+ - 9.9 has a tenths digit of 9 and no hundredths digit (which is equivalent to 0 hundredths).
+
+ Since 9 (tenths digit of 9.9) is greater than 1 (tenths digit of 9.11), 9.9 is larger than 9.11.
+
+ </step>
+
+ <recap>
+ I have compared the numbers 9.11 and 9.9 and determined that 9.9 is the larger number. There are no further steps needed to solve this problem.
+
+ </recap>
+
+ <answer>9.9</answer>
+ ```
+
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
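
Because the example response wraps its reasoning in fixed tags, downstream code can separate the reasoning from the final answer. The following is a minimal sketch and is not part of this commit. The names `ParsedResponse`, `parse_response`, and `follows_pattern` are hypothetical helpers. It assumes the model emits well-formed, non-nested `<step>`, `<recap>`, and `<answer>` tags as in the README example, which a real deployment may not guarantee.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional

# Matches one <step>, <recap>, or <answer> block; DOTALL lets "." span newlines.
_BLOCK = re.compile(r"<(step|recap|answer)>(.*?)</\1>", re.DOTALL)


@dataclass
class ParsedResponse:
    """Hypothetical container for the pieces of one model response."""
    steps: List[str] = field(default_factory=list)
    recaps: List[str] = field(default_factory=list)
    answer: Optional[str] = None
    order: List[str] = field(default_factory=list)  # tag names in order of appearance


def parse_response(text: str) -> ParsedResponse:
    """Collect the body of every tagged block, with surrounding whitespace stripped."""
    parsed = ParsedResponse()
    for tag, body in _BLOCK.findall(text):
        body = body.strip()
        parsed.order.append(tag)
        if tag == "step":
            parsed.steps.append(body)
        elif tag == "recap":
            parsed.recaps.append(body)
        else:
            parsed.answer = body  # if several <answer> blocks appear, keep the last
    return parsed


def follows_pattern(parsed: ParsedResponse) -> bool:
    """True when blocks alternate step, recap, ... and end with a single answer."""
    order = parsed.order
    if len(order) < 3 or order[-1] != "answer":
        return False
    reasoning = order[:-1]
    expected = ["step", "recap"] * (len(reasoning) // 2)
    return reasoning == expected


if __name__ == "__main__":
    sample = (
        "<step>\nCompare the tenths digits: 9 versus 1.\n</step>\n\n"
        "<recap>\n9.9 has the larger tenths digit.\n</recap>\n\n"
        "<answer>9.9</answer>"
    )
    result = parse_response(sample)
    print(result.answer)            # 9.9
    print(follows_pattern(result))  # True
```

A regular expression is used rather than an XML parser because the output is tagged plain text without a single root element, so a strict XML parser would reject it.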