winglian committed · Commit 63c038a · verified · 1 Parent(s): 3fcee37

Update README.md

Files changed (1)
  1. README.md +7 -1
README.md CHANGED
@@ -2,6 +2,10 @@
 base_model: Qwen/Qwen2.5-Math-7B-Instruct
 library_name: transformers
 model_name: qwen-prm-7b-soft-labels
+datasets:
+- axolotl-ai-co/Math-Shepherd
+- axolotl-ai-co/prm800k_phase_1
+- axolotl-ai-co/prm800k_phase_2
 tags:
 - generated_from_trainer
 - axolotl
@@ -10,10 +14,12 @@ tags:
 licence: license
 ---
 
+[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
+
 # Model Card for qwen-prm-7b-soft-labels
 
 This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct).
-It has been trained using [TRL](https://github.com/huggingface/trl).
+It has been trained using [Axolotl](https://github.com/axolotl-ai-cloud/axolotl) with [TRL](https://github.com/huggingface/trl).
 
 ## Quick start
 