Update README.md
README.md
@@ -2,6 +2,10 @@
 base_model: Qwen/Qwen2.5-Math-7B-Instruct
 library_name: transformers
 model_name: qwen-prm-7b-soft-labels
+datasets:
+- axolotl-ai-co/Math-Shepherd
+- axolotl-ai-co/prm800k_phase_1
+- axolotl-ai-co/prm800k_phase_2
 tags:
 - generated_from_trainer
 - axolotl
@@ -10,10 +14,12 @@ tags:
 licence: license
 ---
 
+[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
+
 # Model Card for qwen-prm-7b-soft-labels
 
 This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct).
-It has been trained using [TRL](https://github.com/huggingface/trl).
+It has been trained using [Axolotl](https://github.com/axolotl-ai-cloud/axolotl) with [TRL](https://github.com/huggingface/trl).
 
 ## Quick start
 