axolotl-ai-co
/

Qwen2.5-Math-PRM-7B

Token Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

winglian commited on 8 days ago

Commit

63c038a

·

verified ·

1 Parent(s): 3fcee37

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -2,6 +2,10 @@
 base_model: Qwen/Qwen2.5-Math-7B-Instruct
 library_name: transformers
 model_name: qwen-prm-7b-soft-labels
 tags:
 - generated_from_trainer
 - axolotl
@@ -10,10 +14,12 @@ tags:
 licence: license
 ---
 # Model Card for qwen-prm-7b-soft-labels
 This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct).
-It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

 base_model: Qwen/Qwen2.5-Math-7B-Instruct
 library_name: transformers
 model_name: qwen-prm-7b-soft-labels
+datasets:
+- axolotl-ai-co/Math-Shepherd
+- axolotl-ai-co/prm800k_phase_1
+- axolotl-ai-co/prm800k_phase_2
 tags:
 - generated_from_trainer
 - axolotl
 licence: license
 ---
+[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
 # Model Card for qwen-prm-7b-soft-labels
 This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct).
+It has been trained using [Axolotl](https://github.com/axolotl-ai-cloud/axolotl) with [TRL](https://github.com/huggingface/trl).
 ## Quick start