AlejandroOlmedo
/

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx

Text Generation

Generated from Trainer

text-generation-inference

4-bit precision

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx

1 contributor

History: 2 commits

AlejandroOlmedo's picture

AlejandroOlmedo

Upload model.safetensors with huggingface_hub

6253db5 verified about 1 month ago