AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx

Text Generation

Generated from Trainer

text-generation-inference

8-bit precision

1 contributor

History: 5 commits

AlejandroOlmedo: Upload tokenizer_config.json with huggingface_hub (33c4e2a, verified, 6 months ago)

| File | Size | LFS | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 1.52 kB | - | initial commit | 6 months ago |
| model-00001-of-00002.safetensors | 5.32 GB | LFS | Upload model-00001-of-00002.safetensors with huggingface_hub | 6 months ago |
| model-00002-of-00002.safetensors | 2.78 GB | LFS | Upload model-00002-of-00002.safetensors with huggingface_hub | 6 months ago |
| model.safetensors.index.json | 62.7 kB | - | Upload model.safetensors.index.json with huggingface_hub | 6 months ago |
| tokenizer_config.json | 6.86 kB | - | Upload tokenizer_config.json with huggingface_hub | 6 months ago |