sarthak247
/

qwen2.5-grpo-gsm8k-250steps-lora-adapters

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

qwen2.5-grpo-gsm8k-250steps-lora-adapters / merges.txt

sarthak247's picture

Trained with Unsloth

cd55218 verified 4 days ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.