Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ubermenchh
/
llama3.1-8B-gsm8k-grpo
like
0
PyTorch
Safetensors
GGUF
llama
unsloth
trl
grpo
Inference Endpoints
conversational
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
llama3.1-8B-gsm8k-grpo
File size: 57 Bytes
99dce61
b42217f
99dce61
1
2
3
4
5
6
7
8
---
license:
mit
tags:
-
unsloth
-
trl
-
grpo
---