Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aidando73
/
simplerl-v4-checkpoints
like
0
Safetensors
Model card
Files
Files and versions
Community
main
simplerl-v4-checkpoints
/
path
aidando73
Upload path with huggingface_hub
3673263
verified
about 2 months ago
raw
Copy download link
history
blame
contribute
delete
124 Bytes
/workspace/
simpleRL-reason
/checkpoints/
3
_Qwen_Qwen2.
5
-Math-
7
B_batch1024_rollout8_klcoef0.
0001
_entcoef0.
001
_simplelr_math_35