ScaleML-RLHF/Qwen2.5-Math-1.5B-raftpp-numina_math_15_all-GVMonce-n8-8-step120 2B • Updated May 12 • 1
ScaleML-RLHF/Qwen2.5-Math-1.5B-raftpp-numina_math_15_all-GVMonce-n8-8-step110 2B • Updated May 12 • 1
ScaleML-RLHF/Qwen2.5-Math-7B-raft-plusplus-numina_math_em-cliphigher0.35-n8-8-iter1 8B • Updated Apr 25 • 1