The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."
-
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • Updated • 14 -
hkust-nlp/R1-Distill-Verifier-1.5B
Updated • 13 -
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • Updated • 11 -
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • Updated • 11