R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests: Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
New activity, 11 days ago: SynthLabsAI/Big-Math-RL-Verified: wrong answer
Liked a dataset, 12 days ago: JINIAC-competition/OpenMathInstruct-2-augmented-math
Liked a dataset, 12 days ago: nvidia/OpenMathReasoning