Questions about data scale

#9
by masterLan - opened

How much data was used to train the final version of Qwen-2.5-MATH-PRM?
image.png
While reading the paper, I noticed that in Figure 8, the score for ProcessBench is 66.5. However, in the final results presented in Table 7, the score for Qwen-2.5-MATH-PRM is 73.5. Additionally, the data in Figure 8 does not match the data in Table 4, which raises some concerns.

image.png

image.png

Sign up or log in to comment