Questions about data scale

by masterLan - opened 6 days ago

6 days ago

•

How much data was used to train the final version of Qwen-2.5-MATH-PRM?

While reading the paper, I noticed that in Figure 8, the score for ProcessBench is 66.5. However, in the final results presented in Table 7, the score for Qwen-2.5-MATH-PRM is 73.5. Additionally, the data in Figure 8 does not match the data in Table 4, which raises some concerns.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment