Skywork
/

Skywork-o1-Open-PRM-Qwen-2.5-1.5B

Text Classification

Model card Files Files and versions Community

chrisliu298 commited on Nov 26, 2024

Commit

ff174cb

·

verified ·

1 Parent(s): ef7fa22

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -121,7 +121,7 @@ Since the compared PRMs have not been trained on code-related tasks, this sectio
 | Reward Model             | Method                  | MBPP  | MBPP+ | HumanEval | HumanEval+ | LiveCodeBench-2024.01-2024-11 |
 |--------------------------|-------------------------|-------|-------|-----------|------------|-------------------------------|
 | N/A                      | Greedy Sampling Pass@1 | 81.7  | 69.3  | **84.8**  | **78.0**   | 25.3                          |
-| Skywork-o1-Open-PRM-7B   | Best-of-N@64           | **84.9** | **72.5** | 83.5      | **78.0**       |                              |
 #### Llama3.1-8B-Instruct

 | Reward Model             | Method                  | MBPP  | MBPP+ | HumanEval | HumanEval+ | LiveCodeBench-2024.01-2024-11 |
 |--------------------------|-------------------------|-------|-------|-----------|------------|-------------------------------|
 | N/A                      | Greedy Sampling Pass@1 | 81.7  | 69.3  | **84.8**  | **78.0**   | 25.3                          |
+| Skywork-o1-Open-PRM-7B   | Best-of-N@64           | **84.9** | **72.5** | 83.5      | **78.0**       | **30.7**                             |
 #### Llama3.1-8B-Instruct