Update README.md
Browse files
README.md
CHANGED
@@ -112,7 +112,7 @@ Since the compared PRMs have not been trained on code-related tasks, this sectio
|
|
112 |
|
113 |
| Reward Model | Method | MBPP | MBPP+ | HumanEval | HumanEval+ | LiveCodeBench-2024.01-2024-11 |
|
114 |
|--------------------------|-------------------------|-------|-------|-----------|------------|-------------------------------|
|
115 |
-
| N/A | Greedy Sampling Pass@1 |
|
116 |
| Skywork-o1-Open-PRM-7B | Best-of-N@64 | **81.2** | **68.5** | 81.1 | 74.4 | **31.3** |
|
117 |
|
118 |
|
|
|
112 |
|
113 |
| Reward Model | Method | MBPP | MBPP+ | HumanEval | HumanEval+ | LiveCodeBench-2024.01-2024-11 |
|
114 |
|--------------------------|-------------------------|-------|-------|-----------|------------|-------------------------------|
|
115 |
+
| N/A | Greedy Sampling Pass@1 | 79.9 | 65.9 | **82.9** | **78.7** | 26.0 |
|
116 |
| Skywork-o1-Open-PRM-7B | Best-of-N@64 | **81.2** | **68.5** | 81.1 | 74.4 | **31.3** |
|
117 |
|
118 |
|