Update README.md
Browse files
README.md
CHANGED
@@ -312,11 +312,7 @@ result = reward_model.judge(
|
|
312 |
|
313 |
## π Performance & Benchmarks
|
314 |
|
315 |
-
|
316 |
-
|
317 |
-
- **RM-Bench**: 92.3% accuracy (vs 87.1% for best baseline)
|
318 |
-
- **RABench**: 89.7% principle-following accuracy
|
319 |
-
- **HH-RLHF**: 94.2% alignment with human preferences
|
320 |
|
321 |
## π Documentation
|
322 |
|
|
|
312 |
|
313 |
## π Performance & Benchmarks
|
314 |
|
315 |
+
Please refer to our paper for performance metrics and comparison.
|
|
|
|
|
|
|
|
|
316 |
|
317 |
## π Documentation
|
318 |
|