WisdomShell/RewardAnything-8B-v1

Text Generation

principle-following

zhuohaoyu committed on Jun 4

Commit 80a8a96 · verified · 1 Parent(s): a742dfb

Update README.md

Files changed (1)

README.md +1 -5

@@ -312,11 +312,7 @@ result = reward_model.judge(
 ## 📈 Performance & Benchmarks
-RewardAnything achieves state-of-the-art performance on multiple benchmarks:
-- **RM-Bench**: 92.3% accuracy (vs 87.1% for best baseline)
-- **RABench**: 89.7% principle-following accuracy
-- **HH-RLHF**: 94.2% alignment with human preferences
+Please refer to our paper for performance metrics and comparison.
 ## 📚 Documentation