metadata
license: apache-2.0
Introduction
This reward model is fine-tuned from google/gemma-2b-it on the Skywork/Skywork-Reward-Preference-80K-v0.1 dataset.
Evaluation
This reward model is evaluated with code adapted from RewardBench. For the evaluation code itself, please refer to general-preference-model.
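As a rough illustration of what a RewardBench-style evaluation measures, the sketch below computes pairwise accuracy: the fraction of preference pairs in which the model scores the chosen response above the rejected one. It is not the adapted evaluation code from general-preference-model. The function name `pairwise_accuracy` and the example scores are hypothetical, and counting ties as incorrect is an assumption.

```python
from typing import Sequence


def pairwise_accuracy(
    chosen_scores: Sequence[float],
    rejected_scores: Sequence[float],
) -> float:
    """Fraction of pairs where the chosen response outscores the rejected one.

    Hypothetical helper for illustration only. Ties count as incorrect,
    which is an assumption, not something stated in the model card.
    """
    if len(chosen_scores) != len(rejected_scores):
        raise ValueError("chosen and rejected score lists must have equal length")
    if len(chosen_scores) == 0:
        raise ValueError("at least one preference pair is required")

    # A pair is correct when the reward for the chosen response is strictly higher.
    correct = sum(1 for c, r in zip(chosen_scores, rejected_scores) if c > r)
    return correct / len(chosen_scores)


# Made-up scores for four preference pairs (not real model outputs).
chosen = [1.2, 0.3, -0.5, 2.0]
rejected = [0.4, 0.9, -1.1, 1.5]

accuracy = pairwise_accuracy(chosen, rejected)
print(f"pairwise accuracy: {accuracy:.2f}")
```

In practice the two score lists would come from running the reward model on each (prompt, chosen) and (prompt, rejected) pair; how those scores are produced is covered by the usage instructions in general-preference-model.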
Usage
Please refer to general-preference-model for detailed usage instructions.