Safetensors
gemma
GPM-Gemma-2B / README.md
kirigayahitsugi's picture
Update README.md
2631853 verified
|
raw
history blame
711 Bytes
metadata
license: apache-2.0

Introduction

This reward model is finetuned from the google/gemma-2b-it using the dataset Skywork/Skywork-Reward-Preference-80K-v0.1

Evaluation

This reward model is evaluated using evaluation code adapted from RewardBench. For detailed code information, please refer to general-preference-model.

Usage

Please refer to general-preference-model for detailed usage instructions.