kirigayahitsugi
committed on
Update README.md
README.md
CHANGED
@@ -1,3 +1,9 @@
----
-license: apache-2.0
----
+---
+license: apache-2.0
+---
+# Introduction
+This reward model is fine-tuned from [google/gemma-2b-it](https://huggingface.co/google/gemma-2b-it) on the [Skywork/Skywork-Reward-Preference-80K-v0.1](https://huggingface.co/datasets/Skywork/Skywork-Reward-Preference-80K-v0.1) dataset.
+# Evaluation
+This reward model is evaluated using code adapted from [RewardBench](https://github.com/allenai/reward-bench). For the detailed evaluation code, please refer to [general-preference-model](https://github.com/general-preference/general-preference-model).
+# Usage
+Please refer to [general-preference-model](https://github.com/general-preference/general-preference-model) for detailed usage instructions.
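For readers unfamiliar with how a reward model is scored on preference pairs, here is a minimal sketch of a RewardBench-style pairwise accuracy: the fraction of (chosen, rejected) pairs in which the model gives the chosen response the higher score. The function name `pairwise_accuracy` and the example scores are hypothetical and are not taken from the RewardBench or general-preference-model code; the actual evaluation pipeline lives in the linked repositories.

```python
from typing import Sequence


def pairwise_accuracy(
    chosen_scores: Sequence[float],
    rejected_scores: Sequence[float],
) -> float:
    """Return the fraction of pairs where the chosen response outscores the rejected one.

    Hypothetical helper for illustration only; the scores are assumed to be
    scalar rewards already produced by some reward model.
    """
    if len(chosen_scores) != len(rejected_scores):
        raise ValueError("chosen_scores and rejected_scores must have the same length")
    if len(chosen_scores) == 0:
        raise ValueError("at least one preference pair is required")

    # A pair counts as correct only when the chosen score is strictly higher.
    wins = sum(c > r for c, r in zip(chosen_scores, rejected_scores))
    return wins / len(chosen_scores)


# Made-up scores for four preference pairs (not real model outputs).
chosen = [1.2, 0.3, -0.5, 2.0]
rejected = [0.4, 0.9, -1.0, 1.5]
print(pairwise_accuracy(chosen, rejected))  # 3 of 4 pairs ranked correctly -> 0.75
```

Ties are counted as errors in this sketch; a real evaluation may treat them differently.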