chuyi777 commited on
Commit
41d1ad8
·
verified ·
1 Parent(s): 79a5118

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -1,5 +1,5 @@
1
  Process Reward Model trained by OpenRLHF
2
- ```
3
- dataset Math-Shepherd
4
- Training accuracy 0.922
5
- ```
 
1
  Process Reward Model trained by OpenRLHF
2
+
3
+ - Dataset: Math-Shepherd (https://huggingface.co/datasets/peiyi9979/Math-Shepherd)
4
+ - Learning Rate: 1e-6
5
+ - Training Accuracy: 0.922