OpenRLHF/Mistral-7b-PRM-Math-Shepherd


chuyi777 committed on Oct 30, 2024

Commit 79a5118 · verified · 1 Parent(s): 160b64d

Create README.md

Files changed (1)

README.md +5 -0

README.md ADDED

	@@ -0,0 +1,5 @@

+Process Reward Model trained by OpenRLHF
+```
+dataset Math-Shepherd
+Training accuracy 0.922
+```