Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
OpenRLHF
/
Mistral-7b-PRM-Math-Shepherd
like
1
Follow
OpenRLHF
24
Safetensors
mistral
Model card
Files
Files and versions
Community
1
Train
chuyi777
commited on
Oct 30, 2024
Commit
79a5118
•
1 Parent(s):
160b64d
Create README.md
Browse files
Files changed (1)
hide
show
README.md
+5
-0
README.md
ADDED
Viewed
@@ -0,0 +1,5 @@
1
+
Process Reward Model trained by OpenRLHF
2
+
```
3
+
dataset Math-Shepherd
4
+
Training accuracy 0.922
5
+
```