Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
RLHFlow
's Collections
Decision-Tree Reward Models
RLHFlow MATH Process Reward Model
Standard-format-preference-dataset
Mixture-of-preference-reward-modeling
RM-Bradley-Terry
PM-pair
Online RLHF
RLHFLow Reward Models
SFT Models
Decision-Tree Reward Models
updated
6 days ago
Upvote
1
RLHFlow/Decision-Tree-Reward-Gemma-2-27B
Text Classification
•
Updated
18 days ago
•
65
•
3
RLHFlow/Decision-Tree-Reward-Llama-3.1-8B
Text Classification
•
Updated
18 days ago
•
335
•
3
RLHFlow/LLM-Preferences-HelpSteer2
Viewer
•
Updated
6 days ago
•
9.13k
•
43
Upvote
1
Share collection
View history
Collection guide
Browse collections