Group Robust Preference Optimization in Reward-free RLHF Paper • 2405.20304 • Published May 30, 2024 • 1