RLHF Workflow: From Reward Modeling to Online RLHF • Paper • 2405.07863 • Published May 13, 2024 • 67
Seminal AI Papers • Collection • A collection of top AI papers. • 17 items • Updated Feb 23, 2024 • 3
TrustLLM: Trustworthiness in Large Language Models • Paper • 2401.05561 • Published Jan 10, 2024 • 69