dtxw's Collections
RLHF
Updated Sep 8, 2023
Efficient RLHF: Reducing the Memory Usage of PPO
Paper • 2309.00754 • Published Sep 1, 2023 • 13