Offline Actor-Critic Reinforcement Learning Scales to Large Models Paper • 2402.05546 • Published Feb 8 • 4