PKU-Alignment/beaver-7b-v1.0-reward

Reinforcement Learning

reinforcement-learning-from-human-feedback

beaver-7b-v1.0-reward / README.md

Commit History

Update README.md

375cd6a

XuehaiPan committed on Apr 20, 2024

Update README.md

b352642

RuiyangSun committed on Jul 12, 2023

docs: update readme

6e7ed4d

RuiyangSun committed on Jul 10, 2023

docs: update readme

5ea6c15

RuiyangSun committed on Jul 10, 2023

docs: update readme

8def050

RuiyangSun committed on Jul 10, 2023

initial commit

7fae170

RuiyangSun committed on Jul 8, 2023