Xianyu
catqaq
AI & ML interests
Founder of OpenLLMAI, do something cool!
Efficient LLM, Alignment, Data efficiency, Parameter efficiency, RLHF
Recent Activity
upvoted a paper
about 1 month ago
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
new activity
about 1 month ago
OpenRLHF/Llama-3-8b-rm-700k: Improve model card: add tags, paper/code links, and usage example
updated a dataset
8 months ago
OpenRLHF/prompt-collection-v0.1-dev-100k