Penghui Qi
QPHutu
AI & ML interests
None yet
Recent Activity
updated
a collection
13 days ago
LLM Agent
upvoted
a
paper
about 1 month ago
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
upvoted
a
paper
about 2 months ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via
Multi-Agent Multi-Turn Reinforcement Learning