Peng Wang
ZJUPeng
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
Group Sequence Policy Optimization
upvoted
a
paper
about 2 months ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective
Reinforcement Learning for LLM Reasoning
authored
a paper
2 months ago
Qwen3 Technical Report