xunzhou
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
authored a paper 29 days ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
authored a paper 5 months ago
Hyper-Connections
xunzhou's activity
No public activity