Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
xunzhou
xunzhou
Follow
0 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
authored
a paper
29 days ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
authored
a paper
5 months ago
Hyper-Connections
View all activity
Organizations
Papers
5
arxiv:
2502.15499
arxiv:
2501.16975
arxiv:
2409.19606
arxiv:
2406.08657
Expand 5 papers
models
None public yet
datasets
None public yet