Hugging Face
Haoze Wu
WaitHZ
0 followers · 1 following
https://waithz.github.io/
HaozeWu7
WaitHZ
AI & ML interests
Modular DL, Complex Reasoning
Recent Activity
upvoted an article 3 days ago
How to generate text: using different decoding methods for language generation with Transformers
upvoted an article 5 days ago
You could have designed state of the art positional encoding
upvoted a paper 17 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Organizations
None yet
WaitHZ's activity
commented on a paper 17 days ago
Autonomy-of-Experts Models
Paper • 2501.13074 • Published 19 days ago • 40 • 5
New activity in deepseek-ai/deepseek-moe-16b-base 11 months ago
A little question about aux_loss • 2 • #4 opened 11 months ago by WaitHZ