daiwei chen
daiweichen
AI & ML interests
representation learning, foundation models, preference learning
Recent Activity
liked a dataset 7 days ago
HannahRoseKirk/prism-alignment
liked a model 9 days ago
google/gemma-2-2b-it
new activity about 1 month ago
meta-llama/Llama-3.2-1B: Attention doesn't work for all layers except for the first layer
daiweichen's activity
Attention doesn't work for all layers except for the first layer
#79 opened about 1 month ago by daiweichen