arxiv:2411.10958
Jianfei Chen
surfingtomchen
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
SageAttention2 Technical Report: Accurate 4 Bit Attention for
Plug-and-play Inference Acceleration
Organizations
None yet