AI & ML interests

LLM

Recent Activity

ngc7293  updated a collection 12 days ago
MOSS Embodied Planner
ngc7293  updated a collection 13 days ago
MOSS Embodied Planner
ngc7293  updated a collection 17 days ago
MOSS Embodied Planner
View all activity

fnlp 's collections 6

MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"