AI & ML interests

LLM

Recent Activity

ngc7293 published a model 2 days ago
fnlp/Embodied_Planner-R1-Alfworld
ngc7293 published a model 2 days ago
fnlp/Embodied_R1-ScienceWorld
Cqy2019 updated a collection 2 days ago
MOSS

fnlp's collections (6)

MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"