mengfanxu
fxmeng
Recent Activity
commented on a paper 27 days ago: TransMLA: Multi-head Latent Attention Is All You Need
commented on a paper 27 days ago: TransMLA: Multi-head Latent Attention Is All You Need
updated a collection 29 days ago: CLOVER-Commonsense-148k
Collections: 8
Models: 55
fxmeng/PiSSA-llama-7b-commonsense-148k • 20
fxmeng/PiSSA-Llama-3-8b-commonsense-148k • 15
fxmeng/PiSSA-Llama-2-7b-commonsense-148k • 16
fxmeng/PiSSA-llama-13b-commonsense-148k • 22
fxmeng/CLOVER-llama-3-8b-commonsense-148k • 11
fxmeng/CLOVER-llama-2-7b-commonsense-148k • 17
fxmeng/CLOVER-llama-13b-commonsense-148k • 16
fxmeng/CLOVER-llama-7b-commonsense-148k • 13
fxmeng/TransMLA_qwen2.5_0.5b_instruct
fxmeng/TransMLA_llama3.2_1b_instruct
Datasets: 9
fxmeng/pissa-dataset • 844k • 788 • 2
fxmeng/big-bench-hard-continue-finetuning • 10.3k • 195
fxmeng/commonsense_filtered • 170k • 248 • 1
fxmeng/MetaMath-GSM240K • 240k • 71 • 1
fxmeng/MetaMath-MATH155K • 155k • 56
fxmeng/CodeFeedback-Python105K • 105k • 619 • 5
fxmeng/llava_finetune_336x336 • 62
fxmeng/llava_pretrain_336x336 • 54
fxmeng/WizardLM_evol_instruct_V2_143k • 143k • 84 • 2