arxiv:2410.23743
Ming Li
MingLiiii
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A
Gradient Perspective
Organizations
models
9
MingLiiii/cherry-alpaca-pre-experienced-7B
Text Generation
•
Updated
•
12
MingLiiii/cherry-wizardlm-filtered-7B
Text Generation
•
Updated
•
12
MingLiiii/cherry-wizardlm-40-percent-7B
Text Generation
•
Updated
•
13
MingLiiii/cherry-wizardlm-30-percent-7B
Text Generation
•
Updated
•
15
MingLiiii/cherry-wizardlm-20-percent-7B
Text Generation
•
Updated
•
12
MingLiiii/cherry-wizardlm-10-percent-7B
Text Generation
•
Updated
•
12
MingLiiii/cherry-alpaca-15-percent-7B
Text Generation
•
Updated
•
13
MingLiiii/cherry-alpaca-10-percent-7B
Text Generation
•
Updated
•
13
MingLiiii/cherry-alpaca-5-percent-7B
Text Generation
•
Updated
•
14
datasets
5
MingLiiii/Wiz70_Analysis_llama2_13b
Viewer
•
Updated
•
280k
•
36
MingLiiii/Wiz70_Analysis_llama2_7b
Viewer
•
Updated
•
280k
•
38
MingLiiii/Alpaca_Analysis_llama2_13b
Viewer
•
Updated
•
208k
•
37
MingLiiii/Alpaca_Analysis_llama2_7b
Viewer
•
Updated
•
208k
•
49
MingLiiii/cherry_wizardlm_filtered
Viewer
•
Updated
•
63.7k
•
36