-
How to Train Data-Efficient LLMs
Paper • 2402.09668 • Published • 43 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 80 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
peng
superpeng
·
AI & ML interests
None yet
Recent Activity
liked
a model
17 days ago
baichuan-inc/Baichuan-M2-32B
liked
a dataset
22 days ago
Intelligent-Internet/II-Medical-Reasoning-SFT
upvoted
a
collection
22 days ago
II-Medical
Organizations
None yet