-
Sparse Finetuning for Inference Acceleration of Large Language Models
Paper • 2310.06927 • Published • 14 -
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
Paper • 2301.00774 • Published • 3 -
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
Paper • 2203.07259 • Published • 3 -
How Well Do Sparse Imagenet Models Transfer?
Paper • 2111.13445 • Published • 1
Collections
Discover the best community collections!
Collections trending this week
-
Teach LLMs to Personalize -- An Approach inspired by Writing Education
Paper • 2308.07968 • Published • 26 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper • 2309.00071 • Published • 66 -
Enable Language Models to Implicitly Learn Self-Improvement From Data
Paper • 2310.00898 • Published • 23