microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 1 day ago • 472k • 1.13k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 18 days ago • 1.6M • • 1.03k
SliceGPT: Compress Large Language Models by Deleting Rows and Columns Paper • 2401.15024 • Published Jan 26, 2024 • 72
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning Paper • 2303.02861 • Published Mar 6, 2023 • 2