EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models Paper • 2310.03270 • Published Oct 5, 2023
Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM Paper • 2310.04836 • Published Oct 7, 2023 • 1
Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization Paper • 2204.04215 • Published Apr 8, 2022 • 1
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models Paper • 2405.14366 • Published May 23 • 1
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published Oct 11 • 12
ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality Paper • 2412.04062 • Published 22 days ago • 7
GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation Paper • 2411.18499 • Published 30 days ago • 18
GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation Paper • 2411.18499 • Published 30 days ago • 18
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published Oct 11 • 12
DragAnything: Motion Control for Anything using Entity Representation Paper • 2403.07420 • Published Mar 12 • 13