ziplab (ZIP Lab)

yefly

authored 5 papers 3 months ago

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

Paper • 2310.03270 • Published Oct 5, 2023

Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM

Paper • 2310.04836 • Published Oct 7, 2023 • 1

Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization

Paper • 2204.04215 • Published Apr 8, 2022 • 1

BiViT: Extremely Compressed Binary Vision Transformer

Paper • 2211.07091 • Published Nov 14, 2022

MiniCache: KV Cache Compression in Depth Dimension for Large Language Models

Paper • 2405.14366 • Published May 23, 2024 • 2

chenfeng1271

authored a paper 3 months ago

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

Paper • 2410.08584 • Published Oct 11, 2024 • 12

yefly

authored a paper 3 months ago

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

Paper • 2410.08584 • Published Oct 11, 2024 • 12

chenfeng1271

authored 3 papers 3 months ago

InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation

Paper • 2407.10061 • Published Jul 14, 2024

KMM: Key Frame Mask Mamba for Extended Motion Generation

Paper • 2411.06481 • Published Nov 10, 2024 • 4

ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

Paper • 2412.04062 • Published Dec 5, 2024 • 9

yefly

authored 2 papers 3 months ago

ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

Paper • 2412.04062 • Published Dec 5, 2024 • 9

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Paper • 2411.18499 • Published Nov 27, 2024 • 18

zizhpan

authored a paper 4 months ago

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 30

zizhpan

authored a paper 5 months ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 34

BohanZ

authored 2 papers 6 months ago

LongVLM: Efficient Long Video Understanding via Large Language Models

Paper • 2404.03384 • Published Apr 4, 2024

MiniCache: KV Cache Compression in Depth Dimension for Large Language Models

Paper • 2405.14366 • Published May 23, 2024 • 2

BohanZ

authored a paper 7 months ago

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Paper • 2408.03361 • Published Aug 6, 2024 • 86

liujingcs

authored 3 papers 11 months ago

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Paper • 2310.08041 • Published Oct 12, 2023 • 1

Mesa: A Memory-saving Training Framework for Transformers

Paper • 2111.11124 • Published Nov 22, 2021 • 1

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

Paper • 2311.16503 • Published Nov 27, 2023

ZIP Lab

AI & ML interests

ziplab's activity

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM

Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization

BiViT: Extremely Compressed Binary Vision Transformer

MiniCache: KV Cache Compression in Depth Dimension for Large Language Models

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation

KMM: Key Frame Mask Mamba for Extended Motion Generation

ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

LongVLM: Efficient Long Video Understanding via Large Language Models

MiniCache: KV Cache Compression in Depth Dimension for Large Language Models

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Mesa: A Memory-saving Training Framework for Transformers

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

AI & ML interests

Team members 7

ziplab's activity