ISTA-DASLab/Meta-Llama-3-8B-Instruct-AQLM-2Bit-1x16 Text Generation • Updated Nov 8, 2024 • 162 • 12
ISTA-DASLab/Meta-Llama-3.1-70B-AQLM-PV-2Bit-1x16 Text Generation • Updated Sep 14, 2024 • 18 • 17
AQLM+PV (Collection): Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression" (https://arxiv.org/abs/2405.14852) • 25 items • Updated 27 days ago • 20
ISTA-DASLab/Meta-Llama-3-70B-AQLM-PV-1Bit-1x16 Text Generation • Updated Sep 14, 2024 • 143 • 1