llama-hqq-1-bit / README.md
chrispie's picture
Update README.md
49b5be2 verified

A newer version of the Gradio SDK is available: 5.13.1

Upgrade
metadata
title: Llama Hqq 1 Bit
emoji: πŸ“Š
colorFrom: green
colorTo: pink
sdk: gradio
sdk_version: 4.24.0
app_file: app.py
license: llama2
train: false
inference: false
pipeline_tag: text-generation

Demo for HQQ 1-bit quantized (binary weights) Llama2-7B-chat model using a low-rank adapter to improve the performance (referred to as HQQ+). You will need a GPU for this.

https://huggingface.co/mobiuslabsgmbh/Llama-2-7b-chat-hf_1bitgs8_hqq