Original model: https://huggingface.co/Doctor-Shotgun/Nous-Capybara-limarpv3-34B Using erotiquant-xl.parquet as calibration dataset. 5bpw - Can run 10k context in ~23.3GB VRAM with 8bit-cache option. Prompt format - extended alpaca with length modifier: ``` Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request. ### Instruction: {instruction} ### Input: {input} ### Response: (length = long) ```