Update README.md
README.md CHANGED
@@ -32,7 +32,6 @@ It is the result of merging and/or converting the source repository to float16.
 ## Repositories available
 
 * [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/llama-2-70b-Guanaco-QLoRA-GPTQ)
-* [GGML experimental 4, 5, 6 and 8-bit models for CPU only inference](https://huggingface.co/TheBloke/llama-2-70b-Guanaco-QLoRA-GGML)
 * [Merged fp16 model in pytorch model format for GPU inference and further conversions](https://huggingface.co/TheBloke/llama-2-70b-Guanaco-QLoRA-fp16)
 * [Original QLoRA model](https://huggingface.co/Mikael110/llama-2-70b-guanaco-qlora)
 