GGUF version please

by HR1777 - opened Dec 20, 2023

Discussion

HR1777

Dec 20, 2023

@TheBloke Please release the GGUF version of this great model. I am sure many users would like to use it.

avideci

Dec 20, 2023

•

edited Dec 20, 2023

@HR1777 https://huggingface.co/Deci/DeciLM-7B-instruct-GGUF

HR1777

Dec 20, 2023

@HR1777 https://huggingface.co/Deci/DeciLM-7B-instruct-GGUF

Thank you for that, but is there any chance we could get some lighter versions like Q6_K too? I know Q8_0 is more accurate, but it runs slow on average computers especially those without GPU.

HR1777

Dec 20, 2023

No need for Q6_K as i am sure you are experts and Q8_0 is better. I close this thread.

HR1777 changed discussion status to closed Dec 20, 2023

avideci

Dec 20, 2023

@HR1777 Lower quantization than Q8_0 did not preserve accuracy in a usable manner.
Q8_0 was the lower checkpoint that generated relevant outputs. Anything below could only answer 1+1=2, etc - but not generate real-world grade text.

HR1777

Dec 20, 2023

Yes, i understand it. Thank you so much.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment