Need GGUF support

#2
by huntz47 - opened

If anyone from the Apple team is seeing this, please add a GGUF format for this model.
Thank you!

I want that too

Hey hey @huntz47 & @sdyy - sorry for the delayed response. OpenELM is supported in llama.cpp!

I created some quants for the instruct models here:
450M - https://huggingface.co/reach-vb/OpenELM-450M-Instruct-Q8_0-GGUF
1.1B - https://huggingface.co/reach-vb/OpenELM-1_1B-Instruct-Q8_0-GGUF
3B - https://huggingface.co/reach-vb/OpenELM-3B-Instruct-Q8_0-GGUF

Note: I found quite a bit of quality degradation below Q8, but if you want to create other quants, feel free to use the GGUF-my-repo space: https://huggingface.co/spaces/ggml-org/gguf-my-repo

The inference instructions are in the model cards. Enjoy, and do let me know if you have any questions!
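
For anyone who wants a quick way to try one of these quants from Python, here is a minimal sketch using llama-cpp-python (not from the original replies). The GGUF filename passed to hf_hub_download is an assumption, so check the file listing in the model repo for the exact name:

```python
# Minimal sketch: download the 450M instruct Q8_0 GGUF and run a short completion.
# NOTE: the filename below is an assumption - check the repo's file listing.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

model_path = hf_hub_download(
    repo_id="reach-vb/OpenELM-450M-Instruct-Q8_0-GGUF",
    filename="openelm-450m-instruct-q8_0.gguf",  # assumed filename
)

# Load the GGUF and generate a few tokens.
llm = Llama(model_path=model_path, n_ctx=2048)
out = llm("Once upon a time there was", max_tokens=64)
print(out["choices"][0]["text"])
```

The model cards linked above also include llama.cpp CLI instructions if you prefer running it outside Python.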
