7B-Q5_K_M.gguf Extremely coherent first chat

#3
by bharatcoder - opened

I ran the Q5_K_M quantized GGUF with llama.cpp on a Mac M2 (24 GB). The chat is extremely coherent and the quality is high! It recognizes Devanagari script as well as Roman-transliterated text and responds correctly to both. I will dig in more. Here are some screenshots from my custom chat client.

Screenshot 2024-05-02 at 7.18.18 PM.png

Screenshot 2024-05-02 at 7.10.47 PM.png
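
For anyone who wants to repeat the script check on a larger batch of prompts, here is a minimal Python sketch. It is not part of the test above: `classify_script` and the sample prompts are hypothetical, made up for illustration. It only labels a prompt by whether its letters fall in the Devanagari Unicode block (U+0900 to U+097F) or are ASCII Latin, so the model's replies can be grouped by input script.

```python
# Devanagari Unicode block: U+0900 to U+097F (letters, vowel signs, danda, digits).
DEVANAGARI_START = 0x0900
DEVANAGARI_END = 0x097F


def classify_script(text: str) -> str:
    """Label text as 'devanagari', 'roman', 'mixed', or 'other'.

    Hypothetical helper: it only looks at code point ranges and does not
    try to detect the language itself.
    """
    has_devanagari = False
    has_latin = False
    for ch in text:
        if DEVANAGARI_START <= ord(ch) <= DEVANAGARI_END:
            has_devanagari = True
        elif ch.isascii() and ch.isalpha():
            has_latin = True

    if has_devanagari and has_latin:
        return "mixed"
    if has_devanagari:
        return "devanagari"
    if has_latin:
        return "roman"
    return "other"


# Illustrative prompts only; these are not the ones from the screenshots.
sample_prompts = [
    "नमस्ते, आप कैसे हैं?",
    "Namaste, aap kaise hain?",
    "मुझे Python सिखाओ",
]

for prompt in sample_prompts:
    print(classify_script(prompt), "|", prompt)
```

Each labelled prompt can then be sent to the model through whatever client is in use, and the replies compared per script.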
