I ran a quantized GGUF (Q5_K_M) using llama.cpp on a Mac M2 with 24 GB. Extremely coherent chat and great quality! It recognizes Devanagari script as well as Roman transliterated text and responds correctly. Will dig in more. Some pictures from my custom chat client below; a rough sketch of an equivalent setup follows.
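If anyone wants to reproduce something similar in code rather than through the llama.cpp CLI, here's a minimal sketch using the llama-cpp-python bindings. The model path, context size, and prompt are placeholders, not from my actual run; this is just an illustration of loading a Q5_K_M GGUF with Metal offload on Apple Silicon.

```python
# Minimal sketch: load a Q5_K_M GGUF via llama-cpp-python and run one chat turn.
# Model path and n_ctx are placeholders; adjust to your model.
from llama_cpp import Llama

llm = Llama(
    model_path="model-Q5_K_M.gguf",  # placeholder path to the quantized GGUF
    n_gpu_layers=-1,                 # offload all layers to the GPU (Metal on Apple Silicon)
    n_ctx=4096,                      # context window; set to what the model supports
)

# Chat-style request; Devanagari input is handled the same way as Roman text,
# since the tokenizer works directly on UTF-8 text.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "नमस्ते, आप कैसे हैं?"}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```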