metadata

license: llama3
base_model:
  - gmonsoon/SahabatAI-MediChatIndo-8B-v1

SkylarWhite/SahabatAI-MediChatIndo-8B-v1-gguf

This is hosts various GGUF quantized versions of the gmonsoon/SahabatAI-MediChatIndo-8B-v1 model. This model is designed for medical and general-purpose conversational AI in Indonesian and is based on the LLaMA3 architecture. The GGUF format is optimized for efficient inference on low-resource devices and fast deployment.

Model Overview

SahabatAI-MediChatIndo-8B-v1 is a fine-tuned model created by merging:

It has been optimized for understanding and responding in medical and general Indonesian conversations.

GGUF Quantized Versions

The following GGUF quantized versions are available in this repository:

16-bit (F16): High-precision quantization for use cases requiring maximal accuracy.
Q4_K_M: Balanced between speed and performance, ideal for most use cases.
Q5_K_M: Improved precision over Q4 while maintaining efficient performance.
Q8_0: Full precision for demanding tasks where accuracy is critical.

Feedback and Contributions

Feedback and contributions are welcome! Please open an issue or contact the model's author for further discussions.