valenradovich
/

gemma-2-9b-it-Q4_0-gguf

Inference Endpoints

Model card Files Files and versions Community

gemma-2-9b-it-Q4_0-gguf / README.md

valenradovich's picture

Create README.md

34b2922 verified 7 months ago

|

324 Bytes

	# gemma-2-9B-it-q4_0

	This is a quantized version of the Gemma2 9B instruct model using the Q4_0 quantization method.

	## Model Details

	- Original Model: [Gemma2-9B-it](https://huggingface.co/google/gemma-2-9b-it)
	- Quantization Method: Q4_0
	- Precision: 4-bit

	## Usage

	You can use it directly with llama.cpp