Edit model card

LG AI EXAONE3 -> llamafy + 8K Context length -> 4bit awq quantized model

tested in vllm, just for use

''' vllm serve lee5j/EXAONE3-IT8k-llamafy-awq --enforce-eager --gpu-memory-utilization 0.99 '''

Lincense : https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct/blob/main/LICENSE

origin model : LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct

Downloads last month
33
Safetensors
Model size
1.77B params
Tensor type
I32
·
FP16
·
Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for lee5j/EXAONE3-IT8k-llamafy-awq

Quantized
(1)
this model