compressa-ai
/

Llama-3-8B-Instruct-OmniQuant

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

Llama-3-8B-Instruct-OmniQuant

Ctrl+K

Ctrl+K

2 contributors

History: 7 commits

Vasily Alexeev

add two stop toks in gen config

5413035 over 1 year ago

.gitattributes

1.52 kB

initial commit over 1 year ago
README.md

6.96 kB

add asymm quantized model, add two eos in code sample over 1 year ago
compressa-config.json

732 Bytes

add asymm quantized model, add two eos in code sample over 1 year ago
config.json

898 Bytes

add asymm quantized model, add two eos in code sample over 1 year ago
generation_config.json

131 Bytes

add two stop toks in gen config over 1 year ago
model-00001-of-00002.safetensors

4.68 GB
LFS

add asymm quantized model, add two eos in code sample over 1 year ago
model-00002-of-00002.safetensors

1.05 GB
LFS

add model weights and stuff over 1 year ago
model.safetensors.index.json

78.5 kB

add model weights and stuff over 1 year ago
quant_config.json

64 Bytes

add asymm quantized model, add two eos in code sample over 1 year ago
special_tokens_map.json

301 Bytes

add model weights and stuff over 1 year ago
tokenizer.json

9.08 MB

add model weights and stuff over 1 year ago
tokenizer_config.json

51.4 kB

kinda fix eos token to stop model from chatting with itself over 1 year ago