Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

compressa-ai
/
Llama-3-8B-Instruct-OmniQuant

Text Generation
Transformers
Safetensors
llama
llama3
omniquant
gptq
triton
conversational
text-generation-inference
4-bit precision
Model card Files Files and versions Community
Llama-3-8B-Instruct-OmniQuant
Ctrl+K
Ctrl+K
  • 2 contributors
History: 7 commits
Vasily Alexeev
add two stop toks in gen config
5413035 over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    6.96 kB
    add asymm quantized model, add two eos in code sample over 1 year ago
  • compressa-config.json
    732 Bytes
    add asymm quantized model, add two eos in code sample over 1 year ago
  • config.json
    898 Bytes
    add asymm quantized model, add two eos in code sample over 1 year ago
  • generation_config.json
    131 Bytes
    add two stop toks in gen config over 1 year ago
  • model-00001-of-00002.safetensors
    4.68 GB
    LFS
    add asymm quantized model, add two eos in code sample over 1 year ago
  • model-00002-of-00002.safetensors
    1.05 GB
    LFS
    add model weights and stuff over 1 year ago
  • model.safetensors.index.json
    78.5 kB
    add model weights and stuff over 1 year ago
  • quant_config.json
    64 Bytes
    add asymm quantized model, add two eos in code sample over 1 year ago
  • special_tokens_map.json
    301 Bytes
    add model weights and stuff over 1 year ago
  • tokenizer.json
    9.08 MB
    add model weights and stuff over 1 year ago
  • tokenizer_config.json
    51.4 kB
    kinda fix eos token to stop model from chatting with itself over 1 year ago