YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
LLama2 Model with FP8 KV Cache checkpoint for TRTLM
Generated using https://github.com/vllm-project/vllm/blob/main/examples/fp8/quantizer/quantize.py
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.