TinyLlama-1.1B-Chat-v0.3-AWQ / quant_config.json

Commit History

Upload quant_config.json with huggingface_hub
df57b5e

RonanMcGovern committed on