Rename inference-cache-config/llama.json to inference-cache-config/llama2.json f06a55a verified dacorvo HF staff commited on Apr 19, 2024
Create stable-diffusion.json (#43) 32561fe verified philschmid HF staff Jingya HF staff commited on Apr 4, 2024
Added Llama-70b batch_size 4 to inference cache 593822e verified dacorvo HF staff commited on Mar 8, 2024
Create inference-cache-config/llama.json 1960ccb verified philschmid HF staff commited on Mar 5, 2024