[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-budget
#59
by
Gerald001
- opened
Please add the following model to the neuron cache
Llama 7b is already present in the cache: please go to the model card, select deploy and look at the Inferentia code snippet.
dacorvo
changed discussion status to
closed