hf inference endpoint
#5
by tintin12
Has anyone tried deploying this through an HF Inference Endpoint? I get errors. I know the inference engine's command line has an option to pass a parameter indicating that it's an AWQ model, but the deployment interface doesn't provide such an option, so I get errors and can't run it.
No, I've never tried it on the hosted HF endpoints. Only with a local TGI deployment via a Docker container.
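For reference, this is roughly the kind of local invocation I mean; it's a sketch rather than an exact recipe, the model id, port, and volume path are placeholders, and it assumes the standard TGI container with its `--quantize awq` flag:

```bash
# Local TGI deployment with AWQ quantization.
# Placeholders: replace the model id and volume path with your own.
model=your-org/your-awq-model
volume=$PWD/data   # weights are cached here so they aren't re-downloaded each run

docker run --gpus all --shm-size 1g -p 8080:80 \
  -v $volume:/data \
  ghcr.io/huggingface/text-generation-inference:latest \
  --model-id $model \
  --quantize awq
```

Once it's up, a quick request such as `curl 127.0.0.1:8080/generate -X POST -H 'Content-Type: application/json' -d '{"inputs":"Hello","parameters":{"max_new_tokens":20}}'` should return generated text.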
If the hosted endpoint provides no way to specify the quantization method, then my guess would be that it's not supported, but beyond that I'm afraid I don't know. Maybe contact HF support?