Could you please tell how to inference this model?

#4
by carlosbdw - opened

thank you very much , I used to use vllm ,but it doesn't work with it.

Any specific ideas on how to infer with vllm?

you can't. it's tensorRT

Sign up or log in to comment