thank you very much , I used to use vllm ,but it doesn't work with it.
yes please
Any specific ideas on how to infer with vllm?
you can't. it's tensorRT
Β· Sign up or log in to comment