Does vLLM support batch inference of models?
Yes, just pass a list of prompts/messages instead of a single one.
AFAIK batching works with the vLLM Python `LLM` object ("offline mode"), but online mode (the OpenAI-compatible server) will return an error if you try to submit more than one message list in a single request.
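For reference, here's a minimal sketch of offline batch inference with the `LLM` object (the model name and prompts are just examples, swap in your own):

```python
from vllm import LLM, SamplingParams

# A batch of prompts submitted together in one call.
prompts = [
    "Write a haiku about the ocean.",
    "Explain what a KV cache is in one sentence.",
    "List three uses of Python.",
]

sampling_params = SamplingParams(temperature=0.7, max_tokens=128)

# The offline LLM object accepts a list of prompts and batches them internally.
llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    print(output.prompt)
    print(output.outputs[0].text)
```

In online mode you instead send one request per prompt/conversation; the server still batches concurrent requests under the hood.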