Article: Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • 12 days ago