langchain
fastapi
uvicorn
gptcache
transformers
accelerate
bitsandbytes
sentence-transformers
scikit-learn
langchain_community
openai
vllm==0.2.2
vllm[all]==0.2.2