transformers torch safetensors gradio llama-cpp-python