qwen2.5-vl-7b / qwen2.5-vl-3b / abliterated-caption-it
Qwen2.5-VL-3B-Instruct infer in T4 small GPU
Interact with a multimodal chatbot using text and images
Generate text by uploading images and asking questions