Qwen2.5-VL-3B-Instruct
Extract text from images
Generate responses using a chat model
Generate math answers from images