|
see help: |
|
|
|
or |
|
|
|
Follow steps: |
|
|
|
1 Download `turingevo/Qwen2-VL-2B-Instruct-gguf` |
|
|
|
2 `git clone https://github.com/ggerganov/llama.cpp.git` |
|
|
|
then build and get target `llama-qwen2vl-cli` |
|
|
|
3 Get pictures: |
|
|
|
It's recommended to resize the image to a resolution below 640x640, so it won't take forever to run on CPU backend: |
|
|
|
`ffmpeg -i input.jpeg -vf "scale=512:512" 1.png` |
|
|
|
|
|
4 cmd: `llama-qwen2vl-cli -m Qwen2-VL-2B-Instruct-F16.gguf --mmproj qwen2-vl-2b-instruct-vision.gguf -p "Describe this image" --image "1.png"` |
|
|
|
|
|
|
|
|