File size: 524 Bytes
c76260e
6bda253
c76260e
6bda253
 
 
 
d4ad43e
c76260e
0a591a1
 
d4ad43e
6bda253
0a591a1
6bda253
0a591a1
6bda253
0a591a1
6bda253
c76260e
6bda253
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
see help:

or 

Follow steps:

1 Download `turingevo/Qwen2-VL-2B-Instruct-gguf`

2 `git clone https://github.com/ggerganov/llama.cpp.git`  

then build and get target `llama-qwen2vl-cli`

3 Get pictures: 

 It's recommended to resize the image to a resolution below 640x640, so it won't take forever to run on CPU backend:
 
 `ffmpeg -i input.jpeg -vf "scale=512:512" 1.png`
 

4 cmd: `llama-qwen2vl-cli -m Qwen2-VL-2B-Instruct-F16.gguf --mmproj qwen2-vl-2b-instruct-vision.gguf -p "Describe this image" --image "1.png"`