EXL2 quants of Qwen2-VL-72B-Instruct
4.00 bits per weight
4.50 bits per weight
5.00 bits per weight
6.00 bits per weight
(2.3bpw to 3.5bpw revisions are in also this repo, but they are unstable. Working on it.)
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.