-
-
-
-
-
-
Inference Providers
Active filters:
awq
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
665k
•
99
QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
•
33B
•
Updated
•
856
•
7
Qwen/Qwen2.5-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
13B
•
Updated
•
29.9k
•
62
Qwen/Qwen3-8B-AWQ
Text Generation
•
2B
•
Updated
•
36.1k
•
19
openbmb/MiniCPM-V-4-AWQ
Image-Text-to-Text
•
1B
•
Updated
•
240
•
9
QuantTrio/Qwen3-235B-A22B-Thinking-2507-AWQ
Text Generation
•
33B
•
Updated
•
1.99k
•
2
TheBloke/wizard-vicuna-13B-AWQ
Text Generation
•
2B
•
Updated
•
13
•
2
amazon/MistralLite-AWQ
Text Generation
•
1B
•
Updated
•
14
•
5
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
2B
•
Updated
•
246k
•
74
Qwen/Qwen2.5-Coder-7B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
83.8k
•
14
AMead10/Llama-3.2-1B-Instruct-AWQ
Text Generation
•
0.7B
•
Updated
•
1.11k
•
6
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
•
1B
•
Updated
•
343k
•
47
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
3B
•
Updated
•
372k
•
83
OPEA/Mistral-Small-3.1-24B-Instruct-2503-int4-AutoRound-awq-sym
5B
•
Updated
•
186k
•
16
Orion-zhen/Qwen3-0.6B-AWQ
0.2B
•
Updated
•
1.36k
•
2
Qwen/Qwen3-14B-AWQ
Text Generation
•
3B
•
Updated
•
76k
•
29
Qwen/Qwen2.5-Omni-7B-AWQ
Any-to-Any
•
5B
•
Updated
•
14.1k
•
11
gghfez/Mistral-Small-3.2-24B-Instruct-hf-AWQ
Text Generation
•
4B
•
Updated
•
424
•
3
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ
Text Generation
•
66B
•
Updated
•
2.19k
•
3
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation
•
9B
•
Updated
•
750
•
2
twhitworth/gpt-oss-120b-awq-w4a16
117B
•
Updated
•
1
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
13
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
10
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
6
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
7
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
6
casperhansen/opt-125m-awq
Text Generation
•
0.1B
•
Updated
•
1.1k
•
3
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
3.98k
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation
•
Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation
•
1B
•
Updated
•
7.35k
•
23