Active filters: awq
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation • 685B • Updated • 2.8k • 6
gaunernst/gemma-3-12b-it-int4-awq
Image-Text-to-Text • 12B • Updated • 4.27k • 22
Qwen/Qwen3-4B-AWQ
Text Generation • 4B • Updated • 123k • 21
nn-tech/MetalGPT-1-AWQ
Text Generation • 33B • Updated • 98 • 2
TheBloke/mixtral-8x7b-v0.1-AWQ
Text Generation • 47B • Updated • 231 • 11
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation • 33B • Updated • 843k • 90
Qwen/Qwen2.5-Coder-1.5B-Instruct-AWQ
Text Generation • 2B • Updated • 109k • 4
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text • 8B • Updated • 152k • 95
gaunernst/gemma-3-4b-it-int4-awq
Image-Text-to-Text • Updated • 8.06k • 2
Qwen/Qwen3-32B-AWQ
Text Generation • 33B • Updated • 154k • 117
Qwen/Qwen3-14B-AWQ
Text Generation • 15B • Updated • 197k • 47
Qwen/Qwen3-8B-AWQ
Text Generation • 8B • Updated • 99.9k • 30
Qwen/Qwen2.5-Omni-7B-AWQ
Any-to-Any • 11B • Updated • 33.8k • 15
cpatonn/Devstral-Small-2507-AWQ-4bit
Text Generation • 24B • Updated • 14.7k • 9
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation • 14B • Updated • 169 • 4
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation • 31B • Updated • 13k • 2
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation • 685B • Updated • 54 • 4
QuantTrio/MiniMax-M2-AWQ
Text Generation • 229B • Updated • 43.7k • 7
IDEA-Research/Rex-Omni-AWQ
4B • Updated • 1.81k • 3
MidnightPhreaker/GLM-4.5-Air-REAP-82B-A12B-AWQ-4bit
Text Generation • 13B • Updated • 1.21k • 2
QuantTrio/DeepSeek-V3.2-Speciale-AWQ
Text Generation • 685B • Updated • 180 • 4
casperhansen/mpt-7b-8k-chat-awq
Text Generation • Updated • 30 • 3
casperhansen/falcon-7b-awq
Text Generation • Updated • 20 • 1
casperhansen/vicuna-7b-v1.5-awq
Text Generation • Updated • 21 • 3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation • Updated • 17 • 1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation • Updated • 15
casperhansen/opt-125m-awq
Text Generation • 0.2B • Updated • 815 • 3
casperhansen/tinyllama-1b-awq
Text Generation • Updated • 69
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation • Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation • 7B • Updated • 2k • 24