-
-
-
-
-
-
Inference Providers
Active filters:
vLLM
Text Generation
•
358B
•
Updated
•
20.1k
•
18
QuantTrio/MiniMax-M2.1-AWQ
Text Generation
•
229B
•
Updated
•
6.45k
•
8
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
1.7k
•
1
Text Generation
•
229B
•
Updated
•
380k
•
9
QuantTrio/MiniMax-M2-REAP-162B-A10B-AWQ
Text Generation
•
162B
•
Updated
•
551
•
3
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
2.59k
•
9
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
•
9B
•
Updated
•
53
•
6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
•
9B
•
Updated
•
19
•
2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
•
73B
•
Updated
•
55
•
2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
•
69B
•
Updated
•
56
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
•
15B
•
Updated
•
3
•
2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B
•
Updated
•
80
•
1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B
•
Updated
•
60
•
1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
•
0.6B
•
Updated
•
378
•
1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
•
0.6B
•
Updated
•
19
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
461
•
1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
15
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
•
33B
•
Updated
•
707
•
3
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
•
33B
•
Updated
•
274
•
3
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
5B
•
Updated
•
24
•
1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
•
15B
•
Updated
•
87
•
1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
•
15B
•
Updated
•
691
•
4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
96
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
2.18k
•
4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
288
•
1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
•
4B
•
Updated
•
6
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
7.28k
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8
Text Generation
•
235B
•
Updated
•
46
BeastyZ/Qwen2.5-3B-ConvSearch-R1-TopiOCQA
3B
•
Updated
•
4
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix
Text Generation
•
11B
•
Updated
•
62
•
3