Inference Providers
Active filters: vLLM
mistralai/Mistral-Medium-3.5-128B
128B • Updated • 323k
• 386
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 8.94k
• 79
QuantTrio/GLM-5.2-Int4-Int8Mix
Text Generation
• 785B • Updated • 56.9k
• 7
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 140k
• 403
QuantTrio/Qwen3.6-27B-AWQ-6Bit
Image-Text-to-Text
• 28B • Updated • 41.8k
• 14
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 461k
• 12
QuantTrio/Qwen3.6-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 1.35M
• 18
cyankiwi/Mistral-Medium-3.5-128B-AWQ-INT4
25B • Updated • 3.14k
• 4
Text Generation
• 426B • Updated • 46
• 1
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 98
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 14
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 102
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 161
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 8
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 72
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 211
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 113
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 9
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 1.76k
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 9
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 1.59k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 186
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 75
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 84
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 155k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 2.16k
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 368
• 4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 877
• 1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
• 4B • Updated • 19
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
• 8B • Updated • 32