Active filters: 8-bit
Text Generation • 120B • 3.1M • 4.45k
Text Generation • 22B • 5.98M • 4.31k
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 • Text Generation • 18B • 91k • 76
microsoft/bitnet-b1.58-2B-4T • Text Generation • 0.8B • 6.44k • 1.29k
GadflyII/GLM-4.7-Flash-NVFP4 • Text Generation • 18B • 266k • 58
mlx-community/Qwen3-Coder-Next-8bit • Text Generation • 80B • 725 • 6
openai/gpt-oss-safeguard-20b • Text Generation • 22B • 33.7k • 190
mlx-community/Qwen3-ASR-1.7B-8bit • 0.8B • 503 • 7
MultiverseComputingCAI/HyperNova-60B • Text Generation • 60B • 1.5k • 51
unsloth/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 • Text Generation • 18B • 241 • 6
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4 • Text Generation • 133B • 14k • 10
Text Generation • 177B • 3.88k • 14
GadflyII/GLM-4.7-Flash-MXFP4 • Text Generation • 18B • 9.2k • 8
inferencerlabs/Qwen3-Coder-Next-MLX-9bit • Text Generation • 80B • 764 • 3
MaziyarPanahi/Qwen3-14B-GGUF • Text Generation • 15B • 262k • 7
nvidia/Qwen3-30B-A3B-NVFP4 • Text Generation • 16B • 23.6k • 23
Text Generation • 22B • 31.2k • 41
openai/gpt-oss-safeguard-120b • Text Generation • 120B • 25.9k • 84
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD • Image-Text-to-Text • 8B • 36.2k • 17
kldzj/gpt-oss-120b-heretic-v2 • Text Generation • 117B • 378 • 19
lukealonso/MiniMax-M2.1-NVFP4 • 115B • 27.3k • 21
Image-Text-to-Text • 62B • 5.82k • 4
lmstudio-community/GLM-4.7-Flash-MLX-8bit • Text Generation • 30B • 778k • 7
mlx-community/Qwen3-TTS-12Hz-1.7B-CustomVoice-8bit • Text-to-Speech • 0.8B • 935 • 4
CalamitousFelicitousness/HunyuanImage-3.0-Instruct-Distil-SDNQ-4bit-dynamic • Image-to-Image • 45B • 72 • 2
DeathGodlike/SicariusSicariiStuff_Assistant-Pepe-8B_EXL3 • Text Generation • 2 • 2
mlx-community/GLM-OCR-8bit • Image-to-Text • 0.6B • 520 • 2
EpistemeAI/rsi-gpt-oss-120bv2-8bit • Text Generation • 120B • 126 • 2
StefanKrsteski/Phi-3-mini-4k-instruct-GPTQ-8bit • Text Generation • 4B • 27 • 2
MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF • Text Generation • 12B • 181k • 51