Inference Providers
Active filters: 4bit
mlx-community/Qwen3.5-9B-MLX-4bit
Image-Text-to-Text
• 2B • Updated • 108k
• 64
0xSero/GLM-4.7-REAP-218B-A32B-W4A16
Text Generation
• 2B • Updated • 314
• 24
Sepolian/Huihui-Qwen3.5-27B-Claude-4.6-Opus-abliterated-Q4_K_M
Text Generation
• 27B • Updated • 2.16k
• 12
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 839k
• 20
0xSero/GLM-4.7-REAP-50-W4A16
Text Generation
• 2B • Updated • 123
• 69
Text Generation
• Updated • 190
• 2
empero-ai/openNemo-Cascade-2-30B-A3B
Text Generation
• 32B • Updated • 172
• 2
0xSero/GLM-4.6-REAP-218B-A32B-W4A16-AutoRound
Text Generation
• 2B • Updated • 138
• 8
0xSero/GLM-4.7-REAP-40-W4A16
Text Generation
• 2B • Updated • 52
• 7
RunPod/FLUX.2-klein-4B-mflux-4bit
Text-to-Image
• Updated • 2
mlx-community/Qwen3.5-4B-MLX-4bit
1.0B • Updated • 32.2k
• 11
mlx-community/Qwen3.5-0.8B-OptiQ-4bit
Text Generation
• 0.2B • Updated • 3.1k
• 5
mlx-community/Qwen3.5-2B-OptiQ-4bit
Text Generation
• 0.4B • Updated • 4.39k
• 2
mlx-community/Qwen3.5-4B-OptiQ-4bit
Text Generation
• 0.8B • Updated • 5.44k
• 8
mlx-community/Qwen3.5-9B-OptiQ-4bit
Text Generation
• 9B • Updated • 7.16k
• 13
groxaxo/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-heretic-v2-AutoRound-W4A16
Image-Text-to-Text
• 3B • Updated • 92
• 1
mayaeary/pygmalion-6b-4bit-128g
Text Generation
• Updated • 23
• 40
mayaeary/pygmalion-6b_dev-4bit-128g
Text Generation
• Updated • 20
• 121
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g
Text Generation
• Updated • 11
• 2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g
Text Generation
• Updated • 3
• 2
Ancestral/Dolly_Shygmalion-6b-4bit-128g
Text Generation
• Updated • 10
• 5
Ancestral/PPO_Shygmalion-6b-4bit-128g
Text Generation
• Updated • 33
Ancestral/Dolly_Malion-6b-4bit-128g
Text Generation
• Updated • 6
• 1
4bit/pygmalion-6b-4bit-128g
Text Generation
• Updated • 4
• 3
Text Generation
• Updated • 7
• 1
seonglae/opt-125m-4bit-gptq
Text Generation
• Updated • 25
seonglae/wizardlm-7b-uncensored-gptq
Text Generation
• Updated • 1
seonglae/llama-2-7b-chat-hf-gptq
Text Generation
• Updated • 6
seonglae/llama-2-13b-chat-hf-gptq
Text Generation
• Updated • 93
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated • 6