-
-
-
-
-
-
Inference Providers
Active filters: 4-bit
Text Generation
• 73B • Updated
• 54
• 2
Bob-the-Koala/LocalCodeViber
Text Generation
• 8B • Updated
• 39
• 2
RepublicOfKorokke/Qwen3.5-35B-A3B-mlx-vlm-mxfp4
Text Generation
• 7B • Updated
• 1.02k
• 2
WaveCut/Qwen3.5-35B-A3B-MLX_4bit_SWAN
Text Generation
• 35B • Updated
• 870
• 2
Intel/GLM-5-int4-mixed-AutoRound
117B • Updated
• 49
• 2
Text Generation
• 586B • Updated
• 50
• 2
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
• 33B • Updated
• 99.3k
• 604
TheBloke/Vicuna-13B-1-3-SuperHOT-8K-GPTQ
Text Generation
• 13B • Updated
• 1
• 8
Text Generation
• 10B • Updated
• 2.15k
• 94
unsloth/mistral-7b-instruct-v0.2-bnb-4bit
Text Generation
• 7B • Updated
• 12.7k
• 36
Tann-dev/sex-chat-dirty-girlfriend
Text Generation
• 7B • Updated
• 142
• 44
MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-AWQ
Text Generation
• 141B • Updated
• 40.2k
• 13
unsloth/llama-3-8b-bnb-4bit
Text Generation
• 8B • Updated
• 76.6k
• 204
MaziyarPanahi/Llama-3-16B-Instruct-v0.1-GGUF
Text Generation
• 17B • Updated
• 651
• 9
unsloth/Phi-3-mini-4k-instruct-bnb-4bit
Text Generation
• 4B • Updated
• 22k
• 43
solidrust/Mistral-7B-Instruct-v0.3-AWQ
Text Generation
• 7B • Updated
• 4.84k
• 8
MaziyarPanahi/gemma-2-2b-it-GGUF
Text Generation
• 3B • Updated
• 108k
• 14
openbmb/MiniCPM-V-2_6-int4
Image-Text-to-Text
• 8B • Updated
• 10.5k
• 88
thesven/Phi-3.5-mini-instruct-GPTQ-4bit
Text Generation
• 4B • Updated
• 19
• 1
MaziyarPanahi/solar-pro-preview-instruct-GGUF
Text Generation
• 22B • Updated
• 106k
• 28
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4
Text Generation
• 15B • Updated
• 93.6k
• 26
MaziyarPanahi/Qwen2.5-3B-Instruct-GGUF
Text Generation
• 3B • Updated
• 75
• 2
unsloth/Qwen2.5-3B-Instruct-bnb-4bit
Text Generation
• 3B • Updated
• 17.1k
• 14
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated
• 940k
• 13
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation
• 3B • Updated
• 108k
• 14
4B • Updated
• 209
• 21
stelterlab/SauerkrautLM-v2-14b-SFT-AWQ
15B • Updated
• 2
• 1
Qwen/Qwen2.5-Coder-14B-Instruct-AWQ
Text Generation
• 15B • Updated
• 165k
• 16
mlx-community/Qwen2.5-Coder-0.5B-Instruct-4bit
Text Generation
• Updated
• 216
• 6
lmstudio-community/Qwen2.5-32B-Instruct-MLX-4bit
Text Generation
• 5B • Updated
• 1.12k
• 3