Models: 42,367

Active filters: 4-bit

Madras1/JadeBr72b

Text Generation • 73B • Updated 8 days ago • 54 • 2

Bob-the-Koala/LocalCodeViber

Text Generation • 8B • Updated 6 days ago • 39 • 2

RepublicOfKorokke/Qwen3.5-35B-A3B-mlx-vlm-mxfp4

Text Generation • 7B • Updated 5 days ago • 1.02k • 2

WaveCut/Qwen3.5-35B-A3B-MLX_4bit_SWAN

Text Generation • 35B • Updated 4 days ago • 870 • 2

Intel/GLM-5-int4-mixed-AutoRound

117B • Updated about 18 hours ago • 49 • 2

QuantTrio/GLM-5-AWQ

Text Generation • 586B • Updated 1 day ago • 50 • 2

TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ

Text Generation • 33B • Updated Sep 27, 2023 • 99.3k • 604

TheBloke/Vicuna-13B-1-3-SuperHOT-8K-GPTQ

Text Generation • 13B • Updated Aug 21, 2023 • 1 • 8

Qwen/Qwen-VL-Chat-Int4

Text Generation • 10B • Updated Jan 25, 2024 • 2.15k • 94

unsloth/mistral-7b-instruct-v0.2-bnb-4bit

Text Generation • 7B • Updated Nov 22, 2024 • 12.7k • 36

Tann-dev/sex-chat-dirty-girlfriend

Text Generation • 7B • Updated Feb 17, 2024 • 142 • 44

MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-AWQ

Text Generation • 141B • Updated Apr 18, 2024 • 40.2k • 13

unsloth/llama-3-8b-bnb-4bit

Text Generation • 8B • Updated Jan 7, 2025 • 76.6k • 204

MaziyarPanahi/Llama-3-16B-Instruct-v0.1-GGUF

Text Generation • 17B • Updated Apr 20, 2024 • 651 • 9

unsloth/Phi-3-mini-4k-instruct-bnb-4bit

Text Generation • 4B • Updated Sep 3, 2024 • 22k • 43

solidrust/Mistral-7B-Instruct-v0.3-AWQ

Text Generation • 7B • Updated Sep 3, 2024 • 4.84k • 8

MaziyarPanahi/gemma-2-2b-it-GGUF

Text Generation • 3B • Updated Aug 1, 2024 • 108k • 14

openbmb/MiniCPM-V-2_6-int4

Image-Text-to-Text • 8B • Updated Jun 16, 2025 • 10.5k • 88

thesven/Phi-3.5-mini-instruct-GPTQ-4bit

Text Generation • 4B • Updated Aug 29, 2024 • 19 • 1

MaziyarPanahi/solar-pro-preview-instruct-GGUF

Text Generation • 22B • Updated Sep 13, 2024 • 106k • 28

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4

Text Generation • 15B • Updated Oct 9, 2024 • 93.6k • 26

MaziyarPanahi/Qwen2.5-3B-Instruct-GGUF

Text Generation • 3B • Updated Sep 18, 2024 • 75 • 2

unsloth/Qwen2.5-3B-Instruct-bnb-4bit

Text Generation • 3B • Updated Feb 6, 2025 • 17.1k • 14

Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4

Text Generation • 8B • Updated Nov 18, 2024 • 940k • 13

MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Sep 25, 2024 • 108k • 14

NetoAISolutions/TSLAM-4B

4B • Updated Dec 4, 2025 • 209 • 21

stelterlab/SauerkrautLM-v2-14b-SFT-AWQ

15B • Updated Nov 5, 2024 • 2 • 1

Qwen/Qwen2.5-Coder-14B-Instruct-AWQ

Text Generation • 15B • Updated Jan 12, 2025 • 165k • 16

mlx-community/Qwen2.5-Coder-0.5B-Instruct-4bit

Text Generation • Updated Nov 11, 2024 • 216 • 6

lmstudio-community/Qwen2.5-32B-Instruct-MLX-4bit

Text Generation • 5B • Updated Nov 13, 2024 • 1.12k • 3