Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

19,485

Full-text search

Active filters: gguf-my-repo

Theta-Lev/DeepSeek-V2-Lite-Q8_0-GGUF

16B • Updated Aug 4, 2024 • 8

Theta-Lev/DeepSeek-Coder-V2-Lite-Instruct-Q8_0-GGUF

16B • Updated Aug 4, 2024 • 10

Theta-Lev/Mistral-Nemo-Instruct-2407-Q8_0-GGUF

12B • Updated Aug 4, 2024 • 6

SteelQuants/DA-MG-NeMoist-21b-v1.4-Q5_K_M-GGUF

20B • Updated Aug 4, 2024 • 2

lightfall/llama-3-8b-instruct-262k-chinese-Q8_0-GGUF

Text Generation • 8B • Updated Aug 4, 2024

lightfall/Atom-7B-Chat-Q8_0-GGUF

Question Answering • 7B • Updated Aug 4, 2024 • 6

sk7n4k3d/Mistral-Nemo-Instruct-2407-Q8_0-GGUF

12B • Updated Aug 4, 2024 • 2

SteelQuants/DA-MG-NeMoist-21b-v1.5-Q5_K_M-GGUF

20B • Updated Aug 4, 2024 • 6

darwin2025/starcoder2-3b-instruct-Q4_0-GGUF

Text Generation • 3B • Updated Aug 4, 2024 • 7

sh2nd/codegemma-1.1-2b-fix-Q4_K_M-GGUF

3B • Updated Aug 4, 2024 • 2

double-em/DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M-GGUF

16B • Updated Aug 4, 2024 • 14

double-em/DeepSeek-Coder-V2-Lite-Instruct-Q2_K-GGUF

16B • Updated Aug 4, 2024 • 37

Nehal07/Mistral-Nemo-Instruct-2407-Q4_K_M-GGUF

12B • Updated Aug 4, 2024 • 101 • 1

inflatebot/L3-8B-Helium3-Q8_0-GGUF

8B • Updated Aug 4, 2024 • 1

inflatebot/L3-8B-Helium3-Q4_K_S-GGUF

8B • Updated Aug 4, 2024

Pterko/llama-3-12b-instruct-Q8_0-GGUF

Text Generation • 12B • Updated Aug 4, 2024 • 27

Ransss/MN-12B-Lyra-v1-Q8_0-GGUF

12B • Updated Aug 4, 2024

Pterko/saiga_gemma2_9b-Q8_0-GGUF

9B • Updated Aug 4, 2024

DataSoul/Gemma-2-2b-Chinese-it-Q8_0-GGUF

Text Generation • 3B • Updated Aug 4, 2024 • 3 • 1

DataSoul/Gemma-2-2b-Chinese-it-Q5_K_M-GGUF

Text Generation • 3B • Updated Aug 4, 2024 • 6 • 1

yaneivan/dolphin-2.9.4-llama3.1-8b-Q4_K_M-GGUF

8B • Updated Aug 4, 2024 • 768

Ransss/L3-8B-Helium3-Q8_0-GGUF

8B • Updated Aug 4, 2024

wolfygeek/MN-Lyra-12b-EXL2-8bpw-Q4_K_M-GGUF

Updated Aug 5, 2024 • 1

Fischerboot/4b-Tinyllama-Q8_0-GGUF

4B • Updated Aug 5, 2024 • 2

Fischerboot/Llama-3-4B-Instruct-HalfWit-Q4_K_S-GGUF

5B • Updated Aug 5, 2024 • 3

brandonchen/DeepSeek-Coder-V2-Lite-Instruct-Q8_0-GGUF

16B • Updated Aug 5, 2024 • 2

blogcncom/Qwen2-7B-Instruct-Q4_0-GGUF

Text Generation • 8B • Updated Aug 5, 2024 • 3

sh1njuku/Mahou-1.3-mistral-nemo-12B-Q5_K_M-GGUF

12B • Updated Aug 5, 2024 • 1 • 1

mrmage/gemma-2-2b-it-Q2_K-GGUF

Text Generation • 3B • Updated Aug 5, 2024 • 5

mrmage/gemma-2-2b-it-IQ3_XXS-GGUF

Text Generation • 3B • Updated Aug 5, 2024 • 4