Inference Providers
Active filters: llamacpp
Vikhrmodels/Vikhr-Llama-3.2-1B-instruct-GGUF
Text Generation
• 1B • Updated • 1.45k
• 14
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF
Text Generation
• 0.5B • Updated • 328
• 9
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated • 171
rob-x-ai/neural-chat-7b-v3-1-GGUF
7B • Updated • 34
Druvith/mistralmed-7b-v1.5.gguf
7B • Updated • 7
rxavier/Taurus-7B-1.0-GGUF
7B • Updated • 26
BramVanroy/GEITje-7B-ultra-GGUF
7B • Updated • 179
• 9
Vikhrmodels/it-5.3-fp16-32k-GGUF
8B • Updated • 95
• 2
rubra-ai/Meta-Llama-3-8B-Instruct-GGUF
9B • Updated • 44
• 4
Vikhrmodels/it-5.4-fp16-orpo-v2-GGUF
8B • Updated • 120
• 4
Dracones/gemma-2-9b-it-GGUF
Text Generation
• 9B • Updated • 33
Dracones/gemma-2-27b-it-GGUF
Text Generation
• 27B • Updated • 17
Vikhrmodels/Vikhr-Gemma-2B-instruct-GGUF
Text Generation
• 3B • Updated • 674
• 19
flowaicom/Flow-Judge-v0.1-GGUF
Text Generation
• 4B • Updated • 71
• 10
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_GGUF
2B • Updated • 128
• 10
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-r_GGUF
2B • Updated • 230
• 4
vicharai/ViCoder-html-32B-preview-GGUF
Text Generation
• 33B • Updated • 74
• 4
Gardeviance/MS-Gardventure-MW-V1-22B-IQ4_NL-GGUF
Text Generation
• 22B • Updated • 6
Dca3271144691983/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated
lucky087/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated