Inference Providers
Active filters: llamacpp
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated • 174
rob-x-ai/neural-chat-7b-v3-1-GGUF
7B • Updated • 40
Druvith/mistralmed-7b-v1.5.gguf
7B • Updated • 9
rxavier/Taurus-7B-1.0-GGUF
7B • Updated • 25
BramVanroy/GEITje-7B-ultra-GGUF
7B • Updated • 200
• 9
Vikhrmodels/it-5.3-fp16-32k-GGUF
8B • Updated • 116
• 2
rubra-ai/Meta-Llama-3-8B-Instruct-GGUF
9B • Updated • 42
• 4
Vikhrmodels/it-5.4-fp16-orpo-v2-GGUF
8B • Updated • 109
• 4
Dracones/gemma-2-9b-it-GGUF
Text Generation
• 9B • Updated • 27
Dracones/gemma-2-27b-it-GGUF
Text Generation
• 27B • Updated • 15
Vikhrmodels/Vikhr-Gemma-2B-instruct-GGUF
Text Generation
• 3B • Updated • 561
• 19
flowaicom/Flow-Judge-v0.1-GGUF
Text Generation
• 4B • Updated • 68
• 10
Vikhrmodels/Vikhr-Llama-3.2-1B-instruct-GGUF
Text Generation
• 1B • Updated • 1.42k
• 14
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF
Text Generation
• 0.5B • Updated • 384
• 9
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_GGUF
2B • Updated • 133
• 10
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-r_GGUF
2B • Updated • 229
• 4
vicharai/ViCoder-html-32B-preview-GGUF
Text Generation
• 33B • Updated • 92
• 4
Gardeviance/MS-Gardventure-MW-V1-22B-IQ4_NL-GGUF
Text Generation
• 22B • Updated • 6
Dca3271144691983/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated
lucky087/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated