Inference Providers
Active filters: chat
microsoft/Phi-4-reasoning-plus
Text Generation
• 15B • Updated • 26.9k
• 343
DavidAU/OpenAi-GPT-oss-20b-abliterated-uncensored-NEO-Imatrix-gguf
Text Generation
• 21B • Updated • 28.5k
• 502
NousResearch/Hermes-4-14B
Text Generation
• 425k • Updated • 37.5k
• • 153
DavidAU/OpenAi-GPT-oss-20b-HERETIC-uncensored-NEO-Imatrix-gguf
Text Generation
• 21B • Updated • 25.4k
• 150
Text Generation
• 0.1B • Updated • 558
• 6
NousResearch/Hermes-3-Llama-3.1-8B
Text Generation
• 8B • Updated • 239k
• • 450
Qwen/Qwen2-Audio-7B-Instruct
Audio-Text-to-Text
• 8B • Updated • 736k
• 536
bartowski/Qwen2.5-3B-Instruct-GGUF
Text Generation
• 3B • Updated • 39.7k
• 27
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 77k
• 33
Qwen/Qwen2.5-7B-Instruct-GGUF
Text Generation
• 8B • Updated • 221k
• 149
bartowski/Qwen2.5-Coder-32B-Instruct-GGUF
Text Generation
• 33B • Updated • 37.4k
• 108
microsoft/bitnet-b1.58-2B-4T-gguf
Text Generation
• 2B • Updated • 16.8k
• 276
NousResearch/Hermes-4-70B
Text Generation
• 71B • Updated • 1.15k
• • 194
NousResearch/Hermes-4-70B-FP8
Text Generation
• 71B • Updated • 3.12k
• 34
mlx-community/Hermes-4-14B-8bit
Text Generation
• 15B • Updated • 361
• 3
neuralcrew/neutrino-instruct
Text Generation
• 7B • Updated • 13.3k
• 7
Lazarus-Ai/ReAligned-Qwen3.5-35B-A3B
Text Generation
• 35B • Updated • 80
• 5
AEON-7/Qwen3.6-35B-A3B-heretic-NVFP4
Image-Text-to-Text
• 21B • Updated • 141k
• 47
Text Generation
• 0.1B • Updated • 887
• 18
CohereLabs/command-a-plus-05-2026-bf16
Image-Text-to-Text
• 219B • Updated • 21.3k
• • 131
CohereLabs/command-a-plus-05-2026-fp8
Image-Text-to-Text
• 219B • Updated • 4.86k
• • 34
Text Generation
• 0.6B • Updated • 521
• 4
Surpem/Supertron2.1-0.6B-GGUF
Text Generation
• 0.6B • Updated • 354
• 3
Text Generation
• 1B • Updated • 105
• 3
ThingAI/Quark-270m-Instruct
Text Generation
• 0.3B • Updated • 137
• 2
Text Generation
• 0.6B • Updated • 83.6k
• • 96
bartowski/Hermes-3-Llama-3.1-8B-GGUF
Text Generation
• Updated • 10.4k
• 16
bartowski/Qwen2.5-7B-Instruct-GGUF
Text Generation
• 8B • Updated • 115k
• 62
Qwen/Qwen2.5-7B-Instruct-AWQ
Text Generation
• 8B • Updated • 2.94M
• 44
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
• 33B • Updated • 814k
• 101