Inference Providers
Active filters: tool use
NousResearch/Hermes-4.3-36B
Text Generation
• 36B • Updated • 7.33k
• 224
NousResearch/Hermes-4-14B
Text Generation
• 425k • Updated • 34.9k
• • 151
NousResearch/Hermes-4-70B
Text Generation
• 71B • Updated • 1.08k
• • 193
NousResearch/Hermes-4-70B-FP8
Text Generation
• 71B • Updated • 3.35k
• 33
NousResearch/Hermes-4-14B-FP8
Text Generation
• 15B • Updated • 11.2k
• 25
NousResearch/Hermes-4-405B
Text Generation
• 406B • Updated • 450
• • 91
NousResearch/Hermes-4-405B-FP8
Text Generation
• 406B • Updated • 410
• 31
NousResearch/Hermes-4.3-36B-GGUF
Text Generation
• 36B • Updated • 4.95k
• 54
cyankiwi/Hermes-4-70B-AWQ-4bit
Text Generation
• 13B • Updated • 65.1k
• 7
mlx-community/Hermes-4-14B-8bit
Text Generation
• 15B • Updated • 260
• 2
alexcovo/Hermes-4.3-36B-mlx-8Bit
Text Generation
• 36B • Updated • 217
• 1
unclecode/llama3-function-call-lora-adapter-240424
unclecode/llama3-function-call-Q4_K_M_GGFU-240424
8B • Updated • 99
• 3
unclecode/tinyllama-function-call-lora-adapter-250424
Updated
unclecode/tinyllama-function-call-Q4_K_M_GGFU-250424
1B • Updated • 347
• 4
mims-harvard/TxAgent-T1-Llama-3.1-8B
Text Generation
• 8B • Updated • 325
• • 31
mradermacher/TxAgent-T1-Llama-3.1-8B-GGUF
8B • Updated • 263
• 2
DavidAU/Llama3.1-MOE-4X8B-Gated-IQ-Multi-Tier-Deep-Reasoning-32B-GGUF
Text Generation
• 25B • Updated • 443
• 10
DavidAU/Llama3.1-MOE-4X8B-Gated-IQ-Multi-Tier-COGITO-Deep-Reasoning-32B-GGUF
Text Generation
• 25B • Updated • 380
• 6
tensorblock/mims-harvard_TxAgent-T1-Llama-3.1-8B-GGUF
Text Generation
• 8B • Updated • 49
DavidAU/Llama3.1-MOE-4X8B-Gated-IQ-Multi-Tier-Deep-Reasoning-32B
Text Generation
• 25B • Updated • 9
• 4
DavidAU/Qwen3-128k-30B-A3B-NEO-MAX-Imatrix-gguf
Text Generation
• 31B • Updated • 3.74k
• 37
DavidAU/Qwen3-30B-A1.5B-64K-High-Speed-NEO-Imatrix-MAX-gguf
Text Generation
• 31B • Updated • 743
• 26
mradermacher/Llama3.1-MOE-4X8B-Gated-IQ-Multi-Tier-Deep-Reasoning-32B-GGUF
25B • Updated • 163
mradermacher/Llama3.1-MOE-4X8B-Gated-IQ-Multi-Tier-Deep-Reasoning-32B-i1-GGUF
25B • Updated • 340
Prince-1/Hermes-4-14B-Onnx
Text Generation
• Updated makaveli10/tinyllama-function-call-lora-adapter-250424-F16-GGUF
25.2M • Updated • 21
rogue-security/mcp-tool-use-quality-ranger-0.6b
Text Classification
• 0.6B • Updated • 14
lmstudio-community/Hermes-4-405B-MLX-5bit
Text Generation
• 406B • Updated • 312
lmstudio-community/Hermes-4-405B-MLX-4bit
Text Generation
• 406B • Updated • 791
• 1