Inference Providers
Active filters: nvidia
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q5_K_M-GGUF
Text Generation
• 12B • Updated • 10
• 1
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q5_K_S-GGUF
Text Generation
• 12B • Updated • 19
• 1
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q4_K_M-GGUF
Text Generation
• 12B • Updated • 11
• 1
Neural-Hacker/Qwen3-Math-Reasoning-LoRA
Text Generation
• Updated • 2
alexcovo/NVIDIA-Nemotron-Nano-12B-v2-Q4_K_M-GGUF
Text Generation
• 12B • Updated • 4
cyankiwi/NVIDIA-Nemotron-Nano-9B-v2-AWQ-4bit
Text Generation
• 2B • Updated • 75
• 3
cyankiwi/NVIDIA-Nemotron-Nano-12B-v2-AWQ-4bit
Text Generation
• 3B • Updated • 1.11k
• 4
cyankiwi/NVIDIA-Nemotron-Nano-12B-v2-AWQ-8bit
Text Generation
• 4B • Updated • 1.03k
• 1
cyankiwi/NVIDIA-Nemotron-Nano-9B-v2-AWQ-8bit
Text Generation
• 3B • Updated • 33
QuantFactory/OpenReasoning-Nemotron-7B-GGUF
Text Generation
• 8B • Updated • 57
• 2
nvidia/Qwen3-235B-A22B-Thinking-2507-Eagle3
Text Generation
• 0.3B • Updated • 108
• 1
nvidia/Qwen3-30B-A3B-Thinking-2507-Eagle3
Text Generation
• 0.1B • Updated • 188
• 3
Lumia101/NVIDIA-Nemotron-Nano-9B-v2-Q4_K_M-GGUF
Text Generation
• 9B • Updated • 106
mlx-community/NVIDIA-Nemotron-Nano-9B-v2-6bit
Text Generation
• Updated • 401
• 1
Mungert/NVIDIA-Nemotron-Nano-12B-v2-GGUF
Text Generation
• 12B • Updated • 193
• 2
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 3.04k
• 11
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated • 631
• 7
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated • 221
• 6
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 912
• 9
nvidia/Llama-3.1-8B-Instruct-NVFP4
5B • Updated • 240k
• 10
nvidia/Cosmos-Predict2.5-14B
Updated • 4.08k
• 25
DBMe/Llama-3_1-Nemotron-Ultra-253B-v1-exl3-2.7bpw
Text Generation
• 45B • Updated • 1
maxrubin629/Nemotron-H-8B-Reasoning-128K-6bit
Text Generation
• 8B • Updated • 6
straino/NVIDIA-Nemotron-Nano-9B-v2-Base-Q4_K_M-GGUF
Text Generation
• 9B • Updated • 29
• 1
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-3bit
Text Generation
• 1B • Updated • 1.05k
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-3bit
Text Generation
• 12B • Updated • 1.01k
Text Generation
• 5B • Updated • 74.8k
• 17
Text Generation
• 8B • Updated • 22.8k
• 5
Text Generation
• 8B • Updated • 10.7k
• 11
Text Generation
• 15B • Updated • 2.13k
• 5