Inference Providers
Active filters: nvidia
etsien/Llama-3_3-Nemotron-Super-49B-v1_5-GPTQ-w4a8
Text Generation
• 50B • Updated • 554
dominguesm/NVIDIA-Nemotron-Nano-9B-v2-GGUF
Text Generation
• 9B • Updated • 791
• 1
weathermanj/NVIDIA-Nemotron-Nano-9B-v2-gguf
Text Generation
• 9B • Updated • 322
• 1
mlx-community/NVIDIA-Nemotron-Nano-9B-v2-4bits
Text Generation
• Updated • 497
• 2
mradermacher/STELLA-VLM-FineBio-7B-GGUF
8B • Updated • 73
• 1
Triago/NVIDIA-Nemotron-Nano-12B-v2-Q8_0-GGUF
Text Generation
• 12B • Updated • 8
• 1
gabriellarson/NVIDIA-Nemotron-Nano-12B-v2-GGUF
Text Generation
• 12B • Updated • 196
QuantFactory/NVIDIA-Nemotron-Nano-9B-v2-GGUF
Text Generation
• Updated • 572
• 4
jesusoctavioas/Llama-3.1-Nemotron-Nano-8B-v1-mlx-4Bit
Text Generation
• 1B • Updated • 28
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q6_K-GGUF
Text Generation
• 12B • Updated • 16
• 2
QuantFactory/NVIDIA-Nemotron-Nano-12B-v2-GGUF
Text Generation
• Updated • 178
• 2
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q5_K_M-GGUF
Text Generation
• 12B • Updated • 7
• 1
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q5_K_S-GGUF
Text Generation
• 12B • Updated • 4
• 1
NikolayKozloff/NVIDIA-Nemotron-Nano-12B-v2-Q4_K_M-GGUF
Text Generation
• 12B • Updated • 13
• 1
Neural-Hacker/Qwen3-Math-Reasoning-LoRA
Text Generation
• Updated • 2
alexcovo/NVIDIA-Nemotron-Nano-12B-v2-Q4_K_M-GGUF
Text Generation
• 12B • Updated • 22
cyankiwi/NVIDIA-Nemotron-Nano-12B-v2-AWQ-4bit
Text Generation
• 3B • Updated • 930
• 4
cyankiwi/NVIDIA-Nemotron-Nano-12B-v2-AWQ-8bit
Text Generation
• 4B • Updated • 353
• 1
cyankiwi/NVIDIA-Nemotron-Nano-9B-v2-AWQ-8bit
Text Generation
• 3B • Updated • 20
QuantFactory/OpenReasoning-Nemotron-7B-GGUF
Text Generation
• 8B • Updated • 87
• 2
nvidia/Qwen3-235B-A22B-Thinking-2507-Eagle3
Text Generation
• 0.3B • Updated • 25
nvidia/Qwen3-30B-A3B-Thinking-2507-Eagle3
Text Generation
• 0.1B • Updated • 32
Lumia101/NVIDIA-Nemotron-Nano-9B-v2-Q4_K_M-GGUF
Text Generation
• 9B • Updated • 85
mlx-community/NVIDIA-Nemotron-Nano-9B-v2-6bit
Text Generation
• Updated • 212
Mungert/NVIDIA-Nemotron-Nano-12B-v2-GGUF
Text Generation
• 12B • Updated • 59
• 2
nvidia/Llama-3.1-8B-Instruct-NVFP4
5B • Updated • 100k
• 7
nvidia/Cosmos-Predict2.5-14B
Updated • 2.42k
• 24
DBMe/Llama-3_1-Nemotron-Ultra-253B-v1-exl3-2.7bpw
Text Generation
• 45B • Updated • 2
maxrubin629/Nemotron-H-8B-Reasoning-128K-6bit
Text Generation
• 8B • Updated • 13
straino/NVIDIA-Nemotron-Nano-9B-v2-Base-Q4_K_M-GGUF
Text Generation
• 9B • Updated • 50
• 1