Active filters: nvidia
DBMe/Llama-3_1-Nemotron-Ultra-253B-v1-exl3-2.7bpw
Text Generation
• 45B • Updated • 2
maxrubin629/Nemotron-H-8B-Reasoning-128K-6bit
Text Generation
• 8B • Updated • 13
straino/NVIDIA-Nemotron-Nano-9B-v2-Base-Q4_K_M-GGUF
Text Generation
• 9B • Updated • 50
• 1
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-3bit
Text Generation
• 1B • Updated • 407
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-3bit
Text Generation
• 12B • Updated • 382
Text Generation
• 5B • Updated • 32.1k
• 15
Text Generation
• 8B • Updated • 30.7k
• 4
Text Generation
• 8B • Updated • 368k
• 5
Text Generation
• 15B • Updated • 15.1k
• 4
Text Generation
• 17B • Updated • 84.8k
• 13
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated • 560
• 7
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 23.9k
• 13
SandLogicTechnologies/OpenReasoning-Nemotron-1.5B-GGUF
Text Generation
• 2B • Updated • 13
SandLogicTechnologies/OpenReasoning-Nemotron-7B-GGUF
Text Generation
• 8B • Updated • 27
unsloth/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
• 9B • Updated • 1.21k
• 3
MorsiKK/Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 2
mlx-community/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit
Text Generation
• 50B • Updated • 1.42k
• 3
nvidia/NVIDIA-Nemotron-Nano-9B-v2-FP8
Text Generation
• 9B • Updated • 81.6k
• 7
jc2375/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-8Bit
Text Generation
• 50B • Updated • 75
Wwayu/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit
Text Generation
• Updated • 109
valuat/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit
Text Generation
• 50B • Updated • 62
cagataydev/gr00t-wholettheducksout
Robotics
• 3B • Updated • 1
• 2
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic
Text Generation
• 9B • Updated • 2.72k
• 3
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD
Image-Text-to-Text
• 6B • Updated • 212
• 14
danijo13/OpenCodeReasoning-Nemotron-1.1-32B-Q4_K_M-GGUF
Text Generation
• 33B • Updated • 2
gg000/NVIDIA-Nemotron-Nano-12B-v2-MLX-4bit
Text Generation
• 12B • Updated • 27
Wwayu/Llama-3_1-Nemotron-Ultra-253B-v1-mlx-2Bit
Text Generation
• Updated • 21
Wwayu/Llama-3_1-Nemotron-Ultra-253B-CPT-v1-mlx-3Bit
Text Generation
• Updated • 36
nvidia/gpt-oss-120b-Eagle3-short-context
Text Generation
• Updated • 2.53k
• 16
nvidia/NVIDIA-Nemotron-Nano-9B-v2-NVFP4
Text Generation
• Updated • 8.22k
• 21