Active filters: nvidia
straino/NVIDIA-Nemotron-Nano-9B-v2-Base-Q4_K_M-GGUF
Text Generation
• 9B • Updated • 50
• 1
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-3bit
Text Generation
• 1B • Updated • 265
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-4bit
Text Generation
• 1B • Updated • 20
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-6bit
Text Generation
• 9B • Updated • 25
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-8bit
Text Generation
• 9B • Updated • 43
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-3bit
Text Generation
• 12B • Updated • 239
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-4bit
Text Generation
• 12B • Updated • 22
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-6bit
Text Generation
• 12B • Updated • 23
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-8bit
Text Generation
• 12B • Updated • 22
Text Generation
• 5B • Updated • 34.1k
• 15
Text Generation
• 8B • Updated • 18.1k
• 4
Text Generation
• 8B • Updated • 367k
• 5
Text Generation
• 15B • Updated • 14.3k
• 4
Text Generation
• 17B • Updated • 85.4k
• 13
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated • 553
• 7
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 23.9k
• 13
SandLogicTechnologies/OpenReasoning-Nemotron-1.5B-GGUF
Text Generation
• 2B • Updated • 14
SandLogicTechnologies/OpenReasoning-Nemotron-7B-GGUF
Text Generation
• 8B • Updated • 29
MorsiKK/Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 2
mlx-community/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit
Text Generation
• 50B • Updated • 1.39k
• 3
nvidia/NVIDIA-Nemotron-Nano-9B-v2-FP8
Text Generation
• 9B • Updated • 84.1k
• 7
jc2375/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-8Bit
Text Generation
• 50B • Updated • 71
Wwayu/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit
Text Generation
• Updated • 105
valuat/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit
Text Generation
• 50B • Updated • 58
cagataydev/gr00t-wholettheducksout
Robotics
• 3B • Updated • 1
• 2
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic
Text Generation
• 9B • Updated • 2.31k
• 3
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD
Image-Text-to-Text
• 6B • Updated • 215
• 14
danijo13/OpenCodeReasoning-Nemotron-1.1-32B-Q4_K_M-GGUF
Text Generation
• 33B • Updated • 2
gg000/NVIDIA-Nemotron-Nano-12B-v2-MLX-4bit
Text Generation
• 12B • Updated • 27
Wwayu/Llama-3_1-Nemotron-Ultra-253B-v1-mlx-2Bit
Text Generation
• Updated • 19