Inference Providers
Active filters: nvidia
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation
• Updated • 1.96k
• • 79
unsloth/Nemotron-3-Nano-30B-A3B
Text Generation
• 32B • Updated • 90.5k
• 14
nvidia/gpt-oss-120b-Eagle3-throughput
Text Generation
• Updated • 1.04k
• 34
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated • 12.7k
• 39
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
• Updated • 3.36k
• 59
introvoyz041/OpenMath-Nemotron-14B-Kaggle-mlx-4Bit
Text Generation
• 15B • Updated introvoyz041/OpenMath-Mistral-7B-v0.1-hf-mlx-4Bit
1B • Updated • 4
introvoyz041/OpenMath2-Llama3.1-8B-mlx-4Bit
Text Generation
• 1B • Updated • 10
introvoyz041/OpenMath-Nemotron-7B-mlx-4Bit
Text Generation
• 1B • Updated • 1
introvoyz041/OpenMath-Nemotron-32B-mlx-4Bit
Text Generation
• 33B • Updated • 8
introvoyz041/OpenMath-Nemotron-14B-mlx-4Bit
Text Generation
• 15B • Updated • 1
unsloth/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
• 32B • Updated • 88.7k
• 305
unsloth/Nemotron-3-Nano-30B-A3B-Base
Text Generation
• 32B • Updated • 1.49k
• 4
unsloth/Nemotron-3-Nano-30B-A3B-FP8
Text Generation
• 32B • Updated • 78
• 7
FriendliAI/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation
• 32B • Updated • 934
FriendliAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
• 32B • Updated • 2
nvidia/Qwen3-235B-A22B-Thinking-2507-FP4-Eagle3
Text Generation
• Updated • 60
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-4bit
Text Generation
• Updated • 873
• 4
ExaltedSlayer/nvidia-nemotron-3-nano-30b-a3b-mlx-mxfp4
Text Generation
• 32B • Updated • 1.01k
• 1
nvidia/Nemotron-Cascade-8B
Text Generation
• Updated • 2.07k
• • 67
bartowski/nvidia_Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
• 32B • Updated • 1.62k
• 9
bartowski/nvidia_Nemotron-Cascade-8B-Thinking-GGUF
Text Generation
• 8B • Updated • 1.22k
• 2
bartowski/nvidia_Nemotron-Cascade-14B-Thinking-GGUF
Text Generation
• 15B • Updated • 703
• 8
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4bit
Text Generation
• 32B • Updated • 91.9k
• 2
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5bit
Text Generation
• 32B • Updated • 82.6k
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6bit
Text Generation
• 32B • Updated • 82.7k
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8bit
Text Generation
• 32B • Updated • 84k
• 3
moxin-org/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
• 32B • Updated • 88
• 3
mradermacher/Qwen3-Nemotron-235B-A22B-GenRM-GGUF
235B • Updated • 87
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-MXFP4
Text Generation
• 32B • Updated • 1.62k
• 3