Inference Providers
Active filters: nvidia
nvidia/Llama-2-13B-DMC-4x
nvidia/Llama-2-13B-DMC-8x
backyardai/Llama-3.1-Nemotron-70B-Instruct-GGUF
Text Generation
• 71B • Updated • 63
second-state/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• 52B • Updated • 296
gaianet/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• 52B • Updated • 159
tensorblock/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• 52B • Updated • 14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation
• 11B • Updated • 12
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8
Text Generation
• 71B • Updated • 2
mradermacher/llama-3-nvidia-ChatQA-1.5-8B-GGUF
8B • Updated • 88
mradermacher/llama-3-nvidia-ChatQA-1.5-8B-i1-GGUF
8B • Updated • 15
matatonic/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated-6.5bpw-h8-exl2
Text Generation
• Updated • 2
nvidia/Cosmos-1.0-Tokenizer-CV8x8x8
Updated • 204
• 23
nvidia/Cosmos-1.0-Tokenizer-DV8x16x16
Updated • 33
• 17
nvidia/Cosmos-1.0-Prompt-Upsampler-12B-Text2World
Updated • 31
• 13
nvidia/Cosmos-1.0-Diffusion-7B-Video2World
Updated • 6.84k
• 39
nvidia/Cosmos-1.0-Diffusion-14B-Text2World
Updated • 9
• 60
nvidia/Cosmos-1.0-Diffusion-14B-Video2World
Updated • 11.2k
• 57
nvidia/Cosmos-1.0-Autoregressive-13B-Video2World
Updated • 12
• 32
nvidia/Cosmos-1.0-Autoregressive-12B
Updated • 7
• 30
nvidia/Cosmos-1.0-Autoregressive-5B-Video2World
Updated • 26
• 30
nvidia/Cosmos-1.0-Diffusion-7B-Decoder-DV8x16x16ToCV8x8x8
Updated • 12
• 9
nvidia/Cosmos-1.0-Autoregressive-4B
Updated • 15
• 56
nvidia/Cosmos-1.0-Diffusion-7B-Text2World
Text-to-Video
• Updated • 7.86k
• 233
mradermacher/Llama-3_1-Nemotron-51B-Instruct-GGUF
52B • Updated • 41
• 1
mradermacher/Llama-3_1-Nemotron-51B-Instruct-i1-GGUF
52B • Updated • 36
• 1
mradermacher/Llama-3.2-Nemotron-3B-Instruct-GGUF
3B • Updated • 17
mradermacher/Llama-3.2-Nemotron-3B-Instruct-i1-GGUF
3B • Updated • 69
nvidia/AceMath-1.5B-Instruct
Text Generation
• 2B • Updated • 2.19k
• 15