Active filters: nvidia
nvidia/Llama-2-13B-DMC-4x
nvidia/Llama-2-13B-DMC-8x
backyardai/Llama-3.1-Nemotron-70B-Instruct-GGUF
Text Generation • 71B • Updated • 122
second-state/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation • 52B • Updated • 959
gaianet/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation • 52B • Updated • 884
tensorblock/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation • 52B • Updated • 285
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation • 11B • Updated • 12
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8
Text Generation • 71B • Updated • 10
mradermacher/llama-3-nvidia-ChatQA-1.5-8B-GGUF
8B • Updated • 65
mradermacher/llama-3-nvidia-ChatQA-1.5-8B-i1-GGUF
8B • Updated • 112
matatonic/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated-6.5bpw-h8-exl2
Text Generation • Updated • 1
nvidia/Cosmos-1.0-Tokenizer-CV8x8x8
Updated • 169 • 24
nvidia/Cosmos-1.0-Tokenizer-DV8x16x16
Updated • 52 • 18
nvidia/Cosmos-1.0-Prompt-Upsampler-12B-Text2World
Updated • 20 • 13
nvidia/Cosmos-1.0-Diffusion-7B-Video2World
Updated • 2.31k • 39
nvidia/Cosmos-1.0-Diffusion-14B-Text2World
Updated • 650 • 60
nvidia/Cosmos-1.0-Diffusion-14B-Video2World
Updated • 323 • 57
nvidia/Cosmos-1.0-Autoregressive-13B-Video2World
Updated • 5 • 32
nvidia/Cosmos-1.0-Autoregressive-12B
Updated • 30
nvidia/Cosmos-1.0-Autoregressive-5B-Video2World
Updated • 15 • 30
nvidia/Cosmos-1.0-Guardrail
Updated • 483 • 59
nvidia/Cosmos-1.0-Diffusion-7B-Decoder-DV8x16x16ToCV8x8x8
nvidia/Cosmos-1.0-Autoregressive-4B
Updated • 20 • 56
nvidia/Cosmos-1.0-Diffusion-7B-Text2World
Text-to-Video • Updated • 5.07k • 233
mradermacher/Llama-3_1-Nemotron-51B-Instruct-GGUF
52B • Updated • 92 • 1
mradermacher/Llama-3_1-Nemotron-51B-Instruct-i1-GGUF
52B • Updated • 205 • 1
mradermacher/Llama-3.2-Nemotron-3B-Instruct-GGUF
3B • Updated • 51
mradermacher/Llama-3.2-Nemotron-3B-Instruct-i1-GGUF
3B • Updated • 80