Inference Providers
Active filters: nvidia
Dracones/Llama-3.1-Nemotron-70B-Instruct_exl2_4.0bpw
Text Generation
• Updated • 1
Dracones/Llama-3.1-Nemotron-70B-Instruct_exl2_3.5bpw
Text Generation
• Updated • 3
Dracones/Llama-3.1-Nemotron-70B-Instruct_exl2_3.0bpw
Text Generation
• Updated • 1
tensorblock/Llama-3.1-Nemotron-70B-Instruct-HF-bf16-GGUF
Text Generation
• 71B • Updated • 45
ymcki/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• Updated • 305
• 14
tensorblock/Llama-3.1-Nemotron-70B-Instruct-GGUF
Text Generation
• 71B • Updated • 35
KnutJaegersberg/Llama3-ChatQA-2-70B-4.65bpw-exl2
Text Generation
• Updated tensorblock/Llama3-ChatQA-1.5-70B-GGUF
Text Generation
• 71B • Updated • 32
sandbox-ai/Llama-3.1-Tango-70b-bnb_4b
Text Generation
• 73B • Updated • 5
• 4
Text Generation
• Updated jeorjesami/NividiaLatestModel
Text Generation
• Updated • 1
jacobcarajo/OpenMath2-Llama3.1-8B-Q5_K_M-GGUF
8B • Updated • 9
cnfusion/Llama-3.1-Nemotron-70B-Instruct-HF-Q2-mlx
Text Generation
• 7B • Updated • 15
Image-Text-to-Text
• Updated • 6
nvidia/Llama-2-13B-DMC-4x
nvidia/Llama-2-13B-DMC-8x
backyardai/Llama-3.1-Nemotron-70B-Instruct-GGUF
Text Generation
• 71B • Updated • 111
second-state/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• 52B • Updated • 960
gaianet/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• 52B • Updated • 918
tensorblock/Llama-3_1-Nemotron-51B-Instruct-GGUF
Text Generation
• 52B • Updated • 284
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation
• 11B • Updated • 14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8
Text Generation
• 71B • Updated • 10
mradermacher/llama-3-nvidia-ChatQA-1.5-8B-GGUF
8B • Updated • 66
mradermacher/llama-3-nvidia-ChatQA-1.5-8B-i1-GGUF
8B • Updated • 103
matatonic/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated-6.5bpw-h8-exl2
Text Generation
• Updated • 1
nvidia/Cosmos-1.0-Tokenizer-CV8x8x8
Updated • 188
• 24
nvidia/Cosmos-1.0-Tokenizer-DV8x16x16
Updated • 52
• 18
nvidia/Cosmos-1.0-Prompt-Upsampler-12B-Text2World
Updated • 21
• 13