nvidia
/

Nemotron-4-340B-Instruct

Model card Files Files and versions

Gguf

#5

by iHaag - opened Jun 19, 2024

Can you run this with llama.cpp?

NVIDIA org Jun 19, 2024

Probably not at this time -- I did a quick search and it doesn't seem that llama.cpp supports NeMo models.

laugh
out
loud

•

edited Jul 14, 2024

Yes you can, at least with my branch. Check this out for details: https://github.com/ggerganov/llama.cpp/issues/7966#issuecomment-2227104693

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment