Request for GGUF conversion or version for AMD GPUs
#1
by
Swastikjatt7 - opened
Bro, can you convert the model from FP8 to GGUF? I'm using AMD GPU clusters, which don't have native support for FP8 quantization, so could you convert it from FP8 to GGUF using llama.cpp? I tried myself but failed because of my system specs.