Instructions for using NeoDim/starchat-alpha-GGML with libraries, inference providers, notebooks, and local apps. Follow the links below to get started.
- Libraries
- Transformers
How to use NeoDim/starchat-alpha-GGML with Transformers:
```python
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("NeoDim/starchat-alpha-GGML", dtype="auto")
```

- Notebooks
- Google Colab
- Kaggle
Community discussions:
- #4: demo space (2 comments, opened almost 3 years ago by matthoffner)
- #3: Looks like the starchat-alpha-ggml-q4_1.bin is broken (8 comments, opened almost 3 years ago by xhyi)
- #2: Which inference repo is this quantized for? (3 comments, opened almost 3 years ago by xhyi)
- #1: Can the quantized model be loaded in GPU to have faster inference? (6 comments, opened almost 3 years ago by MohamedRashad)