Spaces: prism-ml/demo

Running on L40S

Commit History

Update CUDA binaries with mmq fix (b8190 -> b8191)

10697bd

pashak committed 21 days ago

Docker Space: llama.cpp CUDA inference with multi-GPU load balancing

f156592

pashak committed 25 days ago

initial commit

6f1db0b
verified

pashak committed 27 days ago