Commit History for demo/bin/llama-cli

Update CUDA binaries with mmq fix (b8190 -> b8191)
10697bd

pashak committed on

Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
f156592

pashak committed on