Docker Space: llama.cpp CUDA inference with multi-GPU load balancing (commit f156592, committed by pashak 29 days ago)