Docker Space: llama.cpp CUDA inference with multi-GPU load balancing (commit f156592, pashak committed 25 days ago)