fix: use pre-compiled llama-cpp-python wheel + model in image 0408500 verified hugh007 commited on 21 days ago
fix: use pre-compiled llama-server binary (zero compilation) df05d22 verified hugh007 commited on 21 days ago