Fix ZeroGPU: preload CUDA runtime libs for the llama.cpp wheel 76e2a61 marcodsn Claude Opus 4.8 commited on 16 days ago
Fix Python 3.10 startup: tomllib fallback + pin python_version 3.12 83b2601 marcodsn Claude Opus 4.8 commited on 16 days ago
Run a local small model (MiniCPM3-4B via llama.cpp) instead of a cloud API d000651 marcodsn Claude Opus 4.8 commited on 16 days ago