fix: remove --voxcpm2-base-lm/acoustic args (not valid for llama-server CLI) 6230c98 J.B-Lin commited on 23 days ago
fix: rewrite test_omni to use llama-server built-in omni endpoints d67ae6b J.B-Lin commited on 23 days ago
refactor: deprecate llama-omni-server, use llama-server built-in omni endpoints 6451051 J.B-Lin commited on 23 days ago
fix: set n_parallel=1, n_ctx=4096 before omni_init to prevent n_seq_max > 256 e0c2f3a J.B-Lin commited on 23 days ago
test_omni: add stderr capture for omni-server when init fails 4429886 J.B-Lin commited on 23 days ago
tmp: health check / omni test scripts for debugging omni_init 500 c86f455 J.B-Lin commited on 23 days ago
fix: set params.model/vpm/apm/tts in server-omni.cpp omni_init handler so model files are found a850bb4 J.B-Lin commited on 23 days ago
modal/omni: text inference verified (Chinese OK, English needs fix) 2400b46 J.B-Lin commited on 24 days ago
chore: add build_llama_server.sh (local cross-compile script, unused) 12a8a38 J.B-Lin commited on 24 days ago
deploy_omni: llFile llama.cpp-omni compile succeeds on Modal T4 1a07461 J.B-Lin commited on 24 days ago
deploy: 成功部署 MiniCPM-o-4_5 到 Modal (预编译llama-cpp-python A100) c0f7bc8 J.B-Lin commited on 25 days ago
[modal] Fix: mmproj single-file, health endpoint, client API consistency f10673a J.B-Lin commited on 26 days ago
fix(deploy.py): 修复4个问题 - MODEL_DIR shadowing, find_mmproj_files硬编码, MainModel传参, 移除--multimodal flag c8e1c13 J.B-Lin commited on 26 days ago
fix: CUDA runtime LD_LIBRARY_PATH + CUDA_HOME in Modal deploy; add inference test script ea75111 J.B-Lin commited on 26 days ago
feat: llama.cpp deploys MiniCPM-o 4.5 on Modal - technical report and client API ff5d4ed J.B-Lin commited on 26 days ago