refactor: deprecate llama-omni-server, use llama-server built-in omni endpoints 6451051 J.B-Lin committed on 23 days ago
modal/omni: PRODUCTION DEPLOY - text inference fully verified, llama-server running on T4 d7f81b5 J.B-Lin committed on 24 days ago
modal/omni: text inference verified (Chinese OK, English needs fix) 2400b46 J.B-Lin committed on 24 days ago
docs: comprehensive Modal deploy guide (both solutions + pitfalls) 03e9a7c J.B-Lin committed on 24 days ago
deploy_omni: llFile llama.cpp-omni compile succeeds on Modal T4 1a07461 J.B-Lin committed on 24 days ago
feat: deploy MiniCPM-o 4.5 on Modal with llama.cpp - technical report and client API ff5d4ed J.B-Lin committed on 26 days ago
chore: save current version before UI improvements (i18n + font fix) ec90eae J.B-Lin committed on 27 days ago