refactor: deprecate llama-omni-server, use llama-server built-in omni endpoints 6451051 J.B-Lin commited on 23 days ago
modal/omni: PRODUCTION DEPLOY - text inference fully verified, llama-server running on T4 d7f81b5 J.B-Lin commited on 24 days ago
modal/omni: text inference verified (Chinese OK, English needs fix) 2400b46 J.B-Lin commited on 24 days ago
docs: comprehensive Modal deploy guide (both solutions + pitfalls) 03e9a7c J.B-Lin commited on 24 days ago