PregoPal / docs / README_modal_deploy.md

Commit History

refactor: deprecate llama-omni-server, use llama-server built-in omni endpoints
6451051

J.B-Lin committed on

modal/omni: PRODUCTION DEPLOY - text inference fully verified, llama-server running on T4
d7f81b5

J.B-Lin committed on

modal/omni: text inference verified (Chinese OK, English needs fix)
2400b46

J.B-Lin committed on

modal/omni: llama-server inference verified on T4
0e9e62e

J.B-Lin committed on

docs: add production script protect warnings to deploy guide
10796de

J.B-Lin committed on

docs: comprehensive Modal deploy guide (both solutions + pitfalls)
03e9a7c

J.B-Lin committed on