Commit History

v5.1: revert INT8 — too lossy for 0.6B, keep LoRA merge + float32
ef8b629

arthu1 commited on

v5: merge LoRA at startup + INT8 quantization + faster params
600fce7

arthu1 commited on

North Air 1 API — Instance 2 (load-balanced replica)
2b2f18c

arthu1 commited on

initial commit
77b73a9
verified

arthu1 commited on