Commit History

Bisect AOTI compile duration upward: 600 → 1000
1e7f8dd

moose Claude Opus 4.7 (1M context) commited on

Lower AOTI compile duration 1200 → 600 (cap probe)
b7783cd

moose Claude Opus 4.7 (1M context) commited on

Lower AOTI compile duration 1500 → 1200 (cap probe)
e8d12ec

moose Claude Opus 4.7 (1M context) commited on

Fix device mismatch during AOT compilation warmup
c88fe83

moose Claude Opus 4.5 commited on

Revert "Fix OOM by deferring model loading and AOT compilation to runtime"
31c7abd

moose commited on

Fix OOM by deferring model loading and AOT compilation to runtime
bc7c2dd

moose Claude Opus 4.5 commited on

add AoTI + FA3 (#1)
6571814
verified

linoyts HF Staff commited on