distill-m-6a3lnzvb-code/configs/sweep/M_phase2_lr2e8_largebatch.toml

Commit History

add phase-2 ultra-conservative sweep (J,K,L,M) + waiter that auto-launches after phase 1 from the best ckpt
729546e
verified

Delta-Vector committed on