Add distilled checkpoint (warm start, val_loss=3.687) a2de593 verified LisaMegaWatts committed 6 days ago