feat: training speed optimizations — mixed precision, vectorized augmentation, cached eval predictions 1fe1a19
lemousehunter committed on