---
license: mit
---

[ESC](https://arxiv.org/abs/2512.11831) trained with SiT-XL/2 for 600k iterations (240 epochs), reaching an FID-50k of approximately 5.77.