--- license: bigscience-openrail-m datasets: - ILSVRC/imagenet-1k --- # SmallDiT 复现经典的DiT工作([Scalable Diffusion Models with Transformers](https://arxiv.org/abs/2212.09748)),训练数据为ImageNet. 代码仓库: `https://github.com/lixiang90/ClassicalModels` ## vae `vae.pt`是用于图像压缩的vae模型,把(256,256,3)的图像压缩为(32,32,4)的latents.