File size: 378 Bytes
a7d3247 c025faa | 1 2 3 4 5 6 7 8 9 10 | ---
license: bigscience-openrail-m
datasets:
- ILSVRC/imagenet-1k
---
# SmallDiT
复现经典的DiT工作([Scalable Diffusion Models with Transformers](https://arxiv.org/abs/2212.09748)),训练数据为ImageNet.
代码仓库: `https://github.com/lixiang90/ClassicalModels`
## vae
`vae.pt`是用于图像压缩的vae模型,把(256,256,3)的图像压缩为(32,32,4)的latents. |