Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
Pretrained checkpoints, features, and samples for VFM-VAE, introduced in the paper:
Tianci Bi et al., “Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models”, arXiv:2510.18457