VFM-VAE / README.md

tiancibi

Update README.md

f9f1fdc verified 3 days ago

preview code

raw

history blame contribute delete

2.4 kB

metadata

license: cc-by-nc-4.0
tags:
  - latent-diffusion
  - vae
  - imagenet
  - CVPR2026
paper: https://huggingface.co/papers/2510.18457

🧩 VFM-VAE

Pretrained checkpoints, features, and samples for VFM-VAE, introduced in the paper:

Tianci Bi et al., "VFM-VAE: Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models", CVPR 2026 · arXiv:2510.18457

🎉 Accepted to CVPR 2026.

💻 Code: github.com/tianciB/VFM-VAE
📦 Includes: alignment data, ImageNet-256 & ImageNet-512 checkpoints, and diffusion samples

📝 Citation

@inproceedings{bi2026vfmvae,
  title     = {Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models},
  author    = {Bi, Tianci and Zhang, Xiaoyi and Lu, Yan and Zheng, Nanning},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},                                                                                   
  year      = {2026}                                                                                                                                                                        
}