| license: mit | |
| ### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | |
| ### Official Model Repo | |
| #### Model Include: | |
| - Stage1-CAVP Pretrained Model. | |
| - Stage2-LDM Pretrained Model. | |
| - Double Guidance Classifier. | |
| <p align="center"> | |
| <img src="teaser.png"> | |
| </p> | |
| ## BibTeX | |
| ```bibtex | |
| @misc{luo2023difffoley, | |
| title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models}, | |
| author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao}, | |
| year={2023}, | |
| eprint={2306.17203}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.SD} | |
| } | |
| ``` | |