zghhui
/

JavisBench_model

Model card Files Files and versions

JavisBench_model / Diff-Foley /README.md

zghhui's picture

Add files using upload-large-folder tool

5c26320 verified 19 days ago

|

history blame contribute delete

604 Bytes

	---
	license: mit
	---

	### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
	### Official Model Repo

	#### Model Include:
	- Stage1-CAVP Pretrained Model.
	- Stage2-LDM Pretrained Model.
	- Double Guidance Classifier.

	<p align="center">
	<img src="teaser.png">
	</p>

	## BibTeX

	```bibtex
	@misc{luo2023difffoley,
	title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models},
	author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao},
	year={2023},
	eprint={2306.17203},
	archivePrefix={arXiv},
	primaryClass={cs.SD}
	}
	```