backups
/

ThinkSound

Model card Files Files and versions

ThinkSound / README.md

mrfakename's picture

Duplicate from liuhuadai/ThinkSound

7c524e6 verified 7 months ago

|

history blame contribute delete

774 Bytes

	---
	license: apache-2.0
	---

	This repository contains the weights of [ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing](https://arxiv.org/abs/2506.21448).

	Project Page: https://thinksound-project.github.io/.

	If you find our work useful, please cite our paper:

	```bibtex
	@misc{liu2025thinksoundchainofthoughtreasoningmultimodal,
	title={ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing},
	author={Huadai Liu and Jialei Wang and Kaicheng Luo and Wen Wang and Qian Chen and Zhou Zhao and Wei Xue},
	year={2025},
	eprint={2506.21448},
	archivePrefix={arXiv},
	primaryClass={eess.AS},
	url={https://arxiv.org/abs/2506.21448},
	}
	```