--- license: apache-2.0 --- This repository contains the weights of [ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing](https://arxiv.org/abs/2506.21448). Project Paper: https://thinksound-project.github.io/. If you find our work useful, please cite our paper: ```bibtex @misc{liu2025thinksoundchainofthoughtreasoningmultimodal, title={ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing}, author={Huadai Liu and Jialei Wang and Kaicheng Luo and Wen Wang and Qian Chen and Zhou Zhao and Wei Xue}, year={2025}, eprint={2506.21448}, archivePrefix={arXiv}, primaryClass={eess.AS}, url={https://arxiv.org/abs/2506.21448}, } ```