| license: apache-2.0 | |
| This repository contains the weights of [ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing](https://arxiv.org/abs/2506.21448). | |
| Project Page: https://thinksound-project.github.io/. | |
| If you find our work useful, please cite our paper: | |
| ```bibtex | |
| @misc{liu2025thinksoundchainofthoughtreasoningmultimodal, | |
| title={ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing}, | |
| author={Huadai Liu and Jialei Wang and Kaicheng Luo and Wen Wang and Qian Chen and Zhou Zhao and Wei Xue}, | |
| year={2025}, | |
| eprint={2506.21448}, | |
| archivePrefix={arXiv}, | |
| primaryClass={eess.AS}, | |
| url={https://arxiv.org/abs/2506.21448}, | |
| } | |
| ``` |