Commit
·
a75ebf2
1
Parent(s):
b3aa8aa
first commit
Browse files
README.md
CHANGED
|
@@ -90,4 +90,17 @@ IPython.display.Audio(data=audio, rate=16000)
|
|
| 90 |
|
| 91 |
The auffusion model will be automatically downloaded from huggingface and saved in cache. Subsequent runs will load the model directly from cache.
|
| 92 |
|
| 93 |
-
Other audio manipulation examples can be seen in [https://github.com/happylittlecat2333/Auffusion/notebooks](https://github.com/happylittlecat2333/Auffusion/notebooks). We only show the default text-to-audio example here.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 90 |
|
| 91 |
The auffusion model will be automatically downloaded from huggingface and saved in cache. Subsequent runs will load the model directly from cache.
|
| 92 |
|
| 93 |
+
Other audio manipulation examples can be seen in [https://github.com/happylittlecat2333/Auffusion/notebooks](https://github.com/happylittlecat2333/Auffusion/notebooks). We only show the default text-to-audio example here.
|
| 94 |
+
|
| 95 |
+
## Citation
|
| 96 |
+
|
| 97 |
+
Please consider citing the following article if you found our work useful:
|
| 98 |
+
|
| 99 |
+
```bibtex
|
| 100 |
+
@article{xue2024auffusion,
|
| 101 |
+
title={Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation},
|
| 102 |
+
author={Jinlong Xue and Yayue Deng and Yingming Gao and Ya Li},
|
| 103 |
+
journal={arXiv preprint arXiv:2401.01044},
|
| 104 |
+
year={2024}
|
| 105 |
+
}
|
| 106 |
+
```
|