Zeyue7 commited on
Commit
a90b58b
·
1 Parent(s): 8e48b17
Files changed (1) hide show
  1. README.md +16 -2
README.md CHANGED
@@ -4,9 +4,9 @@ license: cc-by-nc-4.0
4
 
5
  # AudioX
6
 
7
- ## AudioX: A Simple Audio-to-Audio Generation Framework with Long-Short-Term Modeling
8
 
9
- [TL;DR]: AudioX is a framework for generating high-fidelity audio aligned with audio content, utilizing Long-Short-Term modeling, and has been accepted to CVPR 2025.
10
 
11
  ### Links
12
  - **[Paper](https://arxiv.org/abs/2503.10522)**: Explore the research behind VidMuse.
@@ -18,3 +18,17 @@ license: cc-by-nc-4.0
18
  GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/Zeyue7/AudioX
19
  cd AudioX
20
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
 
5
  # AudioX
6
 
7
+ ## 🎧 AudioX: Diffusion Transformer for Anything-to-Audio Generation
8
 
9
+ [TL;DR]: AudioX is a unified Diffusion Transformer model for Anything-to-Audio and Music Generation, capable of generating high-quality general audio and music, offering flexible natural language control, and seamlessly processing various modalities including text, video, image, music, and audio.
10
 
11
  ### Links
12
  - **[Paper](https://arxiv.org/abs/2503.10522)**: Explore the research behind VidMuse.
 
18
  GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/Zeyue7/AudioX
19
  cd AudioX
20
  ```
21
+
22
+
23
+
24
+ ## Citation
25
+ If you find our work useful, please consider citing:
26
+
27
+ ```
28
+ @article{tian2025audiox,
29
+ title={AudioX: Diffusion Transformer for Anything-to-Audio Generation},
30
+ author={Tian, Zeyue and Jin, Yizhu and Liu, Zhaoyang and Yuan, Ruibin and Tan, Xu and Chen, Qifeng and Xue, Wei and Guo, Yike},
31
+ journal={arXiv preprint arXiv:2503.10522},
32
+ year={2025}
33
+ }
34
+ ```