Commit a262e84 (verified) · committed by Metacebertrunk · 1 Parent(s): 27e4315

Update README.md

  1. README.md +25 -10
README.md CHANGED
@@ -1,10 +1,25 @@
- ---
- title: README
- emoji: 🔥
- colorFrom: red
- colorTo: gray
- sdk: static
- pinned: false
- ---
-
- Edit this `README.md` markdown file to author your organization card.
+ # Unified-Audio: An Open-Source Project to Unify Audio Processing and Generation
+
+ This project collects a series of works on audio processing and generation (covering speech, music, and general audio events) to support reproducible research in the audio field. The goal of **Unified-Audio** is to explore a unified framework that handles **different audio processing and generation tasks**, including:
+
+ - **SR**: Speech Restoration (⛳ supported)
+ - **TSE**: Target Speaker Extraction (⛳ supported)
+ - **SS**: Speech Separation (⛳ supported)
+ - **VC**: Voice Conversion (⛳ supported)
+ - **LASS**: Language-Queried Audio Source Separation (⛳ supported)
+ - **CODEC**: Audio Tokenization (⛳ supported)
+ - **AE**: Audio Editing (⛳ developing)
+ - **TTA**: Text to Audio (⛳ developing)
+ - more...
+
+ In addition to frameworks for specific audio tasks, **Unified-Audio** also provides work on **neural audio codecs (NAC)**, the fundamental module for combining the audio modality with language models.
+
+
+ ## 🚀 News
+ - **2025/09/22**: We release [***UniSE***](https://github.com/hyyan2k/UniSE), a foundation model for unified speech generation. The system supports target speaker extraction and universal speech enhancement. [***demo***](https://hyyan2k.github.io/UniSE/), [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2510.20441). Code is coming soon.
+ - **2025/10/26**: We release [***UniTok-Audio***](https://github.com/alibaba/unified-audio). The system supports target speaker extraction, universal speech enhancement, speech restoration, voice conversion, language-queried audio source separation, and audio tokenization. [***demo***](https://alibaba.github.io/unified-audio/), [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2510.26372)
+ ## Key Works
+ ### UniSE
+ [UniSE](https://github.com/alibaba/unified-audio/tree/main/UniSE): A Unified Framework for Decoder-Only Autoregressive LM-Based Speech Enhancement [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2510.20441)
+ Supported tasks: **SR**, **TSE**, **SS**
+