Spaces:
Configuration error
Configuration error
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,10 +1,25 @@
|
|
| 1 |
-
|
| 2 |
-
|
| 3 |
-
|
| 4 |
-
|
| 5 |
-
|
| 6 |
-
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Unified-Audio: An Open-Source Project to Unify Audio Processing and Generation
|
| 2 |
+
|
| 3 |
+
This project contains a series of works developed for audio (including speech, music, and general audio events) processing and generation, which helps reproducible research in the field of audio. The target of **Unified-Audio** is to explore a unified framework to handle **different audio processing and generation tasks**, including:
|
| 4 |
+
|
| 5 |
+
- **SR**: Speech Restoration (⛳ supported)
|
| 6 |
+
- **TSE**: Target Speaker Extraction (⛳ supported)
|
| 7 |
+
- **SS**: Speech Separation (⛳ supported)
|
| 8 |
+
- **VC**: Voice Conversion (⛳ supported)
|
| 9 |
+
- **LASS**: Language-Queried Audio Source Separation (⛳ supported)
|
| 10 |
+
- **CODEC**: Audio Tokenization (⛳ supported)
|
| 11 |
+
- **AE**: Audio Editing (⛳ developing)
|
| 12 |
+
- **TTA**: Text to Audio (⛳ developing)
|
| 13 |
+
- more...
|
| 14 |
+
|
| 15 |
+
In addition to the frameworks for specific audio tasks, **Unified-Audio** also provides works involving **neural audio codec (NAC)**, which is the fundamental module to combine audio modality with language models.
|
| 16 |
+
|
| 17 |
+
|
| 18 |
+
## 🚀 News
|
| 19 |
+
- **2025/09/22**: We release [***UniSE***](https://github.com/hyyan2k/UniSE), a foundation model for unified speech generation. The system supports target speaker extraction, universal speech enhancement.[***demo***](https://hyyan2k.github.io/UniSE/), [](https://arxiv.org/abs/2510.20441),Code will comming soon.
|
| 20 |
+
- **2025/10/26**: We release [***UniTok-Audio***](https://github.com/alibaba/unified-audio), The system supports target speaker extraction, universal speech enhancement, Speech Restoration, Voice Conversion, Language-Queried Audio Source Separation, Audio Tokenization,[***demo***](https://alibaba.github.io/unified-audio/), [](https://arxiv.org/abs/2510.26372)
|
| 21 |
+
## key Works
|
| 22 |
+
### UniSE
|
| 23 |
+
[UniSE](https://github.com/alibaba/unified-audio/tree/main/UniSE): A Unified Framework for Decoder-Only Autoregressive LM-Based Speech Enhancement[](https://arxiv.org/abs/2510.20441)
|
| 24 |
+
supported tasks: **SR**, **TSE**, **SS**
|
| 25 |
+
|