Update README.md
Browse files
README.md
CHANGED
|
@@ -22,6 +22,14 @@ Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conv
|
|
| 22 |
<img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub Repository"></a>
|
| 23 |
</div>
|
| 24 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
## 🗞 News
|
| 26 |
|
| 27 |
- **[2025-11-08]** 🎉 Paper accepted by AAAI 2026
|
|
@@ -40,14 +48,6 @@ Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conv
|
|
| 40 |
- [ ] Singing style control
|
| 41 |
- [ ] Improved quality
|
| 42 |
|
| 43 |
-
HQ-SVC is an efficient framework for high-quality zero-shot singing voice conversion (SVC) in low-resource scenarios. It achieves disentanglement of content and speaker features via a unified decoupled codec, and enhances synthesis quality through multi-feature fusion and progressive optimization.
|
| 44 |
-
|
| 45 |
-
Unlike existing methods that demand large datasets or heavy computational resources, **HQ-SVC** unifies:
|
| 46 |
-
- 🚀 Zero-shot conversion for unseen speakers without fine-tuning
|
| 47 |
-
- ⚡ Low-resource training (single consumer-grade GPU, <80h data)
|
| 48 |
-
- 🎧 Dual capabilities: high-quality singing voice conversion + voice super-resolution
|
| 49 |
-
- 🎯 Superior naturalness and speaker similarity compared to SOTA methods
|
| 50 |
-
|
| 51 |
## 🎸 Try Inference
|
| 52 |
### 1. Download Codes and Environment(下载代码和环境)
|
| 53 |
|
|
|
|
| 22 |
<img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub Repository"></a>
|
| 23 |
</div>
|
| 24 |
|
| 25 |
+
HQ-SVC is an efficient framework for high-quality zero-shot singing voice conversion (SVC) in low-resource scenarios. It achieves disentanglement of content and speaker features via a unified decoupled codec, and enhances synthesis quality through multi-feature fusion and progressive optimization.
|
| 26 |
+
|
| 27 |
+
Unlike existing methods that demand large datasets or heavy computational resources, **HQ-SVC** unifies:
|
| 28 |
+
- 🚀 Zero-shot conversion for unseen speakers without fine-tuning
|
| 29 |
+
- ⚡ Low-resource training (single consumer-grade GPU, <80h data)
|
| 30 |
+
- 🎧 Dual capabilities: high-quality singing voice conversion + voice super-resolution
|
| 31 |
+
- 🎯 Superior naturalness and speaker similarity compared to SOTA methods
|
| 32 |
+
|
| 33 |
## 🗞 News
|
| 34 |
|
| 35 |
- **[2025-11-08]** 🎉 Paper accepted by AAAI 2026
|
|
|
|
| 48 |
- [ ] Singing style control
|
| 49 |
- [ ] Improved quality
|
| 50 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 51 |
## 🎸 Try Inference
|
| 52 |
### 1. Download Codes and Environment(下载代码和环境)
|
| 53 |
|