drbaph commited on
Commit
d87f0fc
·
verified ·
1 Parent(s): dfbaa6a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +71 -6
README.md CHANGED
@@ -15,27 +15,92 @@ tags:
15
 
16
  ---
17
 
18
- # converted main .pt model to .safetensors
19
- # bf16 + fp32
20
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
  ---
23
 
24
  <div align="center">
25
- <b><em> Towards High-Quality Zero-Shot Singing Voice Synthesis</em></b>
26
- </p>
27
  <p>
28
  <img src="assets/soulx-logo.png" alt="SoulX-Singer_Logo" style="height: 80px;">
29
  </p>
30
  <p>
31
- </p>
32
  <a href="https://soul-ailab.github.io/soulx-singer/"><img src="https://img.shields.io/badge/Demo-Page-lightgrey" alt="version"></a>
33
  <a href="https://github.com/Soul-AILab/SoulX-Singer"><img src='https://img.shields.io/badge/Github-Page-green' alt="Github"></a>
34
  <a href="https://arxiv.org/abs/2602.07803"><img src="https://img.shields.io/badge/arXiv-2602.07803-b31b1b" alt="arXiv"></a>
35
  <a href="https://github.com/Soul-AILab/SoulX-Singer/blob/main/assets/technical-report.pdf"><img src='https://img.shields.io/badge/Report-Github?label=Technical&color=red' alt="technical report"></a>
36
  <a href="https://github.com/Soul-AILab/SoulX-Singer"><img src="https://img.shields.io/badge/License-Apache%202.0-blue.svg" alt="Apache-2.0"></a>
 
37
  </div>
38
 
 
 
 
 
39
  **SoulX-Singer** is a high-fidelity, zero-shot singing voice synthesis model that enables users to generate realistic singing voices for unseen singers. It supports melody-conditioned (F0 contour) and score-conditioned (MIDI notes) control for precise pitch, rhythm, and expression.
40
 
41
- For more details, please refer to the paper: [SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis](https://arxiv.org/abs/2602.07803).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  ---
17
 
18
+ ## ComfyUI Custom Node
 
19
 
20
+ This repository includes a custom node for ComfyUI integration:
21
+
22
+ 🔗 **[ComfyUI-SoulX-Singer](https://github.com/Saganaki22/ComfyUI-SoulX-Singer)**
23
+
24
+
25
+ ![Screenshot 2026-02-11 160905](https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/FqxVnkFDrVt287ppwQj90.png)
26
+
27
+ Use this custom node to integrate SoulX-Singer into your ComfyUI workflows for seamless singing voice synthesis.
28
+
29
+ # SoulX-Singer: Converted .pt model to .safetensors
30
+ **bf16 + fp32**
31
+
32
+ ## Audio Samples
33
+
34
+ ### Original Audio
35
+ <audio controls>
36
+ <source src="samples/song.mp3" type="audio/mpeg">
37
+ Your browser does not support the audio element.
38
+ </audio>
39
+
40
+ ### SpongeBob Voice
41
+ <audio controls>
42
+ <source src="samples/generated/sample-1.mp3" type="audio/mpeg">
43
+ Your browser does not support the audio element.
44
+ </audio>
45
+
46
+ ### Male Voice
47
+ <audio controls>
48
+ <source src="samples/generated/sample-2.mp3" type="audio/mpeg">
49
+ Your browser does not support the audio element.
50
+ </audio>
51
 
52
  ---
53
 
54
  <div align="center">
55
+ <b><em>Towards High-Quality Zero-Shot Singing Voice Synthesis</em></b>
 
56
  <p>
57
  <img src="assets/soulx-logo.png" alt="SoulX-Singer_Logo" style="height: 80px;">
58
  </p>
59
  <p>
 
60
  <a href="https://soul-ailab.github.io/soulx-singer/"><img src="https://img.shields.io/badge/Demo-Page-lightgrey" alt="version"></a>
61
  <a href="https://github.com/Soul-AILab/SoulX-Singer"><img src='https://img.shields.io/badge/Github-Page-green' alt="Github"></a>
62
  <a href="https://arxiv.org/abs/2602.07803"><img src="https://img.shields.io/badge/arXiv-2602.07803-b31b1b" alt="arXiv"></a>
63
  <a href="https://github.com/Soul-AILab/SoulX-Singer/blob/main/assets/technical-report.pdf"><img src='https://img.shields.io/badge/Report-Github?label=Technical&color=red' alt="technical report"></a>
64
  <a href="https://github.com/Soul-AILab/SoulX-Singer"><img src="https://img.shields.io/badge/License-Apache%202.0-blue.svg" alt="Apache-2.0"></a>
65
+ </p>
66
  </div>
67
 
68
+ ---
69
+
70
+ ## Overview
71
+
72
  **SoulX-Singer** is a high-fidelity, zero-shot singing voice synthesis model that enables users to generate realistic singing voices for unseen singers. It supports melody-conditioned (F0 contour) and score-conditioned (MIDI notes) control for precise pitch, rhythm, and expression.
73
 
74
+ For more details, please refer to the paper: [SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis](https://arxiv.org/abs/2602.07803).
75
+
76
+
77
+ ---
78
+
79
+ ## Features
80
+
81
+ - **Zero-shot synthesis**: Generate singing voices for unseen singers without fine-tuning
82
+ - **Melody-conditioned control**: Use F0 contour for pitch guidance
83
+ - **Score-conditioned control**: Use MIDI notes for precise musical notation
84
+ - **High-fidelity output**: Realistic vocal synthesis with natural expression
85
+ - **Safetensors format**: Optimized model weights in bf16 + fp32 precision
86
+
87
+ ---
88
+
89
+ ## Citation
90
+
91
+ If you use SoulX-Singer in your research, please cite:
92
+
93
+ ```bibtex
94
+ @article{soulxsinger2025,
95
+ title={SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis},
96
+ author={Soul-AILab},
97
+ journal={arXiv preprint arXiv:2602.07803},
98
+ year={2025}
99
+ }
100
+ ```
101
+
102
+ ---
103
+
104
+ ## License
105
+
106
+ This project is licensed under the Apache License 2.0.