Spaces:
Running on Zero
Running on Zero
File size: 2,456 Bytes
dae4a7c 3a0b618 dae4a7c 3a0b618 dae4a7c 3f33afc dae4a7c | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 | ---
title: YingMusic-Singer
emoji: 🎤
colorFrom: pink
colorTo: blue
sdk: gradio
python_version: "3.10"
app_file: app.py
tags:
- singing-voice-synthesis
- lyric-editing
- diffusion-model
- reinforcement-learning
short_description: Edit lyrics, keep the melody
fullWidth: true
---
# YingMusic-Singer
YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance
## Environment Setup
### 1. Install from Scratch
```bash
conda create -n YingMusic-Singer python=3.10
conda activate YingMusic-Singer
# uv is much quicker
pip install uv
uv pip install -r requirements.txt
```
### 2. Pre-built Conda Environment for One-Click Deployment (Nvidia / AMD CPU Only)
Coming soon
## 推理
### 使用huggingface Space(线上体验)
访问https://huggingface.co/spaces/ASLP-lab/YingMusic-Singer之后,就可以快速体验
### 使用Docker运行
docker build -t yingmusic-singer .
### 使用python运行
git clone
cd
python initialization.py --task infer
# for Gradio
python app.py
# 多进程 Inference
# 1. 你需要确保所有输入模型的均为分离之后的纯人声,如果没有分离,可以参考/src/third_party/MusicSourceSeparationTraining/inference_api.py 进行分离
# 2. jsonl 文件的格式为,每行一个json,{}
python batch_infer.py \
--input_type jsonl \
--input_path /path/to/input.jsonl \
--output_dir /path/to/output \
--ckpt_path /path/to/ckpts \
--num_gpus 4
# 多进程 Inference(LyricEditBench melody control)
python inference_mp.py \
--input_type lyric_edit_bench_melody_control \
--output_dir path/to/ \
LyricEditBench_melody_control \
--ckpt_path ASLP-lab/YingMusic-Singer \
--num_gpus 8
# 多进程 Inference(LyricEditBench sing edit)
python inference_mp.py \
--input_type lyric_edit_bench_sing_edit \
--output_dir path/to/ \
LyricEditBench_melody_control \
--ckpt_path ASLP-lab/YingMusic-Singer \
--num_gpus 8
## License
The code and model weights in this project are licensed under [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/), except for the following components:
The VAE model weights and inference code (in `src/YingMusic-Singer/utils/stable-audio-tools`) are derived from [Stable Audio Open](https://huggingface.co/stabilityai/stable-audio-open-1.0) by Stability AI, and are licensed under the [Stability AI Community License](./LICENSE-STABILITY). |