YingMusic-SingerGPU

Running on Zero

App Files Files Community

YingMusic-SingerGPU / README.md

xjsc0

111

3f33afc 6 days ago

preview code

raw

history blame contribute delete

2.46 kB

	---
	title: YingMusic-Singer
	emoji: 🎤
	colorFrom: pink
	colorTo: blue
	sdk: gradio
	python_version: "3.10"
	app_file: app.py
	tags:
	- singing-voice-synthesis
	- lyric-editing
	- diffusion-model
	- reinforcement-learning
	short_description: Edit lyrics, keep the melody
	fullWidth: true
	---

	# YingMusic-Singer
	YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance

	## Environment Setup

	### 1. Install from Scratch
	```bash
	conda create -n YingMusic-Singer python=3.10
	conda activate YingMusic-Singer

	# uv is much quicker
	pip install uv
	uv pip install -r requirements.txt
	```

	### 2. Pre-built Conda Environment for One-Click Deployment (Nvidia / AMD CPU Only)

	Coming soon

	## 推理

	### 使用huggingface Space（线上体验）

	访问https://huggingface.co/spaces/ASLP-lab/YingMusic-Singer之后，就可以快速体验

	### 使用Docker运行

	docker build -t yingmusic-singer .

	### 使用python运行

	git clone
	cd
	python initialization.py --task infer

	# for Gradio

	python app.py

	# 多进程 Inference
	# 1. 你需要确保所有输入模型的均为分离之后的纯人声，如果没有分离，可以参考/src/third_party/MusicSourceSeparationTraining/inference_api.py 进行分离
	# 2. jsonl 文件的格式为，每行一个json，{}
	python batch_infer.py \
	--input_type jsonl \
	--input_path /path/to/input.jsonl \
	--output_dir /path/to/output \
	--ckpt_path /path/to/ckpts \
	--num_gpus 4

	# 多进程 Inference(LyricEditBench melody control)
	python inference_mp.py \
	--input_type lyric_edit_bench_melody_control \
	--output_dir path/to/ \
	LyricEditBench_melody_control \
	--ckpt_path ASLP-lab/YingMusic-Singer \
	--num_gpus 8

	# 多进程 Inference(LyricEditBench sing edit)
	python inference_mp.py \
	--input_type lyric_edit_bench_sing_edit \
	--output_dir path/to/ \
	LyricEditBench_melody_control \
	--ckpt_path ASLP-lab/YingMusic-Singer \
	--num_gpus 8



	## License

	The code and model weights in this project are licensed under [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/), except for the following components:

	The VAE model weights and inference code (in `src/YingMusic-Singer/utils/stable-audio-tools`) are derived from [Stable Audio Open](https://huggingface.co/stabilityai/stable-audio-open-1.0) by Stability AI, and are licensed under the [Stability AI Community License](./LICENSE-STABILITY).