	# AudioSep: Separate Anything You Describe

	## Input

	* Mixed audio file

	An audio file in WAV format containing mixed sources. [input.wav](./input.wav)

	https://github.com/axinc-ai/ailia-models/assets/53651931/4b761212-a1c7-46dc-b598-a08e4c5ab7ff

	This audio file was adapted from the [official AudioSep implementation](https://github.com/Audio-AGI/AudioSep):

	https://audio-agi.github.io/Separate-Anything-You-Describe/demos/exp31_water/drops_mixture.wav

	* Text condition

	Text description of the sound source you want to separate.

	## Output

	* Audio file

	The audio source separated according to the text query.

	Saved to ```./output.wav``` by default; a different path can be specified with the ```-s``` option, as in the usage examples.
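As a quick sanity check on the separated file, the following minimal sketch reads its basic properties with Python's standard `wave` module. The helper name `describe_wav` is hypothetical, and the sketch assumes the output is an uncompressed PCM WAV file, which this README does not state explicitly.

```python
import wave


def describe_wav(path):
    """Return basic properties of a PCM WAV file (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            # Duration is the frame count divided by frames per second.
            "duration_s": frames / rate,
        }
```

For example, `describe_wav("./output.wav")` could be called after running the script to confirm that the file exists and has a plausible duration.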

	## Usage
	An internet connection is required the first time the script is run, because the model files are downloaded automatically.

	Running the script separates a sound source from the input audio file according to the text query.

	#### Example 1: Extract the sound of thunder
	```bash
	$ python3 audiosep.py -p "thunder" -i input.wav -s output_thunder.wav
	```
	https://github.com/axinc-ai/ailia-models/assets/53651931/d0d016dd-a808-4eb6-a4b5-9791f8f1bd2f

	#### Example 2: Extract the sound of water drops
	```bash
	$ python3 audiosep.py -p "water drops" -i input.wav -s output_waterdrops.wav
	```
	https://github.com/axinc-ai/ailia-models/assets/53651931/7710b6c9-49dc-4d2a-8489-ccbf7fb45591

	In both cases, a ```.wav``` file containing the sound source separated from the original mixture is created.
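When several sources need to be extracted from the same mixture, the two commands above can be generated from a list of queries. The sketch below only builds and prints the commands; the helper name `build_command` and the output naming rule (the query with spaces removed) are assumptions modeled on the examples, not part of `audiosep.py`.

```python
import shlex
import sys


def build_command(query, input_path="input.wav", script="audiosep.py"):
    """Build the argument list for one text query (hypothetical helper)."""
    # Mirror the example file names: "water drops" -> output_waterdrops.wav
    save_path = f"output_{query.replace(' ', '')}.wav"
    # Only the flags shown in the examples above are used: -p, -i, -s.
    return [sys.executable, script, "-p", query, "-i", input_path, "-s", save_path]


for query in ["thunder", "water drops"]:
    # Print a shell-safe version of each command without executing it.
    print(shlex.join(build_command(query)))
    # To actually run it, one option is:
    #   import subprocess; subprocess.run(build_command(query), check=True)
```

Passing the arguments as a list avoids shell quoting problems for queries that contain spaces.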

	## Reference

	* [AudioSep](https://github.com/Audio-AGI/AudioSep)
	* [Separate Anything You Describe](https://audio-agi.github.io/Separate-Anything-You-Describe/)

	## Framework

	PyTorch

	## Model Format

	ONNX opset=11

	## Netron

	* [audiosep_text.onnx.prototxt](https://netron.app/?url=https://storage.googleapis.com/ailia-models/audiosep/audiosep_text.onnx.prototxt)
	* [audiosep_resunet.onnx.prototxt](https://netron.app/?url=https://storage.googleapis.com/ailia-models/audiosep/audiosep_resunet.onnx.prototxt)