niobures's picture
DTLN (models: ailia-models)
e6d49fe verified
# Dual-signal Transformation LSTM Network
## input
audio file(16kHz)
```
LibriSpeech ASR corpus
http://www.openslr.org/12
1221-135766-0000.wav
```
## Output
Audio file with noise removed
## Usage
Automatically downloads the onnx and prototxt files on the first run.
It is necessary to be connected to the Internet while downloading.
For the sample wav,
```bash
$ python3 dtln.py
```
If you want to specify the audio, put the file path after the `--input` option.
```bash
$ python3 dtln.py --input AUDIO_FILE
```
If you run by onnxruntime instead of ailia, you use --onnx option.
## Reference
- [Dual-signal Transformation LSTM Network](https://github.com/breizhn/DTLN)
## Framework
Pytorch
## Model Format
ONNX opset=11
## Netron
- [dtln1.onnx.prototxt](https://netron.app/?url=https://storage.googleapis.com/ailia-models/dtln/dtln1.onnx.prototxt)
- [dtln2.onnx.prototxt](https://netron.app/?url=https://storage.googleapis.com/ailia-models/dtln2/dtln2.onnx.prototxt)