LavaSR is a novel 50 MB bandwidth extension (BWE) model, provided along with the UL-UNAS denoiser.
Details
- Model Size: 50 MB for the PyTorch version.
- Input Rate: Any sample rate from 8 kHz to 48 kHz.
- Output Rate: 48 kHz
- Inference Speed: 10-50x realtime on CPU and 400-4000x realtime on GPU
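
To put the speed figures in practical terms, here is a minimal sketch that converts a realtime factor into an estimated processing time. It assumes "Nx realtime" means N seconds of audio are processed per second of compute. The helper name `processing_seconds` is hypothetical and is not part of LavaSR.

```python
def processing_seconds(audio_seconds: float, realtime_factor: float) -> float:
    """Estimate wall-clock seconds needed to process audio at a given realtime factor.

    Hypothetical helper for illustration only; not part of the LavaSR API.
    """
    if realtime_factor <= 0:
        raise ValueError("realtime_factor must be positive")
    return audio_seconds / realtime_factor


one_hour = 3600.0  # seconds of input audio

# Realtime factors taken from the "Inference Speed" line above.
for label, factor in [
    ("CPU, low end", 10),
    ("CPU, high end", 50),
    ("GPU, low end", 400),
    ("GPU, high end", 4000),
]:
    seconds = processing_seconds(one_hour, factor)
    print(f"{label}: about {seconds:.1f} s per hour of audio")
```

Under that assumption, one hour of audio would take roughly 72 to 360 seconds on CPU and roughly 0.9 to 9 seconds on GPU. Actual throughput depends on hardware and batch size.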
Use cases
- Restore low-quality audio datasets.
- Enhance TTS or ASR model quality.
- Upscale poor-quality voice calls.
Benchmark Comparison
| Model | Speed | Size |
|---|---|---|
| LavaSR | 400-4000x realtime | 50 MB |
| AudioSR | < 1x realtime | ~3 GB+ |
| AP-BWE (previous fastest) | < 400x realtime | ~200 MB+ |
Usage
Usage instructions can be found here: https://github.com/ysharma3501/LavaSR
Final notes
The model and code are licensed under the Apache-2.0 license. See LICENSE for details.
Stars/Likes would be appreciated, thank you.
Email: yatharthsharma3501@gmail.com