Update README.md
Browse files
README.md
CHANGED
|
@@ -3,13 +3,13 @@ license: apache-2.0
|
|
| 3 |
pipeline_tag: audio-to-audio
|
| 4 |
---
|
| 5 |
|
| 6 |
-
LavaSR is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS denoiser.
|
| 7 |
|
| 8 |
### Details
|
| 9 |
* **Model Size:** 50mb for pytorch version.
|
| 10 |
* **Input Rate:** Any from 8-48khz.
|
| 11 |
* **Output Rate:** 48kHz
|
| 12 |
-
* **Inference Speed:**
|
| 13 |
|
| 14 |
### Use cases
|
| 15 |
- Restore low quality audio datasets
|
|
@@ -18,11 +18,13 @@ LavaSR is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS den
|
|
| 18 |
|
| 19 |
### Benchmark Comparison
|
| 20 |
|
|
|
|
|
|
|
| 21 |
| Model | Speed on GPU(bs=1) | Size | Input range| Quality |
|
| 22 |
| :--- | :--- | :--- | :--- | :--- |
|
| 23 |
-
| **
|
| 24 |
-
| AudioSR | < 1x realtime | ~3gb+ | ~2-16khz |
|
| 25 |
-
| AP-BWE(previous formal fastest) | < 400x realtime | ~200MB+ | 8khz/12khz/16khz |
|
| 26 |
| NovaSR(previous informal fastest) | <3600x realtime | ~50KB+ | 16khz | Low |
|
| 27 |
|
| 28 |
### Usage
|
|
|
|
| 3 |
pipeline_tag: audio-to-audio
|
| 4 |
---
|
| 5 |
|
| 6 |
+
LavaSR(v2) is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS denoiser. It can enhance nearly 5000 seconds of audio in just 1 second while exceeding the quality of 6gb large diffusion models.
|
| 7 |
|
| 8 |
### Details
|
| 9 |
* **Model Size:** 50mb for pytorch version.
|
| 10 |
* **Input Rate:** Any from 8-48khz.
|
| 11 |
* **Output Rate:** 48kHz
|
| 12 |
+
* **Inference Speed:** 20-80x realtime on CPU and 800-5000x realtime depending on GPU.
|
| 13 |
|
| 14 |
### Use cases
|
| 15 |
- Restore low quality audio datasets
|
|
|
|
| 18 |
|
| 19 |
### Benchmark Comparison
|
| 20 |
|
| 21 |
+
Please check out the repo for objective benchmarks: https://github.com/ysharma3501/LavaSR
|
| 22 |
+
|
| 23 |
| Model | Speed on GPU(bs=1) | Size | Input range| Quality |
|
| 24 |
| :--- | :--- | :--- | :--- | :--- |
|
| 25 |
+
| **LavaSR_v2** | **5000x** | **50MB** | **Any from 8-48khz** | **High** |
|
| 26 |
+
| AudioSR | < 1x realtime | ~3gb+ | ~2-16khz | Medium |
|
| 27 |
+
| AP-BWE(previous formal fastest) | < 400x realtime | ~200MB+ | 8khz/12khz/16khz | High |
|
| 28 |
| NovaSR(previous informal fastest) | <3600x realtime | ~50KB+ | 16khz | Low |
|
| 29 |
|
| 30 |
### Usage
|