YatharthS
/

LavaSR

YatharthS commited on Feb 27

Commit

78c7577

verified ·

1 Parent(s): 3c5f84a

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -3,13 +3,13 @@ license: apache-2.0
 pipeline_tag: audio-to-audio
 ---
-LavaSR is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS denoiser.
 ### Details
 * **Model Size:** 50mb for pytorch version.
 * **Input Rate:** Any from 8-48khz.
 * **Output Rate:** 48kHz
-* **Inference Speed:** 10-50x realtime on CPU and 400-4000x realtime depending on GPU.
 ### Use cases
 - Restore low quality audio datasets
@@ -18,11 +18,13 @@ LavaSR is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS den
 ### Benchmark Comparison
 | Model | Speed on GPU(bs=1) | Size | Input range| Quality |
 | :--- | :--- | :--- | :--- | :--- |
-| **LavaSR** | **4000x** | **50MB** | **Any from 8-48khz** | **High** |
-| AudioSR | < 1x realtime | ~3gb+ | ~2-16khz | High |
-| AP-BWE(previous formal fastest) | < 400x realtime | ~200MB+ | 8khz/12khz/16khz | Medium |
 | NovaSR(previous informal fastest) | <3600x realtime | ~50KB+ | 16khz | Low |
 ### Usage

 pipeline_tag: audio-to-audio
 ---
+LavaSR(v2) is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS denoiser. It can enhance nearly 5000 seconds of audio in just 1 second while exceeding the quality of 6gb large diffusion models.
 ### Details
 * **Model Size:** 50mb for pytorch version.
 * **Input Rate:** Any from 8-48khz.
 * **Output Rate:** 48kHz
+* **Inference Speed:** 20-80x realtime on CPU and 800-5000x realtime depending on GPU.
 ### Use cases
 - Restore low quality audio datasets
 ### Benchmark Comparison
+Please check out the repo for objective benchmarks: https://github.com/ysharma3501/LavaSR
 | Model | Speed on GPU(bs=1) | Size | Input range| Quality |
 | :--- | :--- | :--- | :--- | :--- |
+| **LavaSR_v2** | **5000x** | **50MB** | **Any from 8-48khz** | **High** |
+| AudioSR | < 1x realtime | ~3gb+ | ~2-16khz | Medium |
+| AP-BWE(previous formal fastest) | < 400x realtime | ~200MB+ | 8khz/12khz/16khz | High |
 | NovaSR(previous informal fastest) | <3600x realtime | ~50KB+ | 16khz | Low |
 ### Usage