Commit 78c7577 · YatharthS · verified · 1 Parent(s): 3c5f84a

Update README.md

Files changed (1)
  1. README.md +7 -5
README.md CHANGED
@@ -3,13 +3,13 @@ license: apache-2.0
  pipeline_tag: audio-to-audio
  ---
 
- LavaSR is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS denoiser.
+ LavaSR(v2) is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS denoiser. It can enhance nearly 5000 seconds of audio in just 1 second while exceeding the quality of 6gb large diffusion models.
 
  ### Details
  * **Model Size:** 50mb for pytorch version.
  * **Input Rate:** Any from 8-48khz.
  * **Output Rate:** 48kHz
- * **Inference Speed:** 10-50x realtime on CPU and 400-4000x realtime depending on GPU.
+ * **Inference Speed:** 20-80x realtime on CPU and 800-5000x realtime depending on GPU.
 
  ### Use cases
  - Restore low quality audio datasets
@@ -18,11 +18,13 @@ LavaSR is a novel 50MB BWE(bandwidth extension) model along with the UL-UNAS den
 
  ### Benchmark Comparison
 
+ Please check out the repo for objective benchmarks: https://github.com/ysharma3501/LavaSR
+
  | Model | Speed on GPU(bs=1) | Size | Input range| Quality |
  | :--- | :--- | :--- | :--- | :--- |
- | **LavaSR** | **4000x** | **50MB** | **Any from 8-48khz** | **High** |
- | AudioSR | < 1x realtime | ~3gb+ | ~2-16khz | High |
- | AP-BWE(previous formal fastest) | < 400x realtime | ~200MB+ | 8khz/12khz/16khz | Medium |
+ | **LavaSR_v2** | **5000x** | **50MB** | **Any from 8-48khz** | **High** |
+ | AudioSR | < 1x realtime | ~3gb+ | ~2-16khz | Medium |
+ | AP-BWE(previous formal fastest) | < 400x realtime | ~200MB+ | 8khz/12khz/16khz | High |
  | NovaSR(previous informal fastest) | <3600x realtime | ~50KB+ | 16khz | Low |
 
  ### Usage