YatharthS commited on
Commit
1732986
·
verified ·
1 Parent(s): ee7c892

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -2
README.md CHANGED
@@ -1,11 +1,37 @@
1
  ---
2
- license: cc-by-4.0
3
  pipeline_tag: audio-to-audio
4
  tags:
5
  - pytorch
6
  - audio
7
  - upsampling
8
  ---
 
9
 
 
10
 
11
- This is a high quality and incredibly fast audio upsampler. It upscales audio from 16khz to 48khz rapidly, far exceeding speeds of alternatives such as resemble-enhance/clearervoice and reaching above 120x realtime on consumer gpus.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: apache-2.0
3
  pipeline_tag: audio-to-audio
4
  tags:
5
  - pytorch
6
  - audio
7
  - upsampling
8
  ---
9
+ # FlashSR
10
 
11
+ FlashSR is a 2MB audio super-resolution model based on the HierSpeech++ architecture. It upscales 16kHz audio to 48kHz at speeds ranging from 200x to 400x real-time.
12
 
13
+ ### Details
14
+ * **Model Size:** 2MB
15
+ * **Input Rate:** 16kHz
16
+ * **Output Rate:** 48kHz
17
+ * **Inference Speed:** 200x - 400x real-time depending on gpu and dtype
18
+
19
+ ### Performance Summary
20
+ FlashSR is designed for high-speed frequency reconstruction. It offers a significantly lower computational footprint compared to alternatives such as Resemble-Enhance and ClearerVoice, while maintaining similar output quality.
21
+
22
+
23
+
24
+ ### Benchmark Comparison
25
+
26
+ | Model | Speed | Size |
27
+ | :--- | :--- | :--- |
28
+ | **FlashSR** | **200x - 400x realtime** | **2MB** |
29
+ | Resemble-Enhance | < 20x realtime | ~700MB+ |
30
+ | ClearerVoice | < 20x realtime | ~200MB+ |
31
+
32
+ ### Usage
33
+ Usage instructions and source code are available on GitHub:
34
+ https://github.com/ysharma3501/FlashSR
35
+
36
+ ### Credits
37
+ Thanks to the authors of **HierSpeech++** as this was based on it's 48khz upsampler.