File size: 2,057 Bytes
1a00032
 
 
 
dfea210
1a00032
 
dfea210
 
10cc928
1a00032
dfea210
 
 
1a00032
bcf7e57
 
1a00032
 
 
 
36c4ad2
1a00032
36c4ad2
 
1f1e1db
36c4ad2
 
 
 
 
1f1e1db
36c4ad2
 
e78bb03
bcf7e57
e78bb03
1a00032
36c4ad2
1a00032
 
 
 
 
 
 
36c4ad2
1a00032
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
36c4ad2
1a00032
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
---
license: apache-2.0
pipeline_tag: audio-to-audio
tags:
- comfy
- comfyui
- audio2audio
- audio-to-audio
- audio-upscale
- audiosuperresolution
---
# Custom Node for ComfyUI

https://github.com/Saganaki22/ComfyUI-NovaSR

![image](https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/43tQTMGNSgD3ABnabat44.png)

## NovaSR: Pushing the Limits of Extreme Efficiency in Audio Super-Resolution

This is the model for NovaSR, a tiny 50kb audio upsampling model that upscales muffled 16khz audio into clear and crisp 48khz audio at speeds from 100-3500x realtime.

### Audio Samples

**Before Processing (16kHz):**
<audio controls>
  <source src="https://huggingface.co/drbaph/NovaSR/resolve/main/samples/speech_sample_before.mp3" type="audio/mpeg">
  Your browser does not support the audio element.
</audio>

**After Processing (48kHz):**
<audio controls>
  <source src="https://huggingface.co/drbaph/NovaSR/resolve/main/samples/speech_sample_after.mp3" type="audio/mpeg">
  Your browser does not support the audio element.
</audio>

![ComfyUI_temp_tepqs_00001_](https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/TK76Ku0FPo_n9gm87IGGt.png)

### Details

* **Model Size:** 52kb for pytorch version
* **Input Rate:** 16kHz
* **Output Rate:** 48kHz
* **Inference Speed:** 300-3500x realtime depending on gpu
* **Mono**

## Comparisons

Comparisons were done on A100 gpu. Higher realtime means faster processing speeds.
Comparison on CPU are coming soon.

| Model         | Speed (Real-Time) | Model Size |
| :------------ | :---------------- | :--------- |
| **NovaSR** | **3600x realtime** | **~52 KB** |
| FlowHigh      | 20x realtime        | ~450 MB     |
| FlashSR       | 14x realtime        | ~1000 MB     |
| AudioSR       | 0.6x realtime    | ~6000 MB     |

## Usage

Please check out the github repo for usage: https://github.com/Saganaki22/ComfyUI-NovaSR 

Original Repo: https://github.com/ysharma3501/NovaSR

If you find the model/code helpful, stars or likes would be appreciated.

Thank you.