Audio-to-Audio
Diffusers
audio
super-resolution
audio-upscaling
comfyui
audio-sr
audiosr
versatle-audio-super-resolution
Instructions to use drbaph/AudioSR with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use drbaph/AudioSR with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("drbaph/AudioSR", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -15,6 +15,12 @@ library_name: diffusers
|
|
| 15 |
|
| 16 |
Pre-trained AudioSR (Versatile Audio Super Resolution) models for use with [ComfyUI-AudioSR](https://github.com/Saganaki22/ComfyUI-VASR) custom node.
|
| 17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
## Models
|
| 19 |
|
| 20 |
### audiosr_basic_fp32.safetensors
|
|
@@ -58,9 +64,6 @@ AudioSR upscales low-quality audio to high-quality 48kHz output using latent dif
|
|
| 58 |
- Reduces compression artifacts
|
| 59 |
- Adds clarity and detail
|
| 60 |
|
| 61 |
-
|
| 62 |
-

|
| 63 |
-
|
| 64 |
## Model Info
|
| 65 |
|
| 66 |
Based on [AudioSR: Versatile Audio Super-Resolution](https://arxiv.org/abs/2309.07314) by Haohe Liu et al.
|
|
|
|
| 15 |
|
| 16 |
Pre-trained AudioSR (Versatile Audio Super Resolution) models for use with [ComfyUI-AudioSR](https://github.com/Saganaki22/ComfyUI-VASR) custom node.
|
| 17 |
|
| 18 |
+
<audio controls src="https://huggingface.co/drbaph/AudioSR/resolve/main/samples/speech_up_4.wav"></audio>
|
| 19 |
+
<audio controls src="https://huggingface.co/drbaph/AudioSR/resolve/main/samples/speech_audiosr_4.wav"></audio>
|
| 20 |
+
|
| 21 |
+

|
| 22 |
+
|
| 23 |
+
|
| 24 |
## Models
|
| 25 |
|
| 26 |
### audiosr_basic_fp32.safetensors
|
|
|
|
| 64 |
- Reduces compression artifacts
|
| 65 |
- Adds clarity and detail
|
| 66 |
|
|
|
|
|
|
|
|
|
|
| 67 |
## Model Info
|
| 68 |
|
| 69 |
Based on [AudioSR: Versatile Audio Super-Resolution](https://arxiv.org/abs/2309.07314) by Haohe Liu et al.
|