metadata
library_name: transformers
base_model:
- Plachta/Seed-VC
pipeline_tag: audio-to-audio
tags:
- voice-conversion
- seed-vc
- audio
Seed-VC seed-uvit-whisper-base Finetune
Introduction
This model is a fine-tuned version of Plachta/Seed-VC's seed-uvit-whisper-base with 168 hours of clean singing audios in korean.
It demonstrates significant improvements in naturalness and voice quality.
🎯 Reference Audio
🎧 Audio Comparison
| Model | ===================Converted Singing Audio (25 steps)=================== |
|---|---|
| Original | |
| Base | |
| Finetune |