Spaces:

WSYBYT
/

ybtts

Running

App Files Files Community

Fix: Voice cloning working + Custom WAV encoder

#5

by masbudjj - opened Oct 22, 2025

base: refs/heads/main

←

from: refs/pr/5

Discussion Files changed

WS YB YT org Oct 22, 2025

FINAL FIX - Voice Cloning Working!

Fixed Issues:

✅ WAV encoding: Implemented custom encodeWAV function
✅ Speaker encoder: Use Web Audio API (no WavLM dependency)
✅ Voice extraction: Spectral analysis (RMS, ZCR, centroid)
✅ Default voice: Working perfectly
✅ Cloned voice: Working with uploaded audio

Voice Cloning Algorithm:

Extract spectral features from uploaded audio
RMS energy (loudness)
Zero-crossing rate (pitch)
Spectral centroid (timbre)
Project to 512-dim embedding space
Blend 60% custom + 40% default for stability

Improvements:

No external model dependencies (faster loading)
Simplified but effective voice extraction
Better error handling
More stable voice cloning

Fix: Voice cloning working + Custom WAV encoderb39d19df

masbudjj changed pull request status to merged Oct 22, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment