FluffyVoices: ONNX models
ONNX model pack for FluffyVoices, a real-time AI voice changer for Windows (zero-shot voice cloning).
Derived from the X-VC checkpoint by Jerrister Zheng, fine-tuned on Polish speech (Common Voice PL). These weights inherit the licenses of those upstream projects; the FluffyVoices app code itself is MIT.
Usage
Download this whole repository and place it as a models/ folder next to
FluffyVoices.exe (see the app Releases page).
Contents
- sac_encoder.onnx / sac_decoder.onnx: acoustic codec (dynamic time axis)
- semantic_tokenizer*.onnx: GLM-4-Voice-based semantic tokenizer (fixed windows: 480/1200/1760/2400 ms)
- converter*.onnx: X-VC voice converter (fixed windows, dynamic reference axis)
- speaker_encoder.onnx: ERes2Net speaker embedding
- pipeline_config.json + assets/: DSP contract (mel filterbanks, windows)
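To confirm the download landed correctly, a small check along these lines may help. It is a minimal sketch, not part of FluffyVoices: the helper name `missing_entries` is hypothetical, and it assumes every file listed above sits directly inside models/ rather than in a subfolder.

```python
from fnmatch import fnmatch
from pathlib import Path

# Patterns copied from the Contents list. The check itself is an
# illustrative helper and is not shipped with FluffyVoices.
EXPECTED_PATTERNS = [
    "sac_encoder.onnx",
    "sac_decoder.onnx",
    "semantic_tokenizer*.onnx",
    "converter*.onnx",
    "speaker_encoder.onnx",
    "pipeline_config.json",
    "assets",
]


def missing_entries(models_dir):
    """Return the expected patterns that match nothing directly inside models_dir."""
    root = Path(models_dir)
    if not root.is_dir():
        # No folder at all: everything is missing.
        return list(EXPECTED_PATTERNS)
    names = [entry.name for entry in root.iterdir()]
    return [
        pattern
        for pattern in EXPECTED_PATTERNS
        if not any(fnmatch(name, pattern) for name in names)
    ]


if __name__ == "__main__":
    # Assumption: the script is run from the folder that contains FluffyVoices.exe.
    missing = missing_entries("models")
    print("models/ looks complete" if not missing else f"missing: {missing}")
```

An empty result only means that each pattern matched at least one entry. It does not verify that all window-size variants of the tokenizer and converter are present, nor that the files are valid ONNX models.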