VoiceIsolate-Models

Quantized ONNX models used by VoiceIsolate Pro for on-device, GPU-accelerated voice isolation and audio enhancement.

All inference runs client-side in the browser via ONNX Runtime Web, using WebGPU with a WASM fallback. No server is required.
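
As a rough illustration of the WebGPU-with-WASM-fallback arrangement, the sketch below picks an execution-provider list depending on whether the browser exposes `navigator.gpu`. The helper name `pickExecutionProviders` is hypothetical and is not taken from VoiceIsolate Pro; `'webgpu'` and `'wasm'` are the provider names ONNX Runtime Web uses.

```javascript
// Hypothetical helper (not part of VoiceIsolate Pro): returns ONNX Runtime Web
// execution providers in priority order. WebGPU is listed first only when the
// given navigator-like object exposes a `gpu` property.
function pickExecutionProviders(nav) {
  const hasWebGPU = Boolean(nav && nav.gpu);
  return hasWebGPU ? ['webgpu', 'wasm'] : ['wasm'];
}
```

In a browser, the result could be passed as the `executionProviders` session option, for example `ort.InferenceSession.create(modelBytes, { executionProviders: pickExecutionProviders(navigator) })`.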

Models in this Repository

| File | Description | Size | Source |
|---|---|---|---|
| `demucs_v4_quantized.onnx` | Demucs v4 (HTDemucs), int8-quantized; stem-level voice isolation | ~83 MB | facebookresearch/demucs |
| `bsrnn_vocals.onnx` | BSRNN (Band-Split RNN) vocals separator | ~45 MB | crlandsc/bsrnn |

Usage

These models are fetched automatically by `ml-worker-fetch-cache.js` in VoiceIsolate Pro. After the first download they are cached in IndexedDB and are not re-fetched.

// MODEL_REGISTRY entry in ml-worker-fetch-cache.js
demucs_v4: {
  path: 'models/demucs_v4_quantized.onnx',
  sizeBytes: 87_031_808,
  cdnUrls: ['https://huggingface.co/Joker5514/VoiceIsolate-Models/resolve/main/demucs_v4_quantized.onnx']
},
bsrnn_vocals: {
  path: 'models/bsrnn_vocals.onnx',
  sizeBytes: 3_870_554,
  cdnUrls: ['https://huggingface.co/Joker5514/VoiceIsolate-Models/resolve/main/bsrnn_vocals.onnx']
}
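
The cache-first behaviour described above can be sketched roughly as follows. This is not the actual `ml-worker-fetch-cache.js` code: `loadModelBytes`, `createMemoryCache`, and the `{ get, put }` cache interface are hypothetical names introduced for illustration. In the browser the cache would be backed by IndexedDB; here it is injected so the logic can run anywhere.

```javascript
// Sketch only: a cache-first loader for a MODEL_REGISTRY-style entry.
// `cache` is an assumed async key-value store with get(key) and put(key, value).
// `fetchFn` defaults to the global fetch and can be swapped out for testing.
async function loadModelBytes(entry, cache, fetchFn = fetch) {
  const cached = await cache.get(entry.path);
  if (cached) return cached; // cache hit: no network request

  let lastError = new Error(`no cdnUrls for ${entry.path}`);
  for (const url of entry.cdnUrls) {
    try {
      const res = await fetchFn(url);
      if (!res.ok) throw new Error(`HTTP ${res.status} for ${url}`);
      const bytes = await res.arrayBuffer();
      await cache.put(entry.path, bytes); // store for later loads
      return bytes;
    } catch (err) {
      lastError = err; // fall through to the next URL, if any
    }
  }
  throw lastError;
}

// In-memory stand-in for the IndexedDB-backed cache (an assumption for this sketch).
function createMemoryCache() {
  const store = new Map();
  return {
    get: async (key) => store.get(key),
    put: async (key, value) => { store.set(key, value); },
  };
}
```

Calling `loadModelBytes(entry, cache)` twice with the same cache should trigger at most one network request; the second call returns the stored bytes.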

License

MIT. Model weights inherit the licenses of their respective upstream projects:

Demucs: MIT
BSRNN: MIT
