nubeaudio-mdx — MDX23C InstVoc HQ (Int8 CoreML)

CoreML-converted, Int8-quantized version of MDX23C-8KFFT-InstVoc_HQ.ckpt for on-device vocal/instrumental separation in NubeAudio — an iOS music player.

The .mlpackage runs the mask-prediction core only. STFT/iSTFT are computed in Swift with Accelerate/vDSP so the model stays portable and the spectrogram pipeline is debuggable end-to-end.

File

File Size Format
mdx23c_instvoc_hq.mlpackage.zip ~40 MB zipped CoreML mlprogram, weights Int8

Unzipped: mdx23c_instvoc_hq.mlpackage ≈ 107 MB.

I/O contract

Input spectrogramMLMultiArray<Float32>, shape [1, 4, 4096, T]

  • 4 = 2 audio channels × [real, imag] interleaved
  • 4096 = dim_f (truncated from n_fft/2 + 1 = 4097)
  • T = time frames; flexible (RangeDim), nominal 256 for a 5.92 s chunk

Output stemsMLMultiArray<Float32>, shape [1, 2, 4, 4096, T]

  • Outer 2 = [Vocals, Instrumental]
  • Inner 4, 4096, T = same layout as input — the model returns time-domain-equivalent complex spectrograms ready for iSTFT

Spectrogram parameters (must match exactly)

Encoded in user_defined_metadata:

Param Value
sample_rate 44100
n_fft 8192
hop_length 1024
dim_f 4096
chunk_size 261120 samples (~5.92 s)
window Hann periodic
instruments Vocals,Instrumental

Inputs at other sample rates must be resampled to 44.1 kHz before STFT.

Conversion source

  • Architecture: KUIELab TFC-TDF v3 / MDX23C — code from ZFTurbo/Music-Source-Separation-Training.
  • Original checkpoint: MDX23C-8KFFT-InstVoc_HQ.ckpt (trained by ZFTurbo, distributed via UVR).
  • Conversion: torch.jit.trace on a wrapper that exposes encoder→decoder→final_conv (skipping STFT/iSTFT), then coremltools.convert(convert_to="mlprogram") + linear_quantize_weights(mode="linear_symmetric", dtype="int8").
  • Verified max_abs_diff(traced, eager) == 0.0 before quantization.

Credits

  • Model: ZFTurbo (architecture and weights), KUIELab (TFC-TDF v3).
  • UVR project: Anjok07.
  • CoreML packaging: NubeAudio.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support