Cocktail-Fork-MRX — paper variant (MLX)

Apple MLX port of MERL's MRX (Multi-Resolution CrossNet) — separates a soundtrack mixture into music, speech, and sound effects (sfx).

This variant uses the paper_ checkpoint: trained with the scale-invariant SNR loss, reproducing the results from the ICASSP 2022 paper. Use this for benchmark comparability with the original publication.

Other variants: Cocktail-Fork-MRX (default, SNR loss) · -adapted-loudness · -adapted-eq (cinematic-tuned).

Usage

pip install git+https://github.com/xocialize/cocktail-fork-mlx
cocktail-fork-mlx --audio-path soundtrack.wav --out-dir ./out \
    --weights mlx-community/Cocktail-Fork-MRX-paper

~30.6M params, fp32 (122 MB), 44.1 kHz. MIT, © MERL for the original model/weights.

Downloads last month
16
Safetensors
Model size
30.6M params
Tensor type
F32
·
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including mlx-community/Cocktail-Fork-MRX-paper