Cocktail-Fork MRX (MLX)
Collection
MERL MRX ported to Apple MLX — 3-stem music/speech/sfx soundtrack separation. Numerically exact vs PyTorch. 4 variants. • 4 items • Updated
How to use mlx-community/Cocktail-Fork-MRX-paper with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Cocktail-Fork-MRX-paper mlx-community/Cocktail-Fork-MRX-paper
paper variant (MLX)
Apple MLX port of MERL's MRX (Multi-Resolution CrossNet) — separates a soundtrack mixture into music, speech, and sound effects (sfx).
This variant uses the paper_ checkpoint: trained with the scale-invariant
SNR loss, reproducing the results from the ICASSP 2022 paper. Use this for
benchmark comparability with the original publication.
Other variants: Cocktail-Fork-MRX (default, SNR loss) ·
-adapted-loudness ·
-adapted-eq (cinematic-tuned).
2e-7).pip install git+https://github.com/xocialize/cocktail-fork-mlx
cocktail-fork-mlx --audio-path soundtrack.wav --out-dir ./out \
--weights mlx-community/Cocktail-Fork-MRX-paper
~30.6M params, fp32 (122 MB), 44.1 kHz. MIT, © MERL for the original model/weights.
Quantized