Cocktail-Fork-MRX — adapted-loudness variant (MLX)

Apple MLX port of MERL's MRX (Multi-Resolution CrossNet) — separates a soundtrack mixture into music, speech, and sound effects (sfx).

This variant uses the adapted_loudness_ checkpoint: adapted with loudness normalization for better alignment with real cinematic/movie stems. Try this (and -adapted-eq) on real soundtrack content.

Other variants: Cocktail-Fork-MRX (default) · -paper (ICASSP reproduction) · -adapted-eq.

Usage

pip install git+https://github.com/xocialize/cocktail-fork-mlx
cocktail-fork-mlx --audio-path soundtrack.wav --out-dir ./out \
    --weights mlx-community/Cocktail-Fork-MRX-adapted-loudness

~30.6M params, fp32 (122 MB), 44.1 kHz. MIT, © MERL for the original model/weights.

Downloads last month
9
Safetensors
Model size
30.6M params
Tensor type
F32
·
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including mlx-community/Cocktail-Fork-MRX-adapted-loudness