MossFormer2-SE-fp16 / README.md
starkdmi's picture
Upload folder using huggingface_hub
dd04b1b verified
metadata
license: apache-2.0
library_name: mlx
tags:
  - speech-enhancement
  - audio
  - mlx

MossFormer2 SE fp16 (MLX)

48kHz speech enhancement model converted to MLX format (fp16).

Original Model

alibabasglab/MossFormer2_SE_48K

Usage

from mlx_audio.sts.models.mossformer2_se import MossFormer2SEModel

model = MossFormer2SEModel.from_pretrained("starkdmi/MossFormer2-SE-fp16")
enhanced = model.enhance("noisy.wav")

Precision Variants

Performance

~30x real-time on Apple M4