---
license: apache-2.0
library_name: mlx
tags:
- speech-enhancement
- audio
- mlx
---
# MossFormer2 SE (MLX)

48 kHz speech enhancement model converted to MLX format.

## Original Model

[alibabasglab/MossFormer2_SE_48K](https://huggingface.co/alibabasglab/MossFormer2_SE_48K)

## Usage

```python
from mlx_audio.sts.models.mossformer2_se import MossFormer2SEModel

model = MossFormer2SEModel.from_pretrained("starkdmi/MossFormer2-SE")
enhanced = model.enhance("noisy.wav")
```
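
Because the model targets 48 kHz audio, it can be useful to check an input file's sample rate before enhancing it. The sketch below uses only Python's standard `wave` module, so it works for PCM WAV files only. The helper names `wav_sample_rate` and `is_48k` are hypothetical and not part of `mlx_audio`; whether `enhance` resamples other rates on its own is not stated here, so treat this as an optional pre-check.

```python
import wave

# Assumption: 48 kHz is the expected input rate, taken from the model description.
EXPECTED_RATE = 48_000


def wav_sample_rate(path: str) -> int:
    """Return the sample rate in Hz of a PCM WAV file."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def is_48k(path: str) -> bool:
    """Return True if the file's sample rate matches the expected 48 kHz."""
    return wav_sample_rate(path) == EXPECTED_RATE
```

For example, `is_48k("noisy.wav")` could gate a resampling step before the file is passed to the model.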
## Precision Variants

- [MossFormer2-SE](https://huggingface.co/starkdmi/MossFormer2-SE) (fp32, 211 MB)
- [MossFormer2-SE-fp16](https://huggingface.co/starkdmi/MossFormer2-SE-fp16) (fp16, 106 MB)
- [MossFormer2-SE-8bit](https://huggingface.co/starkdmi/MossFormer2-SE-8bit) (int8, 86 MB)
- [MossFormer2-SE-6bit](https://huggingface.co/starkdmi/MossFormer2-SE-6bit) (int6, 75 MB)
- [MossFormer2-SE-4bit](https://huggingface.co/starkdmi/MossFormer2-SE-4bit) (int4, 64 MB)
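
One way to choose among the variants listed above is by download size. The sketch below encodes the repository ids and sizes from the list and returns the largest variant that fits a size budget. `pick_variant` is a hypothetical helper, and it assumes that a larger file (higher precision) is preferable whenever it fits, which may not hold for every use case.

```python
# Repository ids, precisions, and sizes in MB, copied from the variant list.
VARIANTS = [
    ("starkdmi/MossFormer2-SE", "fp32", 211),
    ("starkdmi/MossFormer2-SE-fp16", "fp16", 106),
    ("starkdmi/MossFormer2-SE-8bit", "int8", 86),
    ("starkdmi/MossFormer2-SE-6bit", "int6", 75),
    ("starkdmi/MossFormer2-SE-4bit", "int4", 64),
]


def pick_variant(max_size_mb: float) -> str:
    """Return the repo id of the largest listed variant within max_size_mb."""
    fitting = [variant for variant in VARIANTS if variant[2] <= max_size_mb]
    if not fitting:
        raise ValueError(f"no variant fits within {max_size_mb} MB")
    # Assumption: prefer the largest (highest-precision) variant that fits.
    return max(fitting, key=lambda variant: variant[2])[0]
```

The returned id can then be passed to `MossFormer2SEModel.from_pretrained`, for example `pick_variant(100)` for a budget of 100 MB.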
## Performance

Approximately 30x faster than real time on an Apple M4.
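
To make the real-time figure concrete, the arithmetic can be sketched as below. `estimated_seconds` is a hypothetical helper, and the default factor of 30 is the approximate number quoted above, so actual timings will vary with hardware, variant, and clip length.

```python
def estimated_seconds(audio_seconds: float, realtime_factor: float = 30.0) -> float:
    """Estimate processing time for a clip, given a real-time speedup factor."""
    if realtime_factor <= 0:
        raise ValueError("realtime_factor must be positive")
    # A factor of 30 means 30 seconds of audio are processed per second.
    return audio_seconds / realtime_factor
```

Under this estimate, a 60-second clip would take about 2 seconds and a 10-minute clip about 20 seconds.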