File size: 992 Bytes
8744c59
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---
license: apache-2.0
library_name: mlx
tags:
  - speech-enhancement
  - audio
  - mlx
---

# MossFormer2 SE (MLX)

48kHz speech enhancement model converted to MLX format.

## Original Model
[alibabasglab/MossFormer2_SE_48K](https://huggingface.co/alibabasglab/MossFormer2_SE_48K)

## Usage

```python
from mlx_audio.sts.models.mossformer2_se import MossFormer2SEModel

model = MossFormer2SEModel.from_pretrained("starkdmi/MossFormer2-SE")
enhanced = model.enhance("noisy.wav")
```

## Precision Variants
- [MossFormer2-SE](https://huggingface.co/starkdmi/MossFormer2-SE) (fp32, 211MB)
- [MossFormer2-SE-fp16](https://huggingface.co/starkdmi/MossFormer2-SE-fp16) (fp16, 106MB)
- [MossFormer2-SE-8bit](https://huggingface.co/starkdmi/MossFormer2-SE-8bit) (int8, 86MB)
- [MossFormer2-SE-6bit](https://huggingface.co/starkdmi/MossFormer2-SE-6bit) (int6, 75MB)
- [MossFormer2-SE-4bit](https://huggingface.co/starkdmi/MossFormer2-SE-4bit) (int4, 64MB)

## Performance
~30x real-time on Apple M4