---
language:
- zh
- en
license: apache-2.0
library_name: mlx
pipeline_tag: feature-extraction
base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer
base_model_relation: quantized
tags:
- mlx
- audio
- speech
- codec
- tokenizer
- apple-silicon
- quantized
- 8bit
---

# OpenMOSS Audio Tokenizer — MLX 8-bit

This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon.

It is a supporting model that encodes and decodes audio tokens for the OpenMOSS TTS family. It is not a standalone speech generation model.

## Variants

| Path | Precision |
| --- | --- |
| `mlx-int8/` | int8 quantized weights |
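
As a rough illustration of what 8-bit weight quantization means, the sketch below round-trips one small group of values through a generic affine scheme (integer code, scale, and bias per group). This card does not state which scheme or group size the conversion used, so the function names, the scheme, and the sample values here are all assumptions for illustration, not the actual conversion code.

```python
def quantize_group(values, bits=8):
    """Affine-quantize one group of floats to unsigned integer codes.

    Hypothetical helper: illustrates the general idea only.
    """
    levels = (1 << bits) - 1          # 255 distinct steps for 8-bit codes
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo           # lo serves as the per-group bias


def dequantize_group(codes, scale, bias):
    """Map integer codes back to approximate float values."""
    return [c * scale + bias for c in codes]


# Made-up sample weights, not taken from the model.
weights = [-0.5, -0.1, 0.0, 0.25, 0.5]
codes, scale, bias = quantize_group(weights)
restored = dequantize_group(codes, scale, bias)

# Rounding to the nearest code bounds the error by half a step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Each value is stored as a single byte plus a shared scale and bias per group, which is where the size reduction comes from; the cost is a reconstruction error of at most half a quantization step per weight.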

## Model Details

- Developed by: AppAutomaton
- Shared by: AppAutomaton on Hugging Face
- Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer)
- Task: audio tokenization and codec decoding
- Runtime: MLX on Apple Silicon

## How to Get Started

Load it directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech):

```python
from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel

model = MossAudioTokenizerModel.from_path("mlx-int8")
```
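
The `"mlx-int8"` argument above looks like a relative directory path, so the call presumably depends on the current working directory. The following is a minimal sketch, assuming you have a local copy of this repository, that resolves the variant directory first and fails early with a clear message. `resolve_variant_dir` and `load_tokenizer` are hypothetical helpers, not part of `mlx-speech`, and passing the path as a string to `from_path` is an assumption based on the snippet above.

```python
from pathlib import Path


def resolve_variant_dir(repo_root, variant="mlx-int8"):
    """Return the variant directory inside a local copy of this repo.

    Hypothetical helper: raises if the directory is missing.
    """
    path = Path(repo_root).expanduser() / variant
    if not path.is_dir():
        raise FileNotFoundError(f"variant directory not found: {path}")
    return path


def load_tokenizer(repo_root):
    """Load the tokenizer from a checked local path (hypothetical wrapper)."""
    # Imported lazily so the path check works without mlx-speech installed.
    from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel

    variant_dir = resolve_variant_dir(repo_root)
    # Assumption: from_path accepts a directory path given as a string.
    return MossAudioTokenizerModel.from_path(str(variant_dir))
```

With this wrapper, `load_tokenizer("/path/to/local/copy")` would behave the same from any working directory, provided `mlx-speech` is installed.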

The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly.

```bash
python scripts/generate/moss_local.py \
  --text "Hello from mlx-speech." \
  --output outputs/out.wav
```

## Notes

- This repo contains the quantized MLX runtime artifact only.
- The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference.
- The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in [mlx-speech](https://github.com/appautomaton/mlx-speech).

## Links

- Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech)
- More examples: [AppAutomaton](https://github.com/appautomaton)

## License

Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer).