---
language:
- zh
- en
license: apache-2.0
library_name: mlx
pipeline_tag: feature-extraction
base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer
base_model_relation: quantized
tags:
- mlx
- audio
- speech
- codec
- tokenizer
- apple-silicon
- quantized
- 8bit
---
# OpenMOSS Audio Tokenizer — MLX 8-bit
This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon.
It is a supporting model that encodes and decodes audio tokens for the OpenMOSS TTS family. It is not a standalone speech generation model.
## Variants
| Path | Precision |
| --- | --- |
| `mlx-int8/` | int8 quantized weights |
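
As a rough illustration of what "int8 quantized weights" means, the sketch below applies affine quantization to one small group of floats in plain Python. It assumes a per-group scale and offset; the function names, the example values, and the scheme itself are illustrative assumptions, not the conversion code that produced this artifact.

```python
from typing import List, Tuple


def quantize_group(values: List[float], bits: int = 8) -> Tuple[List[int], float, float]:
    """Affine-quantize one group of floats to unsigned `bits`-bit integer codes."""
    levels = (1 << bits) - 1                      # 255 distinct steps for 8 bits
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo                       # lo acts as the group's offset


def dequantize_group(codes: List[int], scale: float, offset: float) -> List[float]:
    """Map integer codes back to approximate float values."""
    return [c * scale + offset for c in codes]


# Hypothetical weights, chosen only to demonstrate the round trip.
weights = [-0.50, -0.25, 0.0, 0.125, 0.75]
codes, scale, offset = quantize_group(weights)
restored = dequantize_group(codes, scale, offset)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Each restored value differs from the original by at most about half a quantization step (`scale / 2`), which is the precision trade-off made in exchange for smaller weights.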
## Model Details
- Developed by: AppAutomaton
- Shared by: AppAutomaton on Hugging Face
- Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer)
- Task: audio tokenization and codec decoding
- Runtime: MLX on Apple Silicon
## How to Get Started
Load it directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech):
```python
from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel
model = MossAudioTokenizerModel.from_path("mlx-int8")
```
The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly.
```bash
python scripts/generate/moss_local.py \
--text "Hello from mlx-speech." \
--output outputs/out.wav
```
## Notes
- This repo contains the quantized MLX runtime artifact only.
- The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference.
- The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in [mlx-speech](https://github.com/appautomaton/mlx-speech).
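
To make the idea of an explicit weight remap concrete, here is a minimal sketch in plain Python. The rename rules and key names are hypothetical and are not taken from the actual conversion; the point is only that every key must match a declared rule, so an unexpected key fails loudly instead of being silently dropped.

```python
from typing import Dict, List, Tuple

# Hypothetical (old_prefix, new_prefix) rules. The real key names used by the
# upstream checkpoint and by the MLX model are not documented in this card.
RENAME_RULES: List[Tuple[str, str]] = [
    ("encoder.layers.", "encoder.blocks."),
    ("decoder.layers.", "decoder.blocks."),
]


def remap_keys(weights: Dict[str, object]) -> Dict[str, object]:
    """Rename every weight key using the first matching rule, or raise."""
    remapped: Dict[str, object] = {}
    for key, value in weights.items():
        for old_prefix, new_prefix in RENAME_RULES:
            if key.startswith(old_prefix):
                remapped[new_prefix + key[len(old_prefix):]] = value
                break
        else:
            # No rule matched: fail rather than guess a destination.
            raise KeyError(f"no remapping rule for weight {key!r}")
    return remapped


# Toy state dict standing in for real tensors.
example = {"encoder.layers.0.weight": [0.1], "decoder.layers.3.bias": [0.2]}
converted = remap_keys(example)
```

Raising on unmatched keys is a design choice in this sketch: it keeps the mapping auditable, at the cost of needing a rule for every parameter.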
## Links
- Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech)
- More examples: [AppAutomaton](https://github.com/appautomaton)
## License
Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer).