tamarher's picture
Sync model card
5d00204 verified
---
language:
- zh
- en
license: apache-2.0
library_name: mlx
pipeline_tag: feature-extraction
base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer
base_model_relation: quantized
tags:
- mlx
- audio
- speech
- codec
- tokenizer
- apple-silicon
- quantized
- 8bit
---
# OpenMOSS Audio Tokenizer — MLX 8-bit
This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon.
It is a supporting model that encodes and decodes audio tokens for the OpenMOSS TTS family. It is not a standalone speech generation model.
## Variants
| Path | Precision |
| --- | --- |
| `mlx-int8/` | int8 quantized weights |
## Model Details
- Developed by: AppAutomaton
- Shared by: AppAutomaton on Hugging Face
- Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer)
- Task: audio tokenization and codec decoding
- Runtime: MLX on Apple Silicon
## How to Get Started
Load it directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech):
```python
from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel
model = MossAudioTokenizerModel.from_path("mlx-int8")
```
The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly.
```bash
python scripts/generate/moss_local.py \
--text "Hello from mlx-speech." \
--output outputs/out.wav
```
## Notes
- This repo contains the quantized MLX runtime artifact only.
- The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference.
- The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in this repo.
## Links
- Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech)
- More examples: [AppAutomaton](https://github.com/appautomaton)
## License
Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer).