| --- |
| language: |
| - zh |
| - en |
| license: apache-2.0 |
| library_name: mlx |
| pipeline_tag: feature-extraction |
| base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer |
| base_model_relation: quantized |
| tags: |
| - mlx |
| - audio |
| - speech |
| - codec |
| - tokenizer |
| - apple-silicon |
| - quantized |
| - 8bit |
| --- |
| |
| # OpenMOSS Audio Tokenizer — MLX 8-bit |
|
|
| This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon. |
|
|
| It is a supporting model that encodes and decodes audio tokens for the OpenMOSS TTS family. It is not a standalone speech generation model. |
|
|
| ## Variants |
|
|
| | Path | Precision | |
| | --- | --- | |
| | `mlx-int8/` | int8 quantized weights | |
|
|
| ## Model Details |
|
|
| - Developed by: AppAutomaton |
| - Shared by: AppAutomaton on Hugging Face |
| - Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer) |
| - Task: audio tokenization and codec decoding |
| - Runtime: MLX on Apple Silicon |
|
|
| ## How to Get Started |
|
|
| Load it directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech): |
|
|
| ```python |
| from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel |
| |
| model = MossAudioTokenizerModel.from_path("mlx-int8") |
| ``` |
|
|
| The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly. |
|
|
| ```bash |
| python scripts/generate/moss_local.py \ |
| --text "Hello from mlx-speech." \ |
| --output outputs/out.wav |
| ``` |
|
|
| ## Notes |
|
|
| - This repo contains the quantized MLX runtime artifact only. |
| - The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference. |
| - The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in this repo. |
|
|
| ## Links |
|
|
| - Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech) |
| - More examples: [AppAutomaton](https://github.com/appautomaton) |
|
|
| ## License |
|
|
| Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer). |
|
|