---
language:
- zh
- en
license: apache-2.0
library_name: mlx
pipeline_tag: feature-extraction
base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer
base_model_relation: quantized
tags:
- mlx
- audio
- speech
- codec
- tokenizer
- apple-silicon
- quantized
- 8bit
---

# OpenMOSS Audio Tokenizer - MLX 8-bit

This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon.

It is a supporting model that encodes and decodes audio tokens for the OpenMOSS TTS family. It is not a standalone speech generation model.

## Variants

| Path | Precision |
| --- | --- |
| `mlx-int8/` | int8 quantized weights |

## Model Details

- Developed by: AppAutomaton
- Shared by: AppAutomaton on Hugging Face
- Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer)
- Task: audio tokenization and codec decoding
- Runtime: MLX on Apple Silicon

## How to Get Started

Load it directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech):

```python
from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel

model = MossAudioTokenizerModel.from_path("mlx-int8")
```

The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly.

```bash
python scripts/generate/moss_local.py \
  --text "Hello from mlx-speech." \
  --output outputs/out.wav
```

## Notes

- This repo contains the quantized MLX runtime artifact only.
- The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference.
- The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in this repo.

## Links

- Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech)
- More examples: [AppAutomaton](https://github.com/appautomaton)

## License

Apache 2.0, following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer).
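
## Example: Checking the Variant Path Before Loading

The direct-loading snippet under How to Get Started passes the relative path `mlx-int8`, which only resolves when the working directory is the root of a local copy of this repo. The sketch below shows one way to resolve and check that directory first. The helpers `resolve_variant_dir` and `load_tokenizer` are hypothetical names introduced here for illustration and are not part of `mlx-speech`. The only library call used is `MossAudioTokenizerModel.from_path`, with a string argument as shown above.

```python
from pathlib import Path


def resolve_variant_dir(repo_root, variant="mlx-int8"):
    """Return the variant directory inside a local copy of this repo.

    Hypothetical helper, not part of mlx-speech. Raises FileNotFoundError
    if the directory is missing, so the problem surfaces before loading.
    """
    path = Path(repo_root) / variant
    if not path.is_dir():
        raise FileNotFoundError(f"variant directory not found: {path}")
    return path


def load_tokenizer(repo_root="."):
    """Load the int8 tokenizer from a checked path (hypothetical wrapper)."""
    # Imported lazily so the path check above works without mlx-speech installed.
    from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel

    # from_path is called with a string path, as in the snippet above.
    return MossAudioTokenizerModel.from_path(str(resolve_variant_dir(repo_root)))
```

Calling `load_tokenizer("/path/to/local/copy")` from any working directory should then behave like the direct snippet, assuming the local copy contains the `mlx-int8/` directory listed under Variants.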