	---
	language:
	- zh
	- en
	license: apache-2.0
	library_name: mlx
	pipeline_tag: feature-extraction
	base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer
	base_model_relation: quantized
	tags:
	- mlx
	- audio
	- speech
	- codec
	- tokenizer
	- apple-silicon
	- quantized
	- 8bit
	---

	# OpenMOSS Audio Tokenizer — MLX 8-bit

	This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon.

	It is a supporting model that encodes audio into tokens and decodes tokens back into audio for the OpenMOSS TTS family. It is not a standalone speech generation model.

	## Variants

	| Path | Precision |
	| --- | --- |
	| `mlx-int8/` | int8 quantized weights |

	## Model Details

	- Developed by: AppAutomaton
	- Shared by: AppAutomaton on Hugging Face
	- Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer)
	- Task: audio tokenization and codec decoding
	- Runtime: MLX on Apple Silicon

	## How to Get Started

	To load the tokenizer directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech), pass the path to a local copy of the `mlx-int8/` directory:

	```python
	from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel

	model = MossAudioTokenizerModel.from_path("mlx-int8")
	```
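
The snippet above expects `mlx-int8/` to exist locally already. One way to fetch it is sketched below, under several assumptions: the Hub repo id is taken from this page's repository name, `huggingface_hub` is used for the download (this README does not prescribe a download tool), and `variant_path`, `download_variant`, and `load_tokenizer` are hypothetical helper names, not part of `mlx-speech`.

```python
from pathlib import Path

# Assumption: repo id inferred from this page's repository name.
REPO_ID = "appautomaton/openmoss-audio-tokenizer-mlx"
# Variant directory listed in the Variants table.
VARIANT = "mlx-int8"


def variant_path(snapshot_root, variant=VARIANT):
    """Return the path of a variant directory inside a downloaded snapshot."""
    return Path(snapshot_root) / variant


def download_variant(repo_id=REPO_ID, variant=VARIANT):
    """Download only one variant directory and return its local path."""
    # Imported lazily so variant_path() works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    # allow_patterns limits the download to files under the variant directory.
    root = snapshot_download(repo_id=repo_id, allow_patterns=[f"{variant}/*"])
    return variant_path(root, variant)


def load_tokenizer(model_dir):
    """Load the tokenizer with the same call shown in this README."""
    from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel

    return MossAudioTokenizerModel.from_path(str(model_dir))


# Example (requires network access, huggingface_hub, and mlx-speech):
# model = load_tokenizer(download_variant())
```

Restricting the download to `mlx-int8/*` keeps the local cache limited to the one variant this repository publishes.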

	The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly.

	```bash
	python scripts/generate/moss_local.py \
	  --text "Hello from mlx-speech." \
	  --output outputs/out.wav
	```
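
Once the script has written `outputs/out.wav`, a quick sanity check of the file can be sketched with Python's standard `wave` module. This assumes the output is an uncompressed PCM WAV file, which this README does not state; `wav_info` is a hypothetical helper, not part of `mlx-speech`.

```python
import wave


def wav_info(path):
    """Return (sample_rate_hz, channels, duration_seconds) for a PCM WAV file."""
    with wave.open(str(path), "rb") as wf:
        rate = wf.getframerate()
        frames = wf.getnframes()
        # Duration is the frame count divided by frames per second.
        return rate, wf.getnchannels(), frames / rate


# Example (after running the generation script above):
# rate, channels, seconds = wav_info("outputs/out.wav")
# print(f"{rate} Hz, {channels} channel(s), {seconds:.2f} s")
```

A zero-length duration or an error from `wave.open` would suggest the generation step did not complete as expected.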

	## Notes

	- This repo contains the quantized MLX runtime artifact only.
	- The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference.
	- The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in [mlx-speech](https://github.com/appautomaton/mlx-speech).

	## Links

	- Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech)
	- More examples: [AppAutomaton](https://github.com/appautomaton)

	## License

	Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer).