MOSS-TTS-Local-Transformer-MLX-8bit

Customize mlx-community model card

2d1fc12 verified 4 days ago

2.59 kB

	---
	language:
	- zh
	- en
	license: apache-2.0
	library_name: mlx
	pipeline_tag: text-to-speech
	base_model: OpenMOSS-Team/MOSS-TTS-Local-Transformer
	base_model_relation: quantized
	tags:
	- mlx
	- tts
	- speech
	- voice-cloning
	- apple-silicon
	- quantized
	- 8bit
	---

	# MOSS TTS Local Transformer — MLX 8-bit

	This repository contains an MLX-native int8 conversion of MOSS TTS Local Transformer for single-speaker text-to-speech on Apple Silicon.

	> Note
	> This repo is a community mirror of the canonical MLX conversion maintained by
	> [AppAutomaton](https://github.com/appautomaton) at
	> [`appautomaton/openmoss-tts-local-mlx`](https://huggingface.co/appautomaton/openmoss-tts-local-mlx).

	## Variants

	\| Path \| Precision \|
	\| --- \| --- \|
	\| `mlx-int8/` \| int8 quantized weights \|

	## Model Details

	- Developed by: AppAutomaton
	- Shared by: `mlx-community`
	- Original MLX repo: [`appautomaton/openmoss-tts-local-mlx`](https://huggingface.co/appautomaton/openmoss-tts-local-mlx)
	- Upstream model: [`OpenMOSS-Team/MOSS-TTS-Local-Transformer`](https://huggingface.co/OpenMOSS-Team/MOSS-TTS-Local-Transformer)
	- Task: single-speaker text-to-speech and voice cloning
	- Runtime: MLX on Apple Silicon

	## How to Get Started

	Command-line generation with [`mlx-speech`](https://github.com/appautomaton/mlx-speech):

	Generate speech:
	```bash
	python scripts/generate_moss_local.py \
	--text "Hello, this is a test." \
	--output outputs/out.wav
	```

	Clone a voice:
	```bash
	python scripts/generate_moss_local.py \
	--mode clone \
	--text "This is a cloned voice." \
	--reference-audio reference.wav \
	--output outputs/clone.wav
	```

	Minimal Python usage:

	```python
	from mlx_speech.generation import MossTTSLocalModel

	model = MossTTSLocalModel.from_path("mlx-int8")
	```

	## Notes

	- This repo contains the quantized MLX runtime artifact only.
	- The conversion keeps the original local TTS architecture and remaps weights explicitly for MLX inference.
	- The default runtime path uses `W8Abf16` mixed precision with global and local KV cache enabled.
	- This mirror is a duplicated repo, not an automatically synchronized namespace mirror.

	## Links

	- Canonical MLX repo: [`appautomaton/openmoss-tts-local-mlx`](https://huggingface.co/appautomaton/openmoss-tts-local-mlx)
	- Source code: [`mlx-speech`](https://github.com/appautomaton/mlx-speech)
	- More examples: [AppAutomaton](https://github.com/appautomaton)

	## License

	Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-TTS-Local-Transformer`](https://huggingface.co/OpenMOSS-Team/MOSS-TTS-Local-Transformer).