File size: 2,452 Bytes
626b70e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2d1fc12
626b70e
2d1fc12
626b70e
2d1fc12
 
 
 
626b70e
 
 
 
 
 
 
 
 
 
2d1fc12
 
626b70e
 
 
 
 
 
 
 
 
 
a32f63a
626b70e
 
 
 
 
 
a32f63a
626b70e
 
 
 
 
 
 
 
 
2d1fc12
626b70e
2d1fc12
626b70e
 
 
2d1fc12
 
626b70e
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
---
language:
- zh
- en
license: apache-2.0
library_name: mlx
pipeline_tag: text-to-speech
base_model: OpenMOSS-Team/MOSS-TTS-Local-Transformer
base_model_relation: quantized
tags:
- mlx
- tts
- speech
- voice-cloning
- apple-silicon
- quantized
- 8bit
---

# MOSS TTS Local Transformer — MLX 8-bit

This repository contains an MLX-native int8 conversion of MOSS TTS Local Transformer for single-speaker text-to-speech on Apple Silicon.

> Note
> This repo is a community mirror of the canonical MLX conversion maintained by
> [AppAutomaton](https://github.com/appautomaton) at
> [`appautomaton/openmoss-tts-local-mlx`](https://huggingface.co/appautomaton/openmoss-tts-local-mlx).

## Variants

| Path | Precision |
| --- | --- |
| `mlx-int8/` | int8 quantized weights |

## Model Details

- Developed by: AppAutomaton
- Shared by: `mlx-community`
- Original MLX repo: [`appautomaton/openmoss-tts-local-mlx`](https://huggingface.co/appautomaton/openmoss-tts-local-mlx)
- Upstream model: [`OpenMOSS-Team/MOSS-TTS-Local-Transformer`](https://huggingface.co/OpenMOSS-Team/MOSS-TTS-Local-Transformer)
- Task: single-speaker text-to-speech and voice cloning
- Runtime: MLX on Apple Silicon

## How to Get Started

Command-line generation with [`mlx-speech`](https://github.com/appautomaton/mlx-speech):

**Generate speech:**
```bash
python scripts/generate/moss_local.py \
  --text "Hello, this is a test." \
  --output outputs/out.wav
```

**Clone a voice:**
```bash
python scripts/generate/moss_local.py \
  --mode clone \
  --text "This is a cloned voice." \
  --reference-audio reference.wav \
  --output outputs/clone.wav
```

## Notes

- This repo contains the quantized MLX runtime artifact only.
- The conversion keeps the original local TTS architecture and remaps weights explicitly for MLX inference.
- The default runtime path uses `W8Abf16` mixed precision with global and local KV cache enabled.
- This mirror is a duplicated repo, not an automatically synchronized namespace mirror.

## Links

- Canonical MLX repo: [`appautomaton/openmoss-tts-local-mlx`](https://huggingface.co/appautomaton/openmoss-tts-local-mlx)
- Source code: [`mlx-speech`](https://github.com/appautomaton/mlx-speech)
- More examples: [AppAutomaton](https://github.com/appautomaton)

## License

Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-TTS-Local-Transformer`](https://huggingface.co/OpenMOSS-Team/MOSS-TTS-Local-Transformer).