CoreML Speech Models
Collection
Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. โข 22 items โข Updated โข 3
CoreML conversion of Qwen/Qwen3-TTS-0.6B for Apple Neural Engine acceleration. Includes the codec LM, Mimi decoder, and code embedder as separate CoreML models.
| Model | Description |
|---|---|
CodeDecoder.mlmodelc |
Mimi audio codec decoder |
CodeEmbedder.mlmodelc |
Token embedding layer |
Additional .mlmodelc |
Transformer layers for the codec LM |
Used by speech-swift Qwen3TTSCoreML module:
let model = try await Qwen3TTSCoreMLModel.fromPretrained()
let audio = model.synthesize(text: "Hello world", language: "english")