File size: 1,399 Bytes
dd35153
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
license: mit
language:
  - en
  - multilingual
tags:
  - whisper
  - coreml
  - apple-silicon
  - neural-engine
  - speech-recognition
  - asr
library_name: whisper.cpp
---

# Whisper CoreML Models

Pre-converted CoreML encoder models for [whisper.cpp](https://github.com/ggerganov/whisper.cpp), optimized for Apple Neural Engine (ANE) acceleration.

## Models

| Model | Size | Description |
|-------|------|-------------|
| `ggml-large-v3-turbo-encoder.mlmodelc` | ~1.3 GB | Whisper large-v3-turbo encoder, ANE-optimized |

## Usage

These models are used by [whisper-coreml](https://github.com/sebastian-software/whisper-coreml) for Node.js speech recognition with Neural Engine acceleration.

```bash
npm install whisper-coreml
npx whisper-coreml download
```

## Conversion

Models were converted using whisper.cpp's conversion tools:

```bash
python3 models/convert-whisper-to-coreml.py --model large-v3-turbo --encoder-only True --optimize-ane True
xcrun coremlc compile models/coreml-encoder-large-v3-turbo.mlpackage models/
```

## Performance

On Apple M1 Ultra:
- **14x real-time** transcription speed
- 1 hour of audio in ~4.5 minutes

## License

MIT - Same as whisper.cpp

## Credits

- [OpenAI Whisper](https://github.com/openai/whisper)
- [whisper.cpp](https://github.com/ggerganov/whisper.cpp)
- [Apple Neural Engine Transformers](https://github.com/apple/ml-ane-transformers)