Commit ·
72749ba
1
Parent(s): f8cf637
Add conversion scripts link and reproduce instructions
Browse files
README.md
CHANGED
|
@@ -101,7 +101,18 @@ To use:
|
|
| 101 |
|
| 102 |
## Conversion
|
| 103 |
|
| 104 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 105 |
|
| 106 |
Conversion pipeline:
|
| 107 |
1. Audio encoder: traced with coremltools, FP16 precision
|
|
|
|
| 101 |
|
| 102 |
## Conversion
|
| 103 |
|
| 104 |
+
Conversion scripts are available at [alandotcom/caspi-hebrew-asr](https://github.com/alandotcom/caspi-hebrew-asr), forked from [FluidInference/mobius](https://github.com/FluidInference/mobius/tree/main/models/stt/qwen3-asr-0.6b/coreml) with dimensions updated for the 1.7B architecture.
|
| 105 |
+
|
| 106 |
+
To reproduce:
|
| 107 |
+
```bash
|
| 108 |
+
git clone https://github.com/alandotcom/caspi-hebrew-asr.git
|
| 109 |
+
cd caspi-hebrew-asr/conversion
|
| 110 |
+
uv sync
|
| 111 |
+
uv run python convert-qwen3-asr.py # full f32 conversion
|
| 112 |
+
uv run python convert_decoder_fused.py # fused stateful decoder
|
| 113 |
+
uv run python extract_embeddings.py # embeddings + vocab
|
| 114 |
+
uv run python quantize_model.py input.mlpackage output.mlpackage --dtype int8 # quantize
|
| 115 |
+
```
|
| 116 |
|
| 117 |
Conversion pipeline:
|
| 118 |
1. Audio encoder: traced with coremltools, FP16 precision
|