Automatic Speech Recognition
Core ML
hebrew
qwen3-asr
apple-silicon
on-device
alandotcom commited on
Commit
72749ba
·
1 Parent(s): f8cf637

Add conversion scripts link and reproduce instructions

Browse files
Files changed (1) hide show
  1. README.md +12 -1
README.md CHANGED
@@ -101,7 +101,18 @@ To use:
101
 
102
  ## Conversion
103
 
104
- Converted from PyTorch safetensors to CoreML using scripts forked from [FluidInference/mobius](https://github.com/FluidInference/mobius/tree/main/models/stt/qwen3-asr-0.6b/coreml), with dimensions updated for the 1.7B architecture.
 
 
 
 
 
 
 
 
 
 
 
105
 
106
  Conversion pipeline:
107
  1. Audio encoder: traced with coremltools, FP16 precision
 
101
 
102
  ## Conversion
103
 
104
+ Conversion scripts are available at [alandotcom/caspi-hebrew-asr](https://github.com/alandotcom/caspi-hebrew-asr), forked from [FluidInference/mobius](https://github.com/FluidInference/mobius/tree/main/models/stt/qwen3-asr-0.6b/coreml) with dimensions updated for the 1.7B architecture.
105
+
106
+ To reproduce:
107
+ ```bash
108
+ git clone https://github.com/alandotcom/caspi-hebrew-asr.git
109
+ cd caspi-hebrew-asr/conversion
110
+ uv sync
111
+ uv run python convert-qwen3-asr.py # full f32 conversion
112
+ uv run python convert_decoder_fused.py # fused stateful decoder
113
+ uv run python extract_embeddings.py # embeddings + vocab
114
+ uv run python quantize_model.py input.mlpackage output.mlpackage --dtype int8 # quantize
115
+ ```
116
 
117
  Conversion pipeline:
118
  1. Audio encoder: traced with coremltools, FP16 precision