---
license: apache-2.0
language:
- en
- zh
- ja
- ko
- fr
- de
- es
tags:
- speech
- asr
- mlx
- qwen3
library_name: mlx
base_model: Qwen/Qwen3-ASR-1.7B
---

# Qwen3 ASR 1.7B — MLX 8-bit

An 8-bit quantized MLX conversion of [Qwen/Qwen3-ASR-1.7B](https://huggingface.co/Qwen/Qwen3-ASR-1.7B) for inference on Apple Silicon.

## Usage

Load the model through the `Qwen3ASR` module of [qwen3-asr-swift](https://github.com/AufklarerStudios/qwen3-asr-swift):

```swift
let model = try await Qwen3ASRModel.fromPretrained(
    modelId: "aufklarer/Qwen3-ASR-1.7B-MLX-8bit"
)
let text = model.transcribe(audio: samples, sampleRate: 16000)
```

From the command line:

```bash
audio transcribe --model large audio.wav
```
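
The Swift call above passes `samples` together with `sampleRate: 16000`, but this card does not say how `samples` is produced. The following is a minimal sketch of one way to prepare it from 16-bit PCM data, assuming the model expects mono `Float` samples in the range [-1, 1]. That input format is an assumption, and the helper names `pcm16ToFloat` and `stereoToMono` are hypothetical, not part of qwen3-asr-swift; check the library's documentation for the actual requirements.

```swift
/// Converts signed 16-bit PCM samples to Float values in [-1, 1).
/// Hypothetical helper, not part of qwen3-asr-swift.
func pcm16ToFloat(_ pcm: [Int16]) -> [Float] {
    // Int16 spans -32768...32767, so dividing by 32768 maps into [-1, 1).
    return pcm.map { Float($0) / 32768.0 }
}

/// Averages interleaved stereo frames (L, R, L, R, ...) into a mono signal.
/// A trailing unpaired sample, if any, is dropped.
/// Hypothetical helper, not part of qwen3-asr-swift.
func stereoToMono(_ interleaved: [Float]) -> [Float] {
    return stride(from: 0, to: interleaved.count - 1, by: 2).map { i in
        (interleaved[i] + interleaved[i + 1]) / 2
    }
}

// Example: four interleaved stereo samples become two mono samples.
let pcm: [Int16] = [0, 16384, -32768, 16384]
let samples = stereoToMono(pcm16ToFloat(pcm))
```

This sketch does not resample. If the source audio is not already at 16 kHz, it would need to be resampled first (for example with `AVAudioConverter`), or the true rate passed as `sampleRate` if the library supports that.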

## Model Details

- **Architecture**: Qwen3-ASR encoder-decoder (Whisper-style audio encoder + Qwen3 text decoder)
- **Parameters**: 1.7B
- **Quantization**: 8-bit (MLX)
- **Size**: ~2.3 GB
- **Languages**: Multilingual (EN, ZH, JA, KO, FR, DE, ES, and more)