Update README.md
Browse files
README.md
CHANGED
|
@@ -62,7 +62,7 @@ on-device speech recognition on Apple platforms (iOS/macOS).
|
|
| 62 |
|
| 63 |
## Usage with FluidAudio
|
| 64 |
|
| 65 |
-
```
|
| 66 |
import FluidAudio
|
| 67 |
|
| 68 |
let manager = Qwen3AsrManager()
|
|
@@ -75,6 +75,8 @@ let transcript = try await manager.transcribe(
|
|
| 75 |
maxNewTokens: 512
|
| 76 |
)
|
| 77 |
print(transcript)
|
|
|
|
|
|
|
| 78 |
|
| 79 |
Model Architecture
|
| 80 |
|
|
@@ -82,19 +84,6 @@ Model Architecture
|
|
| 82 |
- Decoder: 28-layer transformer decoder with 1024 hidden size
|
| 83 |
- Tokenizer: Qwen tokenizer with special ASR tokens
|
| 84 |
|
| 85 |
-
Files
|
| 86 |
-
|
| 87 |
-
f32/
|
| 88 |
-
βββ AudioEncoder.mlpackage
|
| 89 |
-
βββ TextDecoder.mlpackage
|
| 90 |
-
βββ config.json
|
| 91 |
-
βββ tokenizer.json
|
| 92 |
-
|
| 93 |
-
int8/
|
| 94 |
-
βββ AudioEncoder.mlpackage
|
| 95 |
-
βββ TextDecoder.mlpackage
|
| 96 |
-
βββ config.json
|
| 97 |
-
βββ tokenizer.json
|
| 98 |
|
| 99 |
License
|
| 100 |
|
|
@@ -114,7 +103,7 @@ Citation
|
|
| 114 |
journal={arXiv preprint arXiv:2601.21337},
|
| 115 |
year={2025}
|
| 116 |
}
|
| 117 |
-
|
| 118 |
For the HuggingFace metadata UI, fill in:
|
| 119 |
- **License**: Apache 2.0
|
| 120 |
- **Base model**: Qwen/Qwen3-ASR-0.6B
|
|
|
|
| 62 |
|
| 63 |
## Usage with FluidAudio
|
| 64 |
|
| 65 |
+
```
|
| 66 |
import FluidAudio
|
| 67 |
|
| 68 |
let manager = Qwen3AsrManager()
|
|
|
|
| 75 |
maxNewTokens: 512
|
| 76 |
)
|
| 77 |
print(transcript)
|
| 78 |
+
```
|
| 79 |
+
|
| 80 |
|
| 81 |
Model Architecture
|
| 82 |
|
|
|
|
| 84 |
- Decoder: 28-layer transformer decoder with 1024 hidden size
|
| 85 |
- Tokenizer: Qwen tokenizer with special ASR tokens
|
| 86 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 87 |
|
| 88 |
License
|
| 89 |
|
|
|
|
| 103 |
journal={arXiv preprint arXiv:2601.21337},
|
| 104 |
year={2025}
|
| 105 |
}
|
| 106 |
+
|
| 107 |
For the HuggingFace metadata UI, fill in:
|
| 108 |
- **License**: Apache 2.0
|
| 109 |
- **Base model**: Qwen/Qwen3-ASR-0.6B
|