alexwengg commited on
Commit
ee0604f
Β·
verified Β·
1 Parent(s): 529b2fa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -15
README.md CHANGED
@@ -62,7 +62,7 @@ on-device speech recognition on Apple platforms (iOS/macOS).
62
 
63
  ## Usage with FluidAudio
64
 
65
- ```swift
66
  import FluidAudio
67
 
68
  let manager = Qwen3AsrManager()
@@ -75,6 +75,8 @@ let transcript = try await manager.transcribe(
75
  maxNewTokens: 512
76
  )
77
  print(transcript)
 
 
78
 
79
  Model Architecture
80
 
@@ -82,19 +84,6 @@ Model Architecture
82
  - Decoder: 28-layer transformer decoder with 1024 hidden size
83
  - Tokenizer: Qwen tokenizer with special ASR tokens
84
 
85
- Files
86
-
87
- f32/
88
- β”œβ”€β”€ AudioEncoder.mlpackage
89
- β”œβ”€β”€ TextDecoder.mlpackage
90
- β”œβ”€β”€ config.json
91
- └── tokenizer.json
92
-
93
- int8/
94
- β”œβ”€β”€ AudioEncoder.mlpackage
95
- β”œβ”€β”€ TextDecoder.mlpackage
96
- β”œβ”€β”€ config.json
97
- └── tokenizer.json
98
 
99
  License
100
 
@@ -114,7 +103,7 @@ Citation
114
  journal={arXiv preprint arXiv:2601.21337},
115
  year={2025}
116
  }
117
-
118
  For the HuggingFace metadata UI, fill in:
119
  - **License**: Apache 2.0
120
  - **Base model**: Qwen/Qwen3-ASR-0.6B
 
62
 
63
  ## Usage with FluidAudio
64
 
65
+ ```
66
  import FluidAudio
67
 
68
  let manager = Qwen3AsrManager()
 
75
  maxNewTokens: 512
76
  )
77
  print(transcript)
78
+ ```
79
+
80
 
81
  Model Architecture
82
 
 
84
  - Decoder: 28-layer transformer decoder with 1024 hidden size
85
  - Tokenizer: Qwen tokenizer with special ASR tokens
86
 
 
 
 
 
 
 
 
 
 
 
 
 
 
87
 
88
  License
89
 
 
103
  journal={arXiv preprint arXiv:2601.21337},
104
  year={2025}
105
  }
106
+
107
  For the HuggingFace metadata UI, fill in:
108
  - **License**: Apache 2.0
109
  - **Base model**: Qwen/Qwen3-ASR-0.6B