tags:
- stt
pipeline_tag: automatic-speech-recognition
language:
- multilingual
---

# mlx-community/Fun-ASR-MLT-Nano-2512-8bit

This model was converted to MLX format from [FunAudioLLM/Fun-ASR-MLT-Nano-2512](https://huggingface.co/FunAudioLLM/Fun-ASR-MLT-Nano-2512) using [mlx-audio-plus](https://github.com/DePasqualeOrg/mlx-audio-plus) version **0.1.4**.

| Feature | Description |
|---------|-------------|
| **Multilingual** | Supports 31 languages, with a focus on East and Southeast Asian languages |
| **Chinese dialects** | Supports 7 major Chinese dialects |
| **Code-switching** | Handles mixed-language speech within sentences |
| **Translation** | Translates speech directly to English text |
| **Custom prompting** | Guides recognition with domain-specific context |
| **Streaming** | Real-time token-by-token output |

## Installation

```bash
pip install -U mlx-audio-plus
```

## Usage

```python
from mlx_audio.stt.models.funasr import Model

# Load the model
model = Model.from_pretrained("mlx-community/Fun-ASR-MLT-Nano-2512-8bit")

# Transcribe audio
result = model.generate("audio.wav")
```

### Streaming

```python
for chunk in model.generate("audio.wav", stream=True):
    print(chunk, end="", flush=True)
```
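For programmatic use, the streamed chunks can also be collected into a single string. A minimal sketch (the `stream_transcript` helper is illustrative, not part of the library; it assumes only that `generate(..., stream=True)` yields text chunks, as in the example above):

```python
def stream_transcript(model, audio_path):
    """Collect streamed text chunks into one transcript string.

    Assumes `model.generate(audio_path, stream=True)` yields text
    chunks, as in the streaming example above.
    """
    return "".join(model.generate(audio_path, stream=True))
```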
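Multiple files can be transcribed by looping the single-file call. A minimal sketch (the helper names are illustrative; it assumes `model.generate(path)` returns a result with a `.text` attribute, as in the usage example above):

```python
from pathlib import Path

def collect_wavs(folder):
    """Return sorted .wav file paths under `folder` (illustrative helper)."""
    return sorted(str(p) for p in Path(folder).glob("*.wav"))

def transcribe_folder(model, folder):
    """Map each .wav path under `folder` to its transcript.

    Assumes `model.generate(path)` returns a result object with a
    `.text` attribute.
    """
    return {path: model.generate(path).text for path in collect_wavs(folder)}
```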
## Supported Languages

See [original model](https://huggingface.co/FunAudioLLM/Fun-ASR-MLT-Nano-2512) for the full list of supported languages.