mlx-community
/

Fun-ASR-Nano-2512-4bit

Automatic Speech Recognition

speech-recognition

Model card Files Files and versions

depasquale commited on Dec 16, 2025

Commit

2e4e29b

·

verified ·

1 Parent(s): 16bd0f8

Upload folder using huggingface_hub

Files changed (1) hide show

README.md +3 -33

README.md CHANGED Viewed

@@ -10,19 +10,7 @@ tags:
 - stt
 pipeline_tag: automatic-speech-recognition
 language:
-- en
-- zh
-- ja
-- ko
-- es
-- fr
-- de
-- it
-- pt
-- ru
-- ar
-- th
-- vi
 ---
 # mlx-community/Fun-ASR-Nano-2512-4bit
@@ -33,7 +21,7 @@ This model was converted to MLX format from [FunAudioLLM/Fun-ASR-Nano-2512](http
 | Feature | Description |
 |---------|-------------|
-| **Multilingual** | Supports 13+ languages: English, Chinese, Japanese, Korean, Spanish, French, German, Italian, Portuguese, Russian, Arabic, Thai, Vietnamese |
 | **Translation** | Translate speech directly to English text |
 | **Custom prompting** | Guide recognition with domain-specific context |
 | **Streaming** | Real-time token-by-token output |
@@ -107,24 +95,6 @@ for chunk in model.generate("audio.wav", stream=True):
     print(chunk, end="", flush=True)
 ```
-### Batch Processing
-```python
-audio_files = ["meeting1.wav", "meeting2.wav", "meeting3.wav"]
-for audio_path in audio_files:
-    result = model.generate(audio_path)
-    print(f"{audio_path}: {result.text}")
-```
 ## Supported Languages
-| Code | Language | Code | Language |
-|------|----------|------|----------|
-| `en` | English | `ru` | Russian |
-| `zh` | Chinese | `ar` | Arabic |
-| `ja` | Japanese | `th` | Thai |
-| `ko` | Korean | `vi` | Vietnamese |
-| `es` | Spanish | `de` | German |
-| `fr` | French | `it` | Italian |
-| `pt` | Portuguese | `auto` | Auto-detect |

 - stt
 pipeline_tag: automatic-speech-recognition
 language:
+- multilingual
 ---
 # mlx-community/Fun-ASR-Nano-2512-4bit
 | Feature | Description |
 |---------|-------------|
+| **Multilingual** | Supports 13+ languages |
 | **Translation** | Translate speech directly to English text |
 | **Custom prompting** | Guide recognition with domain-specific context |
 | **Streaming** | Real-time token-by-token output |
     print(chunk, end="", flush=True)
 ```
 ## Supported Languages
+See [original model](https://huggingface.co/FunAudioLLM/Fun-ASR-Nano-2512) for the full list of supported languages.