depasquale commited on
Commit
2e4e29b
·
verified ·
1 Parent(s): 16bd0f8

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +3 -33
README.md CHANGED
@@ -10,19 +10,7 @@ tags:
10
  - stt
11
  pipeline_tag: automatic-speech-recognition
12
  language:
13
- - en
14
- - zh
15
- - ja
16
- - ko
17
- - es
18
- - fr
19
- - de
20
- - it
21
- - pt
22
- - ru
23
- - ar
24
- - th
25
- - vi
26
  ---
27
 
28
  # mlx-community/Fun-ASR-Nano-2512-4bit
@@ -33,7 +21,7 @@ This model was converted to MLX format from [FunAudioLLM/Fun-ASR-Nano-2512](http
33
 
34
  | Feature | Description |
35
  |---------|-------------|
36
- | **Multilingual** | Supports 13+ languages: English, Chinese, Japanese, Korean, Spanish, French, German, Italian, Portuguese, Russian, Arabic, Thai, Vietnamese |
37
  | **Translation** | Translate speech directly to English text |
38
  | **Custom prompting** | Guide recognition with domain-specific context |
39
  | **Streaming** | Real-time token-by-token output |
@@ -107,24 +95,6 @@ for chunk in model.generate("audio.wav", stream=True):
107
  print(chunk, end="", flush=True)
108
  ```
109
 
110
- ### Batch Processing
111
-
112
- ```python
113
- audio_files = ["meeting1.wav", "meeting2.wav", "meeting3.wav"]
114
-
115
- for audio_path in audio_files:
116
- result = model.generate(audio_path)
117
- print(f"{audio_path}: {result.text}")
118
- ```
119
-
120
  ## Supported Languages
121
 
122
- | Code | Language | Code | Language |
123
- |------|----------|------|----------|
124
- | `en` | English | `ru` | Russian |
125
- | `zh` | Chinese | `ar` | Arabic |
126
- | `ja` | Japanese | `th` | Thai |
127
- | `ko` | Korean | `vi` | Vietnamese |
128
- | `es` | Spanish | `de` | German |
129
- | `fr` | French | `it` | Italian |
130
- | `pt` | Portuguese | `auto` | Auto-detect |
 
10
  - stt
11
  pipeline_tag: automatic-speech-recognition
12
  language:
13
+ - multilingual
 
 
 
 
 
 
 
 
 
 
 
 
14
  ---
15
 
16
  # mlx-community/Fun-ASR-Nano-2512-4bit
 
21
 
22
  | Feature | Description |
23
  |---------|-------------|
24
+ | **Multilingual** | Supports 13+ languages |
25
  | **Translation** | Translate speech directly to English text |
26
  | **Custom prompting** | Guide recognition with domain-specific context |
27
  | **Streaming** | Real-time token-by-token output |
 
95
  print(chunk, end="", flush=True)
96
  ```
97
 
 
 
 
 
 
 
 
 
 
 
98
  ## Supported Languages
99
 
100
+ See [original model](https://huggingface.co/FunAudioLLM/Fun-ASR-Nano-2512) for the full list of supported languages.