How to use FunAudioLLM/Fun-ASR-Nano-2512-hf with Transformers:

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="FunAudioLLM/Fun-ASR-Nano-2512-hf")
```

```python
# Load model directly
from transformers import AutoModelForSeq2SeqLM

model = AutoModelForSeq2SeqLM.from_pretrained("FunAudioLLM/Fun-ASR-Nano-2512-hf", dtype="auto")
```
---
language:
- zh
- en
- ja
- ko
- yue
- vi
- id
- th
- ms
- tl
- ar
- hi
- multilingual
license: apache-2.0
library_name: transformers
pipeline_tag: automatic-speech-recognition
tags:
- speech-recognition
- asr
- end-to-end
- multilingual
- streaming
---
# Fun-ASR-Nano (HuggingFace Transformers)

This is the HuggingFace Transformers-compatible version of [Fun-ASR-Nano-2512](https://huggingface.co/FunAudioLLM/Fun-ASR-Nano-2512).

Fun-ASR-Nano is an end-to-end speech recognition model by [FunAudioLLM](https://github.com/FunAudioLLM), trained on tens of millions of hours of real speech data. It supports multilingual speech recognition covering Chinese (with dialects), English, Japanese, Korean, and many more languages.

For full documentation, benchmarks, and usage instructions, please refer to the [main model card](https://huggingface.co/FunAudioLLM/Fun-ASR-Nano-2512).
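
As a quick smoke test, the sketch below writes a short WAV file and passes a file path to the Transformers `automatic-speech-recognition` pipeline. It is a minimal illustration, not the authors' reference usage: the helper names `write_test_tone` and `transcribe` are hypothetical, the 16 kHz mono format is an assumption (a common ASR input rate, not stated in this card), and reading the result from the `"text"` key assumes this checkpoint follows the standard pipeline output format. A sine tone is not speech, so replace the generated file with a real recording to get a meaningful transcript.

```python
import math
import struct
import wave

MODEL_ID = "FunAudioLLM/Fun-ASR-Nano-2512-hf"
SAMPLE_RATE = 16_000  # assumption: common ASR input rate, not stated in this card


def write_test_tone(path, seconds=1.0, freq_hz=440.0, sample_rate=SAMPLE_RATE):
    """Write a mono 16-bit PCM sine tone and return the number of frames written."""
    n_frames = int(seconds * sample_rate)
    samples = (
        int(0.3 * 32767 * math.sin(2 * math.pi * freq_hz * i / sample_rate))
        for i in range(n_frames)
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)      # mono
        wav.setsampwidth(2)      # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"".join(struct.pack("<h", s) for s in samples))
    return n_frames


def transcribe(path, model_id=MODEL_ID):
    """Run the ASR pipeline on one audio file and return the recognized text."""
    # Imported lazily so the WAV helper works without transformers installed.
    from transformers import pipeline

    pipe = pipeline("automatic-speech-recognition", model=model_id)
    # Assumption: the result is a dict with a "text" entry, as in the standard pipeline.
    return pipe(path)["text"]
```

Calling `write_test_tone("tone.wav")` followed by `transcribe("tone.wav")` downloads the checkpoint on first use, so the first call needs network access and may take a while.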