Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Paper β’ 2206.08317 β’ Published
This model is part of the FunASR ecosystem β one industrial-grade open-source toolkit for ASR Β· VAD Β· punctuation Β· speaker diarization Β· emotion / event Β· LLM-ASR. A Star really helps the project (and keeps you updated):
π FunASR Β· π SenseVoice Β· π Fun-ASR Β· π FunClip
Non-autoregressive ASR with ONNX runtime β optimized for deployment without PyTorch dependency.
This repository contains the ONNX-exported version of Paraformer-large for efficient CPU/GPU inference via ONNX runtime.
from funasr import AutoModel
model = AutoModel(
model="funasr/paraformer-zh",
hub="hf",
vad_model="funasr/fsmn-vad",
punc_model="funasr/ct-punc",
device="cuda",
)
result = model.generate(input="audio.wav")
print(result[0]["text"])
from funasr_onnx import Paraformer
model = Paraformer("funasr/Paraformer-large", quantize=True)
result = model(audio_in="audio.wav")
print(result)
| Property | Value |
|---|---|
| Architecture | Paraformer (Non-autoregressive) |
| Parameters | 220M |
| Format | ONNX |
| Languages | Chinese, English |
| Training Data | 60,000+ hours |