---
license: apache-2.0
---
# FireRedASR-AED
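Deployment of the AED-L variant of Xiaohongshu's FireRedASR speech recognition model on AX650N. The original project is at [https://github.com/FireRedTeam/FireRedASR](https://github.com/FireRedTeam/FireRedASR).
The converted models are placed in the `axmodel` directory. Chinese and English are currently supported. The maximum input length is 10 seconds of audio; longer audio is split with VAD before inference.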
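The 10-second input limit can be illustrated with a minimal sketch. It assumes a VAD step has already produced speech spans as `(start, end)` pairs in seconds, and only shows how spans that are still too long might be capped. The function name `cap_segments` and the fixed-step splitting are assumptions made for illustration, not the method used in this repository.

```python
MAX_SEGMENT_SECONDS = 10.0  # input limit stated in this README


def cap_segments(segments, max_len=MAX_SEGMENT_SECONDS):
    """Split (start, end) spans, in seconds, so that none exceeds max_len.

    Hypothetical helper: the spans are assumed to come from a VAD step.
    """
    capped = []
    for start, end in segments:
        if end <= start:
            continue  # skip empty or inverted spans
        cursor = start
        # Emit full-length pieces while more than max_len remains.
        while end - cursor > max_len:
            capped.append((cursor, cursor + max_len))
            cursor += max_len
        # Emit the remainder, which is at most max_len long.
        capped.append((cursor, end))
    return capped


if __name__ == "__main__":
    # One short span and one 25-second span that must be split.
    print(cap_segments([(0.0, 4.0), (5.0, 30.0)]))
```

Cutting at fixed offsets can split a word in half, which is presumably why the project relies on VAD boundaries first; the cap here is only a fallback for spans with no usable pause.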
## Model conversion
See the [GitHub repository](https://github.com/ml-inory/FireRedASR.axera/tree/main)
## Supported platforms
- [x] AX650N
## Installing dependencies
### Audio backend
```
sudo apt install libsndfile1
```
### Python
The test environment is Python 3.12. [Miniconda](https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-aarch64.sh) is recommended; see the [installation guide](https://www.anaconda.com/docs/getting-started/miniconda/install#aws-graviton2%2Farm64).
```
conda create -n fireredasr python=3.12
conda activate fireredasr
pip install -r requirements.txt
```
### Installing pyaxengine
```
wget https://github.com/AXERA-TECH/pyaxengine/releases/download/0.1.3.rc1/axengine-0.1.3-py3-none-any.whl
pip install axengine-0.1.3-py3-none-any.whl
```
## Usage
```
conda activate fireredasr
python test_ax_model.py
```
```hypo_axmodel.txt``` contains the recognition results.
## Performance
- RTF ≈ 0.3
- CER (on a custom dataset): 3.45%
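
For readers unfamiliar with the two metrics, the sketch below shows how they are conventionally defined: CER is the character-level edit distance divided by the number of reference characters, and RTF is processing time divided by audio duration. The function names are hypothetical, and this is not necessarily the script used to produce the figures above.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings, one row at a time."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(refs, hyps):
    """Character error rate over paired reference/hypothesis strings."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    chars = sum(len(r) for r in refs)
    return errors / chars


def rtf(processing_seconds, audio_seconds):
    """Real-time factor: values below 1 mean faster than real time."""
    return processing_seconds / audio_seconds


if __name__ == "__main__":
    print(cer(["abcd"], ["abed"]))  # one substitution over four characters
    print(rtf(3.0, 10.0))           # 3 s of compute for 10 s of audio
```

Under these definitions, an RTF of about 0.3 means a 10-second clip takes roughly 3 seconds to transcribe.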