---
license: mit
language:
- en
- zh
base_model:
- VoxCPM
pipeline_tag: text-to-speech
library_name: transformers
tags:
- VoxCPM
- Speech
---

# VoxCPM

This version of VoxCPM has been converted to run on the Axera NPU using **w8a16** quantization.

Compatible with Pulsar2 version: 4.2

## Convert tools links:

If you are interested in model conversion, you can export the axmodel yourself starting from the original repo:

[VoxCPM official](https://github.com/OpenBMB/VoxCPM/)

[Pulsar2 Link, How to Convert LLM from Huggingface to axmodel](https://pulsar2-docs.readthedocs.io/en/latest/appendix/build_llm.html)

[AXera NPU HOST LLM Runtime](https://github.com/AXERA-TECH/VoxCPM.Axera)

## Support Platform

- AX650
  - AX650N DEMO Board
  - [M4N-Dock(爱芯派Pro)](https://wiki.sipeed.com/hardware/zh/maixIV/m4ndock/m4ndock.html)
  - [M.2 Accelerator card](https://axcl-docs.readthedocs.io/zh-cn/latest/doc_guide_hardware.html)

## How to use

Download all files from this repository to the device.

### 1. Run the Python demo

#### 1. Install the voxcpm axinfer package

```
git clone -b 1.0.4-axmode_infer https://github.com/techshoww/VoxCPM.git
cd VoxCPM
pip3 install .
```

#### 2. Download zipenhancer

```
pip3 install modelscope
modelscope download --model iic/speech_zipenhancer_ans_multiloss_16k_base --local_dir iic/speech_zipenhancer_ans_multiloss_16k_base
```

#### 3. Run on the Axera device

Go to the root directory of this project and run:

```
python3 run_ax650.py
```

### 2. Run the C++ demo

#### 1. Install transformers

```
pip3 install "transformers>=4.56.2"
```

#### 2. Start the tokenizer server

```
python3 tokenizer.py --port 9999
```

#### 3. Run the C++ demo

```
bash run_ax650.sh
```
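The C++ demo above offloads tokenization to a separate Python process (`tokenizer.py`). The exact request format that `tokenizer.py` uses is defined in this repo and not described here; purely as an illustration of the client/server pattern, here is a minimal, self-contained sketch with a hypothetical JSON-over-HTTP protocol and a toy whitespace tokenizer standing in for the real `transformers` tokenizer:

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Toy stand-in for the real transformers tokenizer used by tokenizer.py.
def toy_tokenize(text):
    return text.split()

class TokenizerHandler(BaseHTTPRequestHandler):
    # Hypothetical protocol: POST {"text": ...} -> {"tokens": [...]}
    def do_POST(self):
        length = int(self.headers["Content-Length"])
        payload = json.loads(self.rfile.read(length))
        body = json.dumps({"tokens": toy_tokenize(payload["text"])}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        # Silence per-request logging for the sketch.
        pass

def serve(port):
    # Start the server in a background thread, like a long-running
    # `python3 tokenizer.py --port 9999` process would.
    server = HTTPServer(("127.0.0.1", port), TokenizerHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server

def tokenize_remote(port, text):
    # What a client (e.g. the C++ runtime) would do to request tokens.
    req = urllib.request.Request(
        f"http://127.0.0.1:{port}/tokenize",
        data=json.dumps({"text": text}).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["tokens"]

if __name__ == "__main__":
    server = serve(0)  # port 0 = pick a free port; the repo's demo uses 9999
    port = server.server_address[1]
    print(tokenize_remote(port, "hello axera npu"))
    server.shutdown()
```

Keeping the tokenizer in a Python sidecar means the on-device C++ runtime does not need to link a tokenization library; it only needs a small HTTP/socket client.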