AXERA-TECH
/

CosyVoice2

Model card Files Files and versions

lihongjie commited on Sep 26, 2025

Commit

39ad93d

·

1 Parent(s): 41b3743

update

Files changed (1) hide show

README.md +37 -8

README.md CHANGED Viewed

@@ -42,24 +42,28 @@ For those who are interested in model conversion, you can try to export axmodel
 Download all files from this repository to the device
-### 1. Text to Speech (Voice Cloning)
-#### (1) Copy this project to AX650 Board
-#### (2). Prepare Dependencies
 **Running HTTP Tokenizer Server** and **Processing Prompt Speech** require these Python packages. If you run these two step on a PC, install them on the PC.
 ```
 pip3 install -r scripts/requirements.txt
 ```
-#### 2. Start HTTP Tokenizer Server
 ```
 cd scripts
 python cosyvoice2_tokenizer.py --host {your host} --port {your port}
 ```
-#### 3. Run on AX650 Board
 1) Moidfy the HTTP host in `run_ax650.sh`.
 2) Run `run_ax650.sh`
@@ -127,17 +131,42 @@ Output Speech：
 [output.wav](asset/output.wav)
-#### Optional. Process Prompt Speech
 If you want to replicate a specific sound, do this step.
 You can use audio in asset/ .
-##### (1). Downlaod wetext
 ```
 pip3 install modelscope
 modelscope download --model pengzhendong/wetext --local_dir pengzhendong/wetext
 ```
-##### (2). Process Prompt Speech
 Example:
 ```
 python3 scripts/process_prompt.py --prompt_text  asset/zh_man1.txt --prompt_speech asset/zh_man1.wav --output zh_man1

 Download all files from this repository to the device
+### 1. PrePare
+#### 1.1 Copy this project to AX650 Board
+#### 1.2 Prepare Dependencies
 **Running HTTP Tokenizer Server** and **Processing Prompt Speech** require these Python packages. If you run these two step on a PC, install them on the PC.
 ```
 pip3 install -r scripts/requirements.txt
 ```
+### 2. Start HTTP Tokenizer Server
 ```
 cd scripts
 python cosyvoice2_tokenizer.py --host {your host} --port {your port}
 ```
+### 3. Run on Axera Device
+There are 2 kinds of device, AX650 Board and AXCL aarch64 Board.
+#### 3.1 Run on AX650 Board
 1) Moidfy the HTTP host in `run_ax650.sh`.
 2) Run `run_ax650.sh`
 [output.wav](asset/output.wav)
+####  Or run on AX650 Board with Gradio GUI
+1) Start server
+```
+bash run_api_ax650.sh
+```
+2) Start Gradio GUI
+```
+python scripts/gradio_demo.py
+```
+#### 3.2 Run on AXCL aarch64 Board
+```
+bash run_axcl_aarch64.sh
+```
+#### Or run on AXCL aarch64 Board with Gradio GUI
+1) Start server
+```
+bash run_api_axcl_aarch64.sh
+```
+2) Start Gradio GUI
+```
+python scripts/gradio_demo.py
+```
+### Optional. Process Prompt Speech
 If you want to replicate a specific sound, do this step.
 You can use audio in asset/ .
+#### (1). Downlaod wetext
 ```
 pip3 install modelscope
 modelscope download --model pengzhendong/wetext --local_dir pengzhendong/wetext
 ```
+#### (2). Process Prompt Speech
 Example:
 ```
 python3 scripts/process_prompt.py --prompt_text  asset/zh_man1.txt --prompt_speech asset/zh_man1.wav --output zh_man1