stepfun-ai
/

Step-Audio-EditX

Model card Files Files and versions

yangpeng08 commited on Nov 7, 2025

Commit

d8b6fa0

·

verified ·

1 Parent(s): afb35b6

Update README.md

Files changed (1) hide show

README.md +28 -1

README.md CHANGED Viewed

@@ -93,9 +93,36 @@ Assume you have one GPU with at least 32GB memory available  and have already do
 ```bash
 # Step-Audio-EditX demo
-python app.py --model-path where_you_download_dir --model-source local
 ```
 ## Citation
 ```

 ```bash
 # Step-Audio-EditX demo
+python app.py --model-path where_you_download_dir --model-source local
 ```
+#### Local Inference Demo
+> [!TIP]
+> For optimal performance, keep audio under 30 seconds per inference.
+```bash
+# zero-shot cloning
+python3 tts_infer.py \
+    --model-path where_you_download_dir \
+    --output-dir ./output \
+    --prompt-text "your prompt text"\
+    --prompt-audio your_prompt_audio_path \
+    --generated-text "your target text" \
+    --edit-type "clone"
+# edit
+python3 tts_infer.py \
+    --model-path where_you_download_dir \
+    --output-dir ./output \
+    --prompt-text "your promt text" \
+    --prompt-audio your_prompt_audio_path \
+    --generated-text "" \ # for para-linguistic editing, you need to specify the generatedd text
+    --edit-type "emotion" \
+    --edit-info "sad" \
+    --n-edit-iter 2
+```
 ## Citation
 ```