Update model card: fix script paths, add license section

Files changed (1)

README.md +8 -16

@@ -61,16 +61,7 @@ This bundle is self-contained — all weights are packaged in one repository.
 ## How to Get Started
-Install [`mlx-speech`](https://github.com/appautomaton/mlx-speech) and clone
-the repo (the Step Audio CLI entry point is not yet in the unified public API).
-```bash
-pip install mlx-speech
-git clone https://github.com/appautomaton/mlx-speech.git
-cd mlx-speech
-```
-Download the bundle with `huggingface-cli`:
 ```bash
 hf download appautomaton/step-audio-editx-8bit-mlx \
@@ -80,9 +71,7 @@ hf download appautomaton/step-audio-editx-8bit-mlx \
 **Voice cloning:**
 ```bash
-python scripts/generate_step_audio_editx.py \
-  --model-dir models/stepfun/step_audio_editx/mlx-int8 \
-  --prefer-mlx-int8 \
   --prompt-audio reference.wav \
   --prompt-text "Transcript of reference audio." \
   -o cloned.wav \
@@ -92,9 +81,7 @@ python scripts/generate_step_audio_editx.py \
 **Audio editing (change emotion):**
 ```bash
-python scripts/generate_step_audio_editx.py \
-  --model-dir models/stepfun/step_audio_editx/mlx-int8 \
-  --prefer-mlx-int8 \
   --prompt-audio input.wav \
   --prompt-text "Transcript of input audio." \
   -o happy.wav \
@@ -136,3 +123,8 @@ On Apple Silicon with int8 weights and bf16 activations, real-time factor
 - Upstream model: [`stepfun-ai/Step-Audio-EditX`](https://huggingface.co/stepfun-ai/Step-Audio-EditX)
 - Technical report: [arXiv:2511.03601](https://arxiv.org/abs/2511.03601)
 - More examples: [AppAutomaton](https://github.com/appautomaton)

 ## How to Get Started
+Download the bundle:
 ```bash
 hf download appautomaton/step-audio-editx-8bit-mlx \
 **Voice cloning:**
 ```bash
+python scripts/generate/step_audio_editx.py \
   --prompt-audio reference.wav \
   --prompt-text "Transcript of reference audio." \
   -o cloned.wav \
 **Audio editing (change emotion):**
 ```bash
+python scripts/generate/step_audio_editx.py \
   --prompt-audio input.wav \
   --prompt-text "Transcript of input audio." \
   -o happy.wav \
 - Upstream model: [`stepfun-ai/Step-Audio-EditX`](https://huggingface.co/stepfun-ai/Step-Audio-EditX)
 - Technical report: [arXiv:2511.03601](https://arxiv.org/abs/2511.03601)
 - More examples: [AppAutomaton](https://github.com/appautomaton)
+## License
+Apache 2.0 — following the upstream license published with
+[`stepfun-ai/Step-Audio-EditX`](https://huggingface.co/stepfun-ai/Step-Audio-EditX).
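
Since this commit moves the script from `scripts/generate_step_audio_editx.py` to `scripts/generate/step_audio_editx.py`, a checkout made before the move may still have only the old path. A minimal sketch of a check that works with either layout is below. `resolve_script` is a hypothetical helper written for illustration, not part of `mlx-speech`, and it assumes it is given the root of a local `mlx-speech` checkout.

```bash
# Sketch only: resolve_script is a hypothetical helper, not part of mlx-speech.
# It prints the first Step Audio EditX script path found under a checkout root,
# trying the new layout from this commit before the old one.
resolve_script() {
  root="${1:-.}"
  for candidate in \
    scripts/generate/step_audio_editx.py \
    scripts/generate_step_audio_editx.py
  do
    if [ -f "$root/$candidate" ]; then
      printf '%s\n' "$candidate"
      return 0
    fi
  done
  # Neither layout was found: report on stderr and signal failure.
  echo "no Step Audio EditX script found under $root" >&2
  return 1
}
```

From the checkout root, `python "$(resolve_script .)"` followed by the flags shown in the README would then run whichever path is present.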