ChuxiJ committed on
Commit 8f40447 (verified)
1 Parent(s): 05791af

Document 50-step default for XL Base

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -46,7 +46,7 @@ output = pipe(
  prompt="An upbeat synthwave track with driving drums and a catchy lead",
  lyrics="[Verse]\nNeon lights are calling me\n[Chorus]\nRide the wave tonight",
  audio_duration=30.0,
- num_inference_steps=8,
+ num_inference_steps=50,
  guidance_scale=7.0,
  shift=3.0,
  generator=torch.Generator(device="cuda").manual_seed(42),
@@ -56,7 +56,7 @@ audio = output.audios[0] # (channels, samples), 48 kHz
  sf.write("acestep-xl-base.wav", audio.T.cpu().float().numpy(), pipe.sample_rate)
  ```
 
- Unlike the turbo checkpoint, XL Base is not guidance-distilled. The pipeline uses ACE-Step's APG guidance path when `guidance_scale > 1.0`; `guidance_scale=7.0` and `shift=3.0` are the recommended defaults. You can increase `num_inference_steps` for slower, higher-quality sampling.
+ Unlike the turbo checkpoint, XL Base is not guidance-distilled. The pipeline uses ACE-Step's APG guidance path when `guidance_scale > 1.0`; `num_inference_steps=50`, `guidance_scale=7.0`, and `shift=3.0` are the recommended defaults. Pass `num_inference_steps=50` explicitly so generation does not use the lower-step turbo setting.
 
  For batched prompts with padding and FlashAttention, use the variable-length backend:
 
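The changed README paragraph says guidance goes through an APG path when `guidance_scale > 1.0`. As a rough illustration only, and assuming APG here means adaptive projected guidance, the sketch below shows the core projection step on plain Python lists. The function name `apg_guidance` and the `eta` parameter are hypothetical and are not taken from ACE-Step's code; the momentum and norm-rescaling parts of the full method are left out, and the pipeline's actual implementation may differ.

```python
def apg_guidance(cond, uncond, guidance_scale=7.0, eta=0.0):
    """Hypothetical sketch of an APG-style guidance step (not ACE-Step's API).

    cond / uncond: conditional and unconditional model predictions,
    given here as flat lists of floats for simplicity.
    eta: weight kept on the component of the guidance update that is
    parallel to `cond` (eta=1.0 recovers plain classifier-free guidance).
    """
    if guidance_scale <= 1.0:
        # Guidance disabled: use the conditional prediction unchanged.
        return list(cond)

    # Standard guidance direction.
    diff = [c - u for c, u in zip(cond, uncond)]

    # Split the direction into parts parallel and orthogonal to `cond`.
    norm_sq = sum(c * c for c in cond)
    if norm_sq == 0.0:
        parallel = [0.0] * len(diff)
    else:
        coef = sum(d * c for d, c in zip(diff, cond)) / norm_sq
        parallel = [coef * c for c in cond]
    orthogonal = [d - p for d, p in zip(diff, parallel)]

    # Down-weight the parallel part, then apply the scaled update.
    update = [o + eta * p for o, p in zip(orthogonal, parallel)]
    return [c + (guidance_scale - 1.0) * u for c, u in zip(cond, update)]


if __name__ == "__main__":
    cond, uncond = [1.0, 0.0], [0.0, 1.0]
    print(apg_guidance(cond, uncond, guidance_scale=3.0, eta=0.0))
    print(apg_guidance(cond, uncond, guidance_scale=3.0, eta=1.0))
```

With `eta=1.0` the result equals ordinary classifier-free guidance, `uncond + guidance_scale * (cond - uncond)`, which makes the sketch easy to sanity-check against the familiar formula.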