yoyolicoris commited on
Commit
5c5a99e
·
1 Parent(s): d9872f2

docs: update demo description and update overview image

Browse files
Files changed (2) hide show
  1. app.py +3 -2
  2. overview.png +3 -0
app.py CHANGED
@@ -39,12 +39,13 @@ def chain_functions(*functions):
39
 
40
  title_md = "# Vocal Effects Generator"
41
  description_md = """
42
- This is a demo of the paper [DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions](https://arxiv.org/abs/2504.14735), accepted at DAFx 2025.
43
  In this demo, you can upload a raw vocal audio file (in mono) and use our model to apply professional-quality vocal processing by tweaking generated effects settings to enhance your vocals!
44
 
45
  The effects consist of series of EQ, compressor, delay, and reverb.
46
  The generator is a PCA model derived from 365 vocal effects presets fitted with the same effects chain.
47
  This interface allows you to control the principal components (PCs) of the generator, randomise them, and render the audio.
 
48
 
49
  To give you some idea, we empirically found that the first PC controls the amount of reverb and the second PC controls the amount of brightness.
50
  Note that adding these PCs together does not necessarily mean that their effects are additive in the final audio.
@@ -396,7 +397,7 @@ with gr.Blocks() as demo:
396
  description_md,
397
  elem_id="description",
398
  )
399
- gr.Image("diffvox_diagram.png", elem_id="diagram")
400
 
401
  with gr.Row():
402
  with gr.Column():
 
39
 
40
  title_md = "# Vocal Effects Generator"
41
  description_md = """
42
+ This is the demo [PCA-DiffVox: Augmenting Vocal Effects Tweakability With a Bijective Latent Space](https://www.waspaa.com/waspaa25/proceedings/WASPAA2025-291.pdf), which was presented at WASPAA 2025 and was built upon our work [DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions](https://arxiv.org/abs/2504.14735).
43
  In this demo, you can upload a raw vocal audio file (in mono) and use our model to apply professional-quality vocal processing by tweaking generated effects settings to enhance your vocals!
44
 
45
  The effects consist of series of EQ, compressor, delay, and reverb.
46
  The generator is a PCA model derived from 365 vocal effects presets fitted with the same effects chain.
47
  This interface allows you to control the principal components (PCs) of the generator, randomise them, and render the audio.
48
+ A brief illustration of the system is shown on the right.
49
 
50
  To give you some idea, we empirically found that the first PC controls the amount of reverb and the second PC controls the amount of brightness.
51
  Note that adding these PCs together does not necessarily mean that their effects are additive in the final audio.
 
397
  description_md,
398
  elem_id="description",
399
  )
400
+ gr.Image("overview.png", elem_id="diagram", height=500)
401
 
402
  with gr.Row():
403
  with gr.Column():
overview.png ADDED

Git LFS Details

  • SHA256: 34fc7b1eb1fbdba11f10b96bf3e878fdf8163196893bac27ddad598ac80e5ac0
  • Pointer size: 131 Bytes
  • Size of remote file: 187 kB