Spaces:

yoyolicoris
/

diffvox

Running

yoyolicoris commited on Oct 28

Commit

5c5a99e

1 Parent(s): d9872f2

docs: update demo description and update overview image

Files changed (2) hide show

app.py CHANGED Viewed

@@ -39,12 +39,13 @@ def chain_functions(*functions):
 title_md = "# Vocal Effects Generator"
 description_md = """
-This is a demo of the paper [DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions](https://arxiv.org/abs/2504.14735), accepted at DAFx 2025.
 In this demo, you can upload a raw vocal audio file (in mono) and use our model to apply professional-quality vocal processing by tweaking generated effects settings to enhance your vocals!
 The effects consist of series of EQ, compressor, delay, and reverb.
 The generator is a PCA model derived from 365 vocal effects presets fitted with the same effects chain.
 This interface allows you to control the principal components (PCs) of the generator, randomise them, and render the audio.
 To give you some idea, we empirically found that the first PC controls the amount of reverb and the second PC controls the amount of brightness.
 Note that adding these PCs together does not necessarily mean that their effects are additive in the final audio.
@@ -396,7 +397,7 @@ with gr.Blocks() as demo:
             description_md,
             elem_id="description",
         )
-        gr.Image("diffvox_diagram.png", elem_id="diagram")
     with gr.Row():
         with gr.Column():

 title_md = "# Vocal Effects Generator"
 description_md = """
+This is the demo [PCA-DiffVox: Augmenting Vocal Effects Tweakability With a Bijective Latent Space](https://www.waspaa.com/waspaa25/proceedings/WASPAA2025-291.pdf), which was presented at WASPAA 2025 and was built upon our work [DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions](https://arxiv.org/abs/2504.14735).
 In this demo, you can upload a raw vocal audio file (in mono) and use our model to apply professional-quality vocal processing by tweaking generated effects settings to enhance your vocals!
 The effects consist of series of EQ, compressor, delay, and reverb.
 The generator is a PCA model derived from 365 vocal effects presets fitted with the same effects chain.
 This interface allows you to control the principal components (PCs) of the generator, randomise them, and render the audio.
+A brief illustration of the system is shown on the right.
 To give you some idea, we empirically found that the first PC controls the amount of reverb and the second PC controls the amount of brightness.
 Note that adding these PCs together does not necessarily mean that their effects are additive in the final audio.
             description_md,
             elem_id="description",
         )
+        gr.Image("overview.png", elem_id="diagram", height=500)
     with gr.Row():
         with gr.Column():

overview.png ADDED Viewed