Spaces:

sonicase
/

granular-synthesis

Sleeping

App Files Files Community

sonicase commited on Feb 14

Commit

90f6d15

1 Parent(s): 8e35f78

new rain simulation method

Browse files

Files changed (3) hide show

README.md +7 -3
app.py +412 -406
requirements.txt +1 -0

README.md CHANGED Viewed

@@ -10,15 +10,19 @@ pinned: false
 license: mit
 ---
-# Granular Synthesis — Interactive Demo
 Educational demo of granular synthesis with two modes.
 ## Rain Simulation
-Each raindrop is a procedurally generated grain: noise burst → biquad low-pass filter → exponential envelope. Thousands of grains, scattered randomly in time with per-drop variation in size, spectral content, amplitude, and stereo position, create realistic rain textures.
-Three layers: individual drops (granular scatter), continuous wash (filtered brownian noise), and optional thunder (low-frequency sine cluster).
 ## Tonal Granular

 license: mit
 ---
+# Granular Synthesis
 Educational demo of granular synthesis with two modes.
 ## Rain Simulation
+Spectral-domain rain synthesis. White noise is sculpted in the frequency domain
+to match the spectral profile of real rainfall (energy concentrated 2 to 12 kHz,
+with slope varying by intensity). Each STFT frame is a "grain" whose spectrum
+is shaped by the rain profile, then overlap-added into the output.
+Three layers: continuous spectral wash, sparse transient drops (scipy bandpass filtered),
+and optional thunder rumble.
 ## Tonal Granular

app.py CHANGED Viewed

@@ -1,287 +1,369 @@
 """
-Granular Synthesis Demo — Rain Simulation (v2)
-===============================================
-v2 fixes the "frying bacon" problem:
-  - Biquad low-pass filter instead of naive box averaging
-  - Resonant body model for each drop (drops ring, not just click)
-  - Longer grains with proper exponential-then-silence tail
-  - Stereo field with per-drop panning
-  - Layered background wash using filtered brownian noise
-The core granular principle is the same: thousands of tiny
-overlapping sound fragments → emergent texture.
 """
 import numpy as np
 import gradio as gr
 # ---------------------------------------------------------------------------
 # Constants
 # ---------------------------------------------------------------------------
 SR = 44100
-DURATION = 6.0  # output length in seconds
 # ---------------------------------------------------------------------------
-# DSP utilities
 # ---------------------------------------------------------------------------
-def biquad_lowpass(signal: np.ndarray, cutoff_hz: float, sr: int = SR, Q: float = 0.707) -> np.ndarray:
     """
-    Second-order IIR low-pass filter (biquad).
-    WHY a biquad instead of moving-average?
-    A moving average is a very weak filter — it barely attenuates
-    high frequencies and creates comb-filter artifacts.
-    A biquad gives a proper -12 dB/octave roll-off with a tunable
-    cutoff frequency, which is essential for shaping rain timbre.
-    The math comes from the Audio EQ Cookbook (Robert Bristow-Johnson).
     """
-    w0 = 2.0 * np.pi * cutoff_hz / sr
-    alpha = np.sin(w0) / (2.0 * Q)
-    b0 = (1.0 - np.cos(w0)) / 2.0
-    b1 = 1.0 - np.cos(w0)
-    b2 = b0
-    a0 = 1.0 + alpha
-    a1 = -2.0 * np.cos(w0)
-    a2 = 1.0 - alpha
     # Normalize
-    b = np.array([b0 / a0, b1 / a0, b2 / a0])
-    a = np.array([1.0, a1 / a0, a2 / a0])
-    # Apply filter (direct form II transposed)
-    out = np.zeros_like(signal)
-    z1, z2 = 0.0, 0.0
-    for i in range(len(signal)):
-        x = signal[i]
-        y = b[0] * x + z1
-        z1 = b[1] * x - a[1] * y + z2
-        z2 = b[2] * x - a[2] * y
-        out[i] = y
-    return out
-def brownian_noise(n_samples: int) -> np.ndarray:
-    """
-    Brownian (red) noise = integrated white noise.
-    WHY brownian? It has a -6 dB/octave spectral slope, which sounds
-    like a deep, smooth rumble — much closer to steady rain wash
-    than white noise (which sounds like static/frying).
-    """
-    white = np.random.randn(n_samples) * 0.02
-    brown = np.cumsum(white)
-    # Remove DC drift and normalize
-    brown -= np.mean(brown)
-    peak = np.max(np.abs(brown))
-    if peak > 0:
-        brown /= peak
-    return brown
 # ---------------------------------------------------------------------------
-# Raindrop grain synthesis (v2 — much better)
 # ---------------------------------------------------------------------------
-def make_raindrop(
-    size_ms: float,
-    cutoff_hz: float,
-    resonance: float = 1.0,
-) -> np.ndarray:
     """
-    Synthesize one raindrop as: noise burst → biquad filter → envelope.
-    A real raindrop sound has three phases:
-      1. IMPACT — very short broadband transient (< 1 ms)
-      2. BODY — the surface resonates briefly (metal rings, glass taps)
-      3. TAIL — fast exponential decay into silence
-    We model this with a noise burst shaped by:
-      - A two-stage envelope (sharp attack + tunable decay)
-      - A biquad low-pass at a cutoff that varies with drop "size"
-      - Resonance (Q factor) that models the surface material
     """
-    n = max(int((size_ms / 1000.0) * SR), 64)
-    t = np.linspace(0, 1, n, endpoint=False)
-    # --- Two-stage envelope: fast attack, variable decay ---
-    # The attack is near-instant (first 5% of grain).
-    # The decay rate determines how "ringy" vs "dead" the surface is.
-    attack = np.minimum(t / 0.02, 1.0)  # ramp up in first 2% of grain
-    decay = np.exp(-6.0 * t)            # smooth exponential tail
-    envelope = attack * decay
-    # --- Noise source ---
-    # WHY noise and not a sine? Water impact is chaotic — it excites
-    # all frequencies at once. The filter then shapes the spectrum.
-    noise = np.random.randn(n)
-    # --- Apply envelope BEFORE filtering ---
-    # This way the filter's transient response adds a natural "ring"
-    # to the attack, which sounds like a surface being excited.
-    shaped = noise * envelope
-    # --- Biquad low-pass with resonance ---
-    # cutoff_hz controls brightness (glass=high, soil=low)
-    # resonance (Q) controls how much the surface "rings"
-    Q = 0.707 + resonance * 2.0  # 0.707=flat, higher=resonant peak
-    filtered = biquad_lowpass(shaped, cutoff_hz, Q=Q)
-    # Normalize grain
-    peak = np.max(np.abs(filtered))
-    if peak > 0:
-        filtered /= peak
-    return filtered
 # ---------------------------------------------------------------------------
-# Rain engine (v2)
 # ---------------------------------------------------------------------------
 def synthesize_rain(
     rain_type: str,
-    drop_size_ms: float,
-    drops_per_sec: float,
-    intensity: float,
-    surface_brightness: float,
-    surface_resonance: float,
     stereo_width: float,
 ) -> np.ndarray:
     """
-    Full rain synthesis engine.
-    Architecture:
-      Layer 1 — Individual drops (granular scatter)
-      Layer 2 — Continuous wash (filtered brownian noise)
-      Layer 3 — Thunder rumble (optional, low sine cluster)
-    Each layer uses different granular/procedural techniques
-    but they combine into a cohesive, natural rain sound.
     """
     n_out = int(DURATION * SR)
-    # Stereo output: shape (n_out, 2)
-    output = np.zeros((n_out, 2), dtype=np.float64)
-    # --- Map surface_brightness (0–1) to filter cutoff ---
-    # 0 = very dark (soil, 400 Hz) → 1 = very bright (tin, 8000 Hz)
-    # WHY logarithmic mapping? Human pitch perception is logarithmic.
-    cutoff = 400.0 * (2.0 ** (surface_brightness * 4.3))  # 400 → ~8000 Hz
-    # --- Rain type presets modify the base parameters ---
-    type_config = {
-        "light":   {"density_mult": 0.4, "size_mult": 0.6, "wash": 0.0,  "thunder": False},
-        "medium":  {"density_mult": 1.0, "size_mult": 1.0, "wash": 0.08, "thunder": False},
-        "heavy":   {"density_mult": 2.5, "size_mult": 1.4, "wash": 0.25, "thunder": False},
-        "thunder": {"density_mult": 3.0, "size_mult": 1.6, "wash": 0.35, "thunder": True},
-    }
-    cfg = type_config.get(rain_type, type_config["medium"])
-    total_drops = int(drops_per_sec * cfg["density_mult"] * intensity * DURATION)
-    actual_size = drop_size_ms * cfg["size_mult"]
-    # --- LAYER 1: Individual raindrop grains ---
-    # Poisson-random placement in time (real rain is stochastic)
-    drop_times = np.random.randint(0, max(n_out - int(actual_size / 1000 * SR) - 1, 1), size=total_drops)
-    for pos in drop_times:
-        # Per-drop variation (no two drops identical)
-        this_size = actual_size * np.random.uniform(0.5, 1.8)
-        this_cutoff = cutoff * np.random.uniform(0.6, 1.5)
-        this_res = surface_resonance * np.random.uniform(0.3, 1.0)
-        grain = make_raindrop(this_size, this_cutoff, this_res)
-        g_len = len(grain)
-        end = min(pos + g_len, n_out)
-        actual = end - pos
-        # Amplitude: random distance simulation (far drops are quieter)
-        amp = np.random.uniform(0.15, 1.0) ** 1.5  # power curve = more quiet drops
-        # Stereo panning: random position in the stereo field
-        # pan=0 → full left, pan=1 → full right
-        pan = 0.5 + (np.random.uniform(-1, 1) * stereo_width * 0.5)
-        pan = np.clip(pan, 0, 1)
-        # Constant-power panning (preserves perceived loudness)
-        L = np.cos(pan * np.pi / 2) * amp
-        R = np.sin(pan * np.pi / 2) * amp
-        output[pos:end, 0] += grain[:actual] * L
-        output[pos:end, 1] += grain[:actual] * R
-    # --- LAYER 2: Continuous background wash ---
-    # WHY a separate layer? When thousands of tiny drops overlap,
-    # you stop hearing individuals — it becomes a continuous "shhh".
-    # We model this directly with filtered brownian noise.
-    if cfg["wash"] > 0:
-        wash_L = brownian_noise(n_out)
-        wash_R = brownian_noise(n_out)  # independent channels = spatial width
-        # Filter the wash to match the surface brightness
-        wash_cutoff = cutoff * 0.6  # wash is always darker than individual drops
-        wash_L = biquad_lowpass(wash_L, wash_cutoff)
-        wash_R = biquad_lowpass(wash_R, wash_cutoff)
-        # Slow amplitude modulation — rain wash isn't perfectly steady
-        t = np.linspace(0, DURATION, n_out)
-        mod = 0.7 + 0.3 * np.sin(2 * np.pi * 0.15 * t + np.random.uniform(0, 2 * np.pi))
-        output[:, 0] += wash_L * mod * cfg["wash"] * intensity
-        output[:, 1] += wash_R * mod * cfg["wash"] * intensity
-    # --- LAYER 3: Thunder ---
-    if cfg["thunder"]:
-        n_thunders = np.random.randint(1, 3)
-        for _ in range(n_thunders):
-            th_pos = np.random.randint(0, n_out // 2)
-            th_len = int(SR * np.random.uniform(1.5, 3.5))
-            t = np.linspace(0, 1, th_len)
-            rumble = np.zeros(th_len)
-            for f in [22, 35, 48, 65, 80]:
-                phase = np.random.uniform(0, 2 * np.pi)
-                rumble += np.sin(2 * np.pi * f * t + phase) * np.random.uniform(0.3, 1.0)
-            # Thunder envelope: slow build, long tail
-            env = np.exp(-1.2 * t) * (1 - np.exp(-8 * t))
-            rumble *= env * 0.35
-            end = min(th_pos + th_len, n_out)
-            seg = rumble[:end - th_pos]
-            # Thunder is roughly centered in stereo
-            output[th_pos:end, 0] += seg * 0.8
-            output[th_pos:end, 1] += seg * 0.8
-    # --- Final normalization ---
-    peak = np.max(np.abs(output))
     if peak > 0:
-        output *= 0.9 / peak
-    return output
 # ---------------------------------------------------------------------------
-# Tonal granular engine (unchanged logic, improved quality)
 # ---------------------------------------------------------------------------
 def make_tonal_source(freq: float = 220.0, duration: float = 2.0) -> np.ndarray:
     t = np.linspace(0, duration, int(SR * duration), endpoint=False)
-    signal = np.zeros_like(t)
     for k in range(1, 8):
-        signal += (1.0 / k) * np.sin(2 * np.pi * freq * k * t)
-    signal /= np.max(np.abs(signal))
-    return signal
 def granular_synthesize(source, grain_size_ms, density, randomness, pitch_shift):
     grain_samples = max(int((grain_size_ms / 1000.0) * SR), 64)
     window = np.hanning(grain_samples)
     hop = max(int(grain_samples / density), 1)
     n_out = int(DURATION * SR)
     output = np.zeros(n_out, dtype=np.float64)
@@ -302,7 +384,6 @@ def granular_synthesize(source, grain_size_ms, density, randomness, pitch_shift)
         rand_pos = np.random.randint(0, max(src_len - grain_samples, 1))
         start = int(seq_pos * (1 - randomness) + rand_pos * randomness)
         start = np.clip(start, 0, src_len - grain_samples)
         grain = pitched[start: start + grain_samples] * window
         out_pos = i * hop
         if out_pos + grain_samples > n_out:
@@ -317,11 +398,11 @@ def granular_synthesize(source, grain_size_ms, density, randomness, pitch_shift)
 # ---------------------------------------------------------------------------
-# Gradio callbacks
 # ---------------------------------------------------------------------------
-def cb_rain(rain_type, drop_size, drops_sec, intensity, brightness, resonance, stereo):
-    audio = synthesize_rain(rain_type, drop_size, drops_sec, intensity, brightness, resonance, stereo)
     return (SR, audio.astype(np.float32))
@@ -332,173 +413,98 @@ def cb_tonal(grain_size, density, randomness, pitch_shift, freq):
 # ---------------------------------------------------------------------------
-# Custom CSS — dark theme inspired by Material for MkDocs
 # ---------------------------------------------------------------------------
-CUSTOM_CSS = """
-@import url('https://fonts.googleapis.com/css2?family=Inter:wght@300;400;500;600&family=JetBrains+Mono:wght@400&display=swap');
-/* ── Global overrides ── */
-.gradio-container {
-    font-family: 'Inter', -apple-system, BlinkMacSystemFont, sans-serif !important;
-    background: #0d1117 !important;
-    color: #c9d1d9 !important;
-    max-width: 1100px !important;
-    margin: auto !important;
-}
-/* ── Remove default Gradio footer ── */
 footer { display: none !important; }
-/* ── Typography ── */
-h1, h2, h3 {
-    font-family: 'Inter', sans-serif !important;
-    font-weight: 600 !important;
-    color: #e6edf3 !important;
-    letter-spacing: -0.02em !important;
-}
-h1 { font-size: 1.75rem !important; }
-h3 { font-size: 1.05rem !important; color: #8b949e !important; }
-p, span, label, .gr-prose {
-    font-family: 'Inter', sans-serif !important;
-    color: #c9d1d9 !important;
-    font-size: 0.9rem !important;
-    line-height: 1.6 !important;
-}
-/* ── Panels and cards ── */
-.gr-panel, .gr-box, .gr-form, .gr-block {
-    background: #161b22 !important;
-    border: 1px solid #21262d !important;
-    border-radius: 8px !important;
-}
-/* ── Tabs ── */
 .tab-nav button {
-    font-family: 'Inter', sans-serif !important;
     font-weight: 500 !important;
-    font-size: 0.9rem !important;
-    color: #8b949e !important;
     border: none !important;
-    background: transparent !important;
-    padding: 10px 20px !important;
     border-bottom: 2px solid transparent !important;
 }
 .tab-nav button.selected {
     color: #58a6ff !important;
-    border-bottom: 2px solid #58a6ff !important;
-}
-/* ── Sliders ── */
-input[type="range"] {
-    accent-color: #58a6ff !important;
-}
-.gr-slider label {
-    font-weight: 500 !important;
-}
-/* ── Buttons ── */
-.gr-button-primary {
-    background: #238636 !important;
-    border: 1px solid #2ea043 !important;
-    color: #ffffff !important;
-    font-family: 'Inter', sans-serif !important;
-    font-weight: 500 !important;
-    border-radius: 6px !important;
-    padding: 8px 24px !important;
-    transition: background 0.15s ease !important;
-}
-.gr-button-primary:hover {
-    background: #2ea043 !important;
-}
-/* ── Radio buttons ── */
-.gr-radio label {
-    font-family: 'Inter', sans-serif !important;
-}
-/* ── Audio player ── */
-audio {
-    border-radius: 6px !important;
-}
-/* ── Tables in markdown ── */
-table {
-    font-family: 'Inter', sans-serif !important;
-    font-size: 0.82rem !important;
-    border-collapse: collapse !important;
-    width: 100% !important;
-}
-th {
-    background: #21262d !important;
-    color: #8b949e !important;
-    font-weight: 500 !important;
-    padding: 8px 12px !important;
-    text-align: left !important;
-}
-td {
-    padding: 6px 12px !important;
-    border-top: 1px solid #21262d !important;
-    color: #c9d1d9 !important;
-}
-/* ── Info text under sliders ── */
-.gr-info {
-    font-size: 0.78rem !important;
-    color: #6e7681 !important;
-    font-style: normal !important;
-}
-/* ── Code / mono ── */
-code, .mono {
-    font-family: 'JetBrains Mono', monospace !important;
-    font-size: 0.82rem !important;
-    background: #21262d !important;
-    padding: 2px 6px !important;
-    border-radius: 4px !important;
-}
-/* ── Accent color for links ── */
-a { color: #58a6ff !important; }
-/* ── Divider ── */
-hr {
-    border: none !important;
-    border-top: 1px solid #21262d !important;
-    margin: 1.5rem 0 !important;
 }
 """
 # ---------------------------------------------------------------------------
 # UI
 # ---------------------------------------------------------------------------
-with gr.Blocks(
-    title="Granular Synthesis · Rain",
-    css=CUSTOM_CSS,
-    theme=gr.themes.Base(),
-) as demo:
     gr.Markdown(
         """
-        # Granular Synthesis — Interactive Demo
-        ### Micro-sound, grain clouds, texture design
         """
     )
     with gr.Tabs():
-        # ======================== RAIN TAB ========================
         with gr.TabItem("Rain Simulation"):
             gr.Markdown(
                 """
-                Each raindrop is a **noise micro-burst** shaped by an exponential envelope
-                and a resonant low-pass filter. Thousands of them, scattered randomly in time
-                with per-drop variation in size, brightness, and stereo position, create rain.
-                This is granular synthesis — but instead of slicing an existing recording,
-                each grain is **procedurally generated**.
                 """
             )
@@ -508,37 +514,37 @@ with gr.Blocks(
                         choices=["light", "medium", "heavy", "thunder"],
                         value="medium",
                         label="Rain type",
-                        info="Affects density multiplier, spectral content, and optional layers.",
-                    )
-                    drop_size = gr.Slider(
-                        minimum=2, maximum=60, value=18, step=1,
-                        label="Drop size (ms)",
-                        info="Grain duration. Larger → splashier, more resonant.",
-                    )
-                    rain_density = gr.Slider(
-                        minimum=10, maximum=400, value=100, step=5,
-                        label="Drops / second",
-                        info="Base rate before type multiplier.",
-                    )
-                    rain_intensity = gr.Slider(
-                        minimum=0.5, maximum=5.0, value=1.5, step=0.1,
-                        label="Intensity",
-                        info="Global density and wash volume scaling.",
                     )
                     brightness = gr.Slider(
-                        minimum=0.0, maximum=1.0, value=0.45, step=0.01,
                         label="Surface brightness",
-                        info="Low-pass cutoff. 0 → earth / foliage (dark). 1 → glass / tin (bright).",
                     )
-                    resonance = gr.Slider(
-                        minimum=0.0, maximum=1.0, value=0.3, step=0.01,
-                        label="Surface resonance",
-                        info="Filter Q. Higher → surface rings more (metallic).",
                     )
                     stereo = gr.Slider(
-                        minimum=0.0, maximum=1.0, value=0.7, step=0.01,
                         label="Stereo width",
-                        info="0 → mono centre. 1 → full L/R scatter.",
                     )
                     rain_btn = gr.Button("Generate rain", variant="primary", size="lg")
@@ -547,26 +553,26 @@ with gr.Blocks(
                     gr.Markdown(
                         """
-                        **Presets to try**
-                        | Scene | Type | ms | d/s | Int | Bright | Res |
                         |---|---|---|---|---|---|---|
-                        | Drizzle on leaves | light | 10 | 30 | 1.0 | 0.2 | 0.1 |
-                        | Window at night | medium | 18 | 100 | 1.5 | 0.5 | 0.3 |
-                        | Tin roof | medium | 12 | 140 | 2.0 | 0.9 | 0.8 |
-                        | Downpour | heavy | 25 | 250 | 3.0 | 0.4 | 0.2 |
-                        | Thunderstorm | thunder | 30 | 300 | 4.0 | 0.35 | 0.25 |
-                        | Forest canopy | light | 22 | 50 | 1.0 | 0.15 | 0.5 |
                         """
                     )
             rain_btn.click(
                 fn=cb_rain,
-                inputs=[rain_type, drop_size, rain_density, rain_intensity, brightness, resonance, stereo],
                 outputs=rain_audio,
             )
-        # ======================== TONAL TAB ========================
         with gr.TabItem("Tonal Granular"):
             gr.Markdown(
                 """
@@ -578,10 +584,10 @@ with gr.Blocks(
                 with gr.Column(scale=1):
                     source_freq = gr.Slider(80, 880, 220, step=1, label="Source frequency (Hz)")
                     grain_size = gr.Slider(5, 200, 50, step=1, label="Grain size (ms)",
-                        info="Smaller → buzzy. Larger → smooth.")
                     tonal_density = gr.Slider(1, 8, 4, step=0.5, label="Density (overlap)")
                     randomness_sl = gr.Slider(0, 1, 0.3, step=0.01, label="Position randomness",
-                        info="0 → sequential. 1 → fully random (freeze/texture).")
                     pitch = gr.Slider(0.25, 4.0, 1.0, step=0.05, label="Pitch shift")
                     tonal_btn = gr.Button("Synthesize", variant="primary", size="lg")
@@ -591,8 +597,8 @@ with gr.Blocks(
                         """
                         **Signal chain**
-                        Source (additive harmonics) → grain extraction (sequential + random blend)
-                        → Hann window (click-free edges) → overlap-add → normalize
                         """
                     )
@@ -605,7 +611,7 @@ with gr.Blocks(
     gr.Markdown(
         """
         ---
-        Built with Python and Gradio — no audio samples, everything is synthesized from scratch.
         Part of [Generative Audio Soundscapes Lab](https://my-sonicase.github.io/genaudio-soundscapes/).
         """
     )

 """
+Granular Synthesis Demo // Rain Simulation (v3)
+================================================
+Why v1/v2 sounded like frying bacon:
+  The old approach generated individual noise-burst "drops" and summed them.
+  This creates a sparse, clicky texture because:
+    1. Short noise bursts have flat spectra (white noise = frying sound)
+    2. Box/naive filters barely shape the spectrum
+    3. Individual grains are too sparse to fuse into a continuous texture
+Real rain is NOT a sum of isolated clicks. Acoustically, rain is a
+CONTINUOUS stochastic process with a specific spectral shape:
+  - Energy concentrated between 1 kHz and 15 kHz
+  - Peak around 5-8 kHz (research: Nystuen et al., raindrop acoustics)
+  - Spectral slope that varies with rain intensity
+  - Slow amplitude modulation (gusts, intensity fluctuation)
+  - Small drops produce 13-25 kHz (drizzle shimmer)
+  - Large drops add energy below 2 kHz (heavy rain rumble)
+v3 approach: spectral domain synthesis.
+  1. Generate white noise in the frequency domain (FFT)
+  2. Sculpt the spectrum to match real rain profiles
+  3. Add temporal modulation (amplitude envelopes that breathe)
+  4. Layer: continuous wash + transient drops + optional thunder
+  5. Stereo decorrelation for spatial width
+This is still granular thinking: the "grains" are now overlapping
+FFT frames (STFT), each with a shaped spectrum. The overlap-add
+reconstruction is the same principle as classic granular synthesis.
 """
 import numpy as np
+from scipy import signal as sig
+from scipy.fft import rfft, irfft
 import gradio as gr
 # ---------------------------------------------------------------------------
 # Constants
 # ---------------------------------------------------------------------------
 SR = 44100
+DURATION = 7.0  # seconds
 # ---------------------------------------------------------------------------
+# Spectral rain profile
 # ---------------------------------------------------------------------------
+def rain_spectral_profile(
+    n_fft: int,
+    brightness: float,
+    rain_type: str,
+) -> np.ndarray:
     """
+    Build a frequency-domain magnitude envelope that matches
+    the spectral shape of real rainfall.
+    Based on underwater acoustic rainfall studies:
+    small drops peak at 13-25 kHz, large drops are broadband 1-50 kHz,
+    most rain energy sits in the 2-12 kHz band.
+    We model this as a bandpass profile (skewed Gaussian in log-frequency)
+    whose center frequency and bandwidth shift with brightness and rain type.
     """
+    n_bins = n_fft // 2 + 1
+    freqs = np.linspace(0, SR / 2, n_bins)
+    # Avoid log(0)
+    freqs_safe = np.maximum(freqs, 1.0)
+    log_freqs = np.log2(freqs_safe)
+    # Center frequency shifts with brightness
+    # Low brightness (dark/soil): center around 2 kHz
+    # High brightness (glass/metal): center around 8 kHz
+    center_hz = 1500 * (2.0 ** (brightness * 2.5))  # 1.5 kHz to ~8.5 kHz
+    center_log = np.log2(center_hz)
+    # Bandwidth in octaves (wider for heavy rain)
+    bw_map = {"light": 1.8, "medium": 2.2, "heavy": 3.0, "thunder": 3.5}
+    bw = bw_map.get(rain_type, 2.2)
+    # Skewed Gaussian in log-frequency space
+    profile = np.exp(-0.5 * ((log_freqs - center_log) / bw) ** 2)
+    # Add high-frequency shimmer for light rain (drizzle peak at 13-25 kHz)
+    if rain_type == "light":
+        shimmer_center = np.log2(16000)
+        shimmer = 0.4 * np.exp(-0.5 * ((log_freqs - shimmer_center) / 0.5) ** 2)
+        profile += shimmer
+    # Add sub-bass rumble for heavy/thunder
+    if rain_type in ("heavy", "thunder"):
+        bass_center = np.log2(300)
+        bass = 0.3 * np.exp(-0.5 * ((log_freqs - bass_center) / 1.0) ** 2)
+        profile += bass
+    # Roll off everything below 80 Hz (rumble is not rain)
+    highpass = 1.0 / (1.0 + (80.0 / freqs_safe) ** 4)
+    profile *= highpass
     # Normalize
+    profile /= np.max(profile) + 1e-12
+    return profile
 # ---------------------------------------------------------------------------
+# Temporal modulation (rain is not perfectly steady)
 # ---------------------------------------------------------------------------
+def make_modulation(n_samples: int, speed: float = 0.2) -> np.ndarray:
     """
+    Slow amplitude modulation to simulate natural intensity fluctuation.
+    Rain intensity varies over seconds (gusts, cloud cells passing).
+    We sum a few slow random sinusoids to create an organic envelope.
     """
+    t = np.linspace(0, DURATION, n_samples)
+    mod = np.ones(n_samples, dtype=np.float64)
+    # Sum of 4 slow sinusoids with random phases
+    for i in range(4):
+        freq = speed * (0.5 + i * 0.3) + np.random.uniform(-0.05, 0.05)
+        phase = np.random.uniform(0, 2 * np.pi)
+        depth = 0.08 + 0.07 * i  # increasing modulation depth
+        mod += depth * np.sin(2 * np.pi * freq * t + phase)
+    # Keep in a reasonable range
+    mod = np.clip(mod, 0.3, 1.5)
+    # Smooth with a gentle low-pass to avoid sudden jumps
+    window = np.hanning(int(SR * 0.3))
+    window /= window.sum()
+    mod = np.convolve(mod, window, mode="same")
+    return mod
 # ---------------------------------------------------------------------------
+# Core: spectral rain synthesis via overlap-add STFT
 # ---------------------------------------------------------------------------
 def synthesize_rain(
     rain_type: str,
+    brightness: float,
+    density: float,
+    modulation_speed: float,
     stereo_width: float,
+    highcut: float,
+    lowcut: float,
 ) -> np.ndarray:
     """
+    Synthesize rain using FFT-based spectral shaping.
+    This is granular synthesis at the frame level:
+    each STFT frame is a "grain" whose spectrum is sculpted,
+    and the overlap-add reconstruction creates the continuous texture.
     """
     n_out = int(DURATION * SR)
+    # FFT parameters
+    # 2048 samples at 44.1kHz = ~46ms frames. This is our "grain size"
+    # in the spectral domain. Overlap of 75% ensures smooth transitions.
+    n_fft = 2048
+    hop = n_fft // 4  # 75% overlap (standard for STFT)
+    n_frames = (n_out // hop) + 1
+    n_bins = n_fft // 2 + 1
+    # Build the target spectral profile
+    profile = rain_spectral_profile(n_fft, brightness, rain_type)
+    # Apply density scaling (affects overall energy)
+    profile *= (0.3 + density * 0.7)
+    # Apply frequency range limits from sliders
+    freqs = np.linspace(0, SR / 2, n_bins)
+    # Gentle roll-off at the edges (not a brick wall, which sounds unnatural)
+    low_rolloff = 1.0 / (1.0 + (lowcut / (freqs + 1e-6)) ** 6)
+    high_rolloff = 1.0 / (1.0 + (freqs / highcut) ** 6)
+    profile *= low_rolloff * high_rolloff
+    # Synthesis window (Hann for overlap-add, same as classic granular)
+    window = np.hanning(n_fft)
+    # Two independent channels for stereo
+    output_L = np.zeros(n_out + n_fft, dtype=np.float64)
+    output_R = np.zeros(n_out + n_fft, dtype=np.float64)
+    for frame in range(n_frames):
+        # Generate random phase noise in the frequency domain.
+        # This is the key insight: white noise = uniform random phase
+        # + flat magnitude. By keeping random phase but imposing our
+        # spectral profile as magnitude, we get colored noise that
+        # matches the rain spectrum exactly.
+        # Left channel
+        phase_L = np.random.uniform(0, 2 * np.pi, n_bins)
+        spectrum_L = profile * np.exp(1j * phase_L)
+        grain_L = irfft(spectrum_L, n=n_fft).real * window
+        # Right channel: independent phase for stereo decorrelation.
+        # stereo_width controls how different L and R are.
+        # width=0: identical (mono). width=1: fully independent.
+        if stereo_width > 0.01:
+            phase_R = phase_L * (1 - stereo_width) + np.random.uniform(0, 2 * np.pi, n_bins) * stereo_width
+            spectrum_R = profile * np.exp(1j * phase_R)
+            grain_R = irfft(spectrum_R, n=n_fft).real * window
+        else:
+            grain_R = grain_L.copy()
+        # Place grain in output (overlap-add)
+        pos = frame * hop
+        if pos + n_fft <= len(output_L):
+            output_L[pos:pos + n_fft] += grain_L
+            output_R[pos:pos + n_fft] += grain_R
+    # Trim to exact length
+    output_L = output_L[:n_out]
+    output_R = output_R[:n_out]
+    # Apply temporal modulation
+    mod = make_modulation(n_out, speed=modulation_speed)
+    output_L *= mod
+    output_R *= mod
+    # Add transient drop layer for texture (sparse individual drops on top)
+    drop_layer_L, drop_layer_R = make_drop_layer(n_out, rain_type, brightness, density)
+    # Drops are much quieter than the continuous layer
+    drop_mix = {"light": 0.5, "medium": 0.3, "heavy": 0.15, "thunder": 0.1}
+    dmix = drop_mix.get(rain_type, 0.3)
+    output_L += drop_layer_L * dmix
+    output_R += drop_layer_R * dmix
+    # Thunder
+    if rain_type == "thunder":
+        th_L, th_R = make_thunder(n_out)
+        output_L += th_L
+        output_R += th_R
+    # Final normalization
+    stereo = np.column_stack([output_L, output_R])
+    peak = np.max(np.abs(stereo))
     if peak > 0:
+        stereo *= 0.85 / peak
+    return stereo
+# ---------------------------------------------------------------------------
+# Transient drop layer (sparse individual drops for texture)
+# ---------------------------------------------------------------------------
+def make_drop_layer(
+    n_out: int,
+    rain_type: str,
+    brightness: float,
+    density: float,
+) -> tuple:
+    """
+    Sparse individual drops layered on top of the continuous wash.
+    These provide the "pointillistic" detail that makes rain sound alive.
+    Without them, the wash alone sounds like generic colored noise.
+    """
+    output_L = np.zeros(n_out, dtype=np.float64)
+    output_R = np.zeros(n_out, dtype=np.float64)
+    # Number of audible drops (not all rain drops are individually heard)
+    drops_per_sec = {"light": 8, "medium": 20, "heavy": 40, "thunder": 50}
+    n_drops = int(drops_per_sec.get(rain_type, 20) * density * DURATION)
+    # Drop duration in samples (10-40ms)
+    base_dur = int(SR * 0.02)
+    # Cutoff frequency for drops (matches surface brightness)
+    cutoff_base = 1000 * (2.0 ** (brightness * 3.0))  # 1kHz to 8kHz
+    for _ in range(n_drops):
+        pos = np.random.randint(0, max(n_out - base_dur * 3, 1))
+        # Each drop varies in duration and brightness
+        dur = int(base_dur * np.random.uniform(0.5, 2.0))
+        dur = max(dur, 64)
+        # Synthesize drop: filtered noise with sharp exponential decay
+        t = np.linspace(0, 1, dur)
+        envelope = np.exp(-np.random.uniform(8, 20) * t)
+        noise = np.random.randn(dur) * envelope
+        # Bandpass filter each drop using scipy
+        # Cutoff varies per drop for realism
+        this_cutoff = cutoff_base * np.random.uniform(0.5, 1.5)
+        this_cutoff = min(this_cutoff, SR * 0.45)
+        low = max(this_cutoff * 0.3, 100)
+        try:
+            sos = sig.butter(2, [low, this_cutoff], btype="bandpass", fs=SR, output="sos")
+            drop = sig.sosfilt(sos, noise)
+        except Exception:
+            drop = noise  # fallback if filter params are out of range
+        # Random amplitude (distance simulation)
+        amp = np.random.uniform(0.1, 1.0) ** 1.3
+        # Stereo position
+        pan = np.random.uniform(0, 1)
+        L_gain = np.cos(pan * np.pi / 2) * amp
+        R_gain = np.sin(pan * np.pi / 2) * amp
+        end = min(pos + dur, n_out)
+        seg = drop[:end - pos]
+        output_L[pos:end] += seg * L_gain
+        output_R[pos:end] += seg * R_gain
+    return output_L, output_R
+# ---------------------------------------------------------------------------
+# Thunder
+# ---------------------------------------------------------------------------
+def make_thunder(n_out: int) -> tuple:
+    """Low-frequency rumble events with slow attack and long tail."""
+    L = np.zeros(n_out, dtype=np.float64)
+    R = np.zeros(n_out, dtype=np.float64)
+    n_events = np.random.randint(1, 3)
+    for _ in range(n_events):
+        pos = np.random.randint(0, n_out // 2)
+        dur = int(SR * np.random.uniform(2.0, 4.0))
+        t = np.linspace(0, 1, dur)
+        # Sum of low frequencies with random phases
+        rumble = np.zeros(dur)
+        for f in [20, 30, 45, 60, 80, 100]:
+            phase = np.random.uniform(0, 2 * np.pi)
+            rumble += np.sin(2 * np.pi * f * t + phase) * np.random.uniform(0.3, 1.0)
+        # Envelope: slow build, long decay
+        env = np.exp(-1.0 * t) * (1 - np.exp(-6 * t))
+        rumble *= env * 0.4
+        end = min(pos + dur, n_out)
+        seg = rumble[:end - pos]
+        # Slightly different L/R for width
+        L[pos:end] += seg * np.random.uniform(0.7, 1.0)
+        R[pos:end] += seg * np.random.uniform(0.7, 1.0)
+    return L, R
 # ---------------------------------------------------------------------------
+# Tonal granular engine (unchanged)
 # ---------------------------------------------------------------------------
 def make_tonal_source(freq: float = 220.0, duration: float = 2.0) -> np.ndarray:
     t = np.linspace(0, duration, int(SR * duration), endpoint=False)
+    s = np.zeros_like(t)
     for k in range(1, 8):
+        s += (1.0 / k) * np.sin(2 * np.pi * freq * k * t)
+    s /= np.max(np.abs(s))
+    return s
 def granular_synthesize(source, grain_size_ms, density, randomness, pitch_shift):
     grain_samples = max(int((grain_size_ms / 1000.0) * SR), 64)
     window = np.hanning(grain_samples)
     hop = max(int(grain_samples / density), 1)
     n_out = int(DURATION * SR)
     output = np.zeros(n_out, dtype=np.float64)
         rand_pos = np.random.randint(0, max(src_len - grain_samples, 1))
         start = int(seq_pos * (1 - randomness) + rand_pos * randomness)
         start = np.clip(start, 0, src_len - grain_samples)
         grain = pitched[start: start + grain_samples] * window
         out_pos = i * hop
         if out_pos + grain_samples > n_out:
 # ---------------------------------------------------------------------------
+# Callbacks
 # ---------------------------------------------------------------------------
+def cb_rain(rain_type, brightness, density, mod_speed, stereo, highcut, lowcut):
+    audio = synthesize_rain(rain_type, brightness, density, mod_speed, stereo, highcut, lowcut)
     return (SR, audio.astype(np.float32))
 # ---------------------------------------------------------------------------
+# Theme + CSS
 # ---------------------------------------------------------------------------
+dark_theme = gr.themes.Base(
+    primary_hue=gr.themes.colors.blue,
+    secondary_hue=gr.themes.colors.slate,
+    neutral_hue=gr.themes.colors.slate,
+    font=gr.themes.GoogleFont("Inter"),
+    font_mono=gr.themes.GoogleFont("JetBrains Mono"),
+).set(
+    body_background_fill="#0d1117",
+    body_background_fill_dark="#0d1117",
+    body_text_color="#c9d1d9",
+    body_text_color_dark="#c9d1d9",
+    body_text_color_subdued="#8b949e",
+    body_text_color_subdued_dark="#8b949e",
+    background_fill_primary="#161b22",
+    background_fill_primary_dark="#161b22",
+    background_fill_secondary="#0d1117",
+    background_fill_secondary_dark="#0d1117",
+    block_background_fill="#161b22",
+    block_background_fill_dark="#161b22",
+    block_border_color="#21262d",
+    block_border_color_dark="#21262d",
+    block_label_text_color="#8b949e",
+    block_label_text_color_dark="#8b949e",
+    block_title_text_color="#e6edf3",
+    block_title_text_color_dark="#e6edf3",
+    border_color_primary="#21262d",
+    border_color_primary_dark="#21262d",
+    button_primary_background_fill="#238636",
+    button_primary_background_fill_dark="#238636",
+    button_primary_background_fill_hover="#2ea043",
+    button_primary_background_fill_hover_dark="#2ea043",
+    button_primary_text_color="#ffffff",
+    button_primary_text_color_dark="#ffffff",
+    button_secondary_background_fill="#21262d",
+    button_secondary_background_fill_dark="#21262d",
+    button_secondary_text_color="#c9d1d9",
+    button_secondary_text_color_dark="#c9d1d9",
+    input_background_fill="#0d1117",
+    input_background_fill_dark="#0d1117",
+    input_border_color="#30363d",
+    input_border_color_dark="#30363d",
+    slider_color="#58a6ff",
+    slider_color_dark="#58a6ff",
+    link_text_color="#58a6ff",
+    link_text_color_dark="#58a6ff",
+)
+CUSTOM_CSS = """
 footer { display: none !important; }
+h1 { letter-spacing: -0.03em !important; font-weight: 600 !important; }
+h3 { color: #8b949e !important; font-weight: 400 !important; }
 .tab-nav button {
     font-weight: 500 !important;
     border: none !important;
     border-bottom: 2px solid transparent !important;
 }
 .tab-nav button.selected {
     color: #58a6ff !important;
+    border-bottom-color: #58a6ff !important;
 }
+table { font-size: 0.82rem !important; }
+th { background: #21262d !important; color: #8b949e !important; font-weight: 500 !important; }
+td { border-top: 1px solid #21262d !important; }
 """
 # ---------------------------------------------------------------------------
 # UI
 # ---------------------------------------------------------------------------
+with gr.Blocks(title="Granular Synthesis", css=CUSTOM_CSS, theme=dark_theme) as demo:
     gr.Markdown(
         """
+        # Granular Synthesis
+        ### micro-sound, grain clouds, texture design
         """
     )
     with gr.Tabs():
+        # ======================== RAIN ========================
         with gr.TabItem("Rain Simulation"):
             gr.Markdown(
                 """
+                Spectral-domain rain synthesis. Instead of summing noise clicks,
+                we sculpt white noise in the frequency domain to match the spectral
+                profile of real rainfall (energy concentrated 2 to 12 kHz, slope varies
+                with intensity). Each STFT frame is a "grain" whose spectrum is shaped
+                by the rain profile, then overlap-added into the output.
                 """
             )
                         choices=["light", "medium", "heavy", "thunder"],
                         value="medium",
                         label="Rain type",
+                        info="Controls spectral shape, transient density, and optional layers.",
                     )
                     brightness = gr.Slider(
+                        0.0, 1.0, 0.45, step=0.01,
                         label="Surface brightness",
+                        info="0 = dark (earth, foliage). 1 = bright (glass, tin roof).",
+                    )
+                    rain_density = gr.Slider(
+                        0.2, 3.0, 1.0, step=0.1,
+                        label="Density",
+                        info="Overall thickness of the rain texture.",
                     )
+                    mod_speed = gr.Slider(
+                        0.05, 1.0, 0.2, step=0.05,
+                        label="Modulation speed",
+                        info="How fast the rain intensity fluctuates (gusts).",
                     )
                     stereo = gr.Slider(
+                        0.0, 1.0, 0.7, step=0.01,
                         label="Stereo width",
+                        info="0 = mono. 1 = fully decorrelated L/R.",
+                    )
+                    lowcut = gr.Slider(
+                        50, 2000, 150, step=10,
+                        label="Low cut (Hz)",
+                        info="Remove frequencies below this point.",
+                    )
+                    highcut = gr.Slider(
+                        2000, 20000, 14000, step=100,
+                        label="High cut (Hz)",
+                        info="Remove frequencies above this point.",
                     )
                     rain_btn = gr.Button("Generate rain", variant="primary", size="lg")
                     gr.Markdown(
                         """
+                        **Presets**
+                        | Scene | Type | Bright | Dens | Mod | LoCut | HiCut |
                         |---|---|---|---|---|---|---|
+                        | Drizzle on leaves | light | 0.2 | 0.6 | 0.1 | 200 | 18000 |
+                        | Window at night | medium | 0.5 | 1.0 | 0.2 | 150 | 14000 |
+                        | Tin roof | medium | 0.9 | 1.2 | 0.15 | 300 | 16000 |
+                        | Downpour | heavy | 0.4 | 2.0 | 0.3 | 100 | 12000 |
+                        | Thunderstorm | thunder | 0.35 | 2.5 | 0.4 | 80 | 10000 |
+                        | Forest canopy | light | 0.15 | 0.5 | 0.08 | 200 | 15000 |
                         """
                     )
             rain_btn.click(
                 fn=cb_rain,
+                inputs=[rain_type, brightness, rain_density, mod_speed, stereo, highcut, lowcut],
                 outputs=rain_audio,
             )
+        # ======================== TONAL ========================
         with gr.TabItem("Tonal Granular"):
             gr.Markdown(
                 """
                 with gr.Column(scale=1):
                     source_freq = gr.Slider(80, 880, 220, step=1, label="Source frequency (Hz)")
                     grain_size = gr.Slider(5, 200, 50, step=1, label="Grain size (ms)",
+                        info="Smaller = buzzy. Larger = smooth.")
                     tonal_density = gr.Slider(1, 8, 4, step=0.5, label="Density (overlap)")
                     randomness_sl = gr.Slider(0, 1, 0.3, step=0.01, label="Position randomness",
+                        info="0 = sequential. 1 = fully random (freeze/texture).")
                     pitch = gr.Slider(0.25, 4.0, 1.0, step=0.05, label="Pitch shift")
                     tonal_btn = gr.Button("Synthesize", variant="primary", size="lg")
                         """
                         **Signal chain**
+                        Source (additive harmonics) > grain extraction (sequential + random blend)
+                        > Hann window (click-free edges) > overlap-add > normalize
                         """
                     )
     gr.Markdown(
         """
         ---
+        Built with Python, NumPy, SciPy and Gradio. Everything is synthesized from scratch, no samples.
         Part of [Generative Audio Soundscapes Lab](https://my-sonicase.github.io/genaudio-soundscapes/).
         """
     )

requirements.txt CHANGED Viewed

@@ -1,2 +1,3 @@
 numpy
 gradio

 numpy
+scipy
 gradio