Spaces:

ResembleAI
/

Dramabox

Running on Zero

Manmay commited on Apr 28

Commit

5d085de

verified ·

1 Parent(s): 31ad2d7

DramaBox Space README

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,12 +1,42 @@
 ---
-title: Dramabox
-emoji: 📈
-colorFrom: purple
-colorTo: pink
 sdk: gradio
-sdk_version: 6.13.0
 app_file: app.py
-pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: DramaBox
+emoji: 🎭
+colorFrom: red
+colorTo: indigo
 sdk: gradio
+sdk_version: 4.44.1
 app_file: app.py
+pinned: true
+license: other
+license_name: ltx-2-community
+license_link: https://huggingface.co/ResembleAI/Dramabox/blob/main/LICENSE
+hardware: l40s
+short_description: Expressive TTS with voice cloning — DramaBox demo
 ---
+# DramaBox — Expressive TTS Demo
+Live demo of [`ResembleAI/Dramabox`](https://huggingface.co/ResembleAI/Dramabox). Write a scene prompt, optionally upload a 10-second voice reference, and generate. Audio is automatically watermarked with [Resemble Perth](https://github.com/resemble-ai/Perth).
+The model checkpoints download automatically on first launch.
+## Prompt format
+```
+<speaker description>, "<dialogue>" <action direction> "<more dialogue>"
+```
+- **Inside double quotes**: dialogue and phonetic sounds (`"Hahaha"`, `"Mmmmm"`, `"Ugh"`)
+- **Outside quotes**: stage directions (`She sighs.`, `He clears his throat.`)
+- **Avoid inside quotes**: `Ahem`, `Pfft`, `Sigh`, `Gasp`, `Cough` — the model will speak them literally.
+See the **Load an example prompt** dropdown for ready-made scene templates.
+## Files
+- `app.py` — Gradio UI
+- `src/inference_server.py` — warm `TTSServer` (single load, ~2.5s/request)
+- `src/inference.py` — CLI inference
+- `src/model_downloader.py` — auto-fetches model from HuggingFace
+- `ltx2/` — vendored LTX-2 pipelines
+- `requirements.txt` — Python deps (includes `resemble-perth`)