Duplicate from hi-paris/ssml-breaks2ssml-fr-lora

Browse files

Files changed (5) hide show

.gitattributes +37 -0
README.md +191 -0
adapter_config.json +39 -0
adapter_model.safetensors +3 -0
notebook.ipynb +378 -0

.gitattributes ADDED Viewed

	@@ -0,0 +1,37 @@

+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+checkpoint-180/tokenizer.json filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,191 @@

+---
+license: apache-2.0
+base_model: Qwen/Qwen2.5-7B
+library_name: peft
+language:
+- fr
+tags:
+- lora
+- peft
+- ssml
+- text-to-speech
+- qwen2.5
+pipeline_tag: text-generation
+---
+# 🗣️ French Breaks-to-SSML LoRA Model
+**hi-paris/ssml-breaks2ssml-fr-lora** is a LoRA adapter fine-tuned on Qwen2.5-7B to convert text with symbolic `<break/>` markers into rich SSML markup with prosody control (pitch, rate, volume) and precise break timing.
+This is the **second stage** of a two-step SSML cascade pipeline for improving French text-to-speech prosody control.
+> 📄 **Paper**: *"Improving Synthetic Speech Quality via SSML Prosody Control"*
+> **Authors**: Nassima Ould-Ouali, Awais Sani, Ruben Bueno, Jonah Dauvet, Tim Luka Horstmann, Eric Moulines
+> **Conference**: ICNLSP 2025
+> 🔗 **Demo & Audio Samples**: https://hi-paris.github.io/DemoTTS/
+## 🧩 Pipeline Overview
+| Stage | Model | Purpose |
+|-------|-------|---------|
+| 1️⃣ | [hi-paris/ssml-text2breaks-fr-lora](https://huggingface.co/hi-paris/ssml-text2breaks-fr-lora) | Predicts natural pause locations |
+| 2️⃣ | **hi-paris/ssml-breaks2ssml-fr-lora** | Converts breaks to full SSML with prosody |
+## ✨ Example
+**Input:**
+```
+Bonjour comment allez-vous ?<break/>
+```
+**Output:**
+```
+<prosody pitch="+2.5%" rate="-1.2%" volume="-5.0%">Bonjour comment allez-vous ?</prosody><break time="300ms"/>
+```
+## 🚀 Quick Start
+### Installation
+```bash
+pip install torch transformers peft accelerate
+```
+### Basic Usage
+```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from peft import PeftModel
+import torch
+# Load base model and tokenizer
+base_model = AutoModelForCausalLM.from_pretrained(
+    "Qwen/Qwen2.5-7B",
+    torch_dtype=torch.float16,
+    device_map="auto"
+)
+tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B")
+# Load LoRA adapter
+model = PeftModel.from_pretrained(base_model, "hi-paris/ssml-breaks2ssml-fr-lora")
+# Prepare input (text with <break/> markers)
+text_with_breaks = "Bonjour comment allez-vous ?<break/>"
+formatted_input = f"### Task:\nConvert text to SSML with pauses:\n\n### Text:\n{text_with_breaks}\n\n### SSML:\n"
+# Generate
+inputs = tokenizer(formatted_input, return_tensors="pt").to(model.device)
+with torch.no_grad():
+    outputs = model.generate(
+        **inputs,
+        max_new_tokens=128,
+        temperature=0.3,
+        do_sample=False,
+        pad_token_id=tokenizer.eos_token_id
+    )
+response = tokenizer.decode(outputs[0], skip_special_tokens=True)
+result = response.split("### SSML:\n")[-1].strip()
+print(result)
+```
+### Production Usage (Recommended)
+For production use with memory optimization, see our [inference repository](https://github.com/hi-paris/cascading_model):
+```python
+from breaks2ssml_inference import Breaks2SSMLInference
+# Memory-efficient shared model approach
+model = Breaks2SSMLInference()
+result = model.predict("Bonjour comment allez-vous ?<break/>")
+```
+## 🔧 Full Cascade Example
+```python
+from breaks2ssml_inference import CascadedInference
+# Initialize full pipeline (memory efficient - single base model)
+cascade = CascadedInference()
+# Convert plain text directly to full SSML
+text = "Bonjour comment allez-vous aujourd'hui ?"
+ssml_output = cascade.predict(text)
+print(ssml_output)
+# Output: '<prosody pitch="+2.5%" rate="-1.2%" volume="-5.0%">Bonjour comment allez-vous aujourd'hui ?</prosody><break time="300ms"/>'
+```
+## 🧠 Model Details
+- **Base Model**: [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B)
+- **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
+- **LoRA Rank**: 8, Alpha: 16
+- **Target Modules**: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj`
+- **Training**: 5 epochs, batch size 1 with gradient accumulation
+- **Language**: French
+- **Model Size**: 7B parameters (LoRA adapter: ~81MB)
+- **License**: Apache 2.0
+## 📊 Performance
+| Metric | Score |
+|--------|-------|
+| Pause Insertion Accuracy | 87.3% |
+| RMSE (pause duration) | 98.5 ms |
+| MOS gain (vs. baseline) | +0.42 |
+*Evaluation performed on held-out French validation set with annotated SSML pauses. Mean Opinion Score (MOS) improvements assessed using TTS outputs with Azure Henri voice, rated by 30 native French speakers.*
+## 🎯 SSML Features Generated
+- **Prosody Control**: Dynamic pitch, rate, and volume adjustments
+- **Break Timing**: Precise pause durations (e.g., `<break time="300ms"/>`)
+- **Contextual Adaptation**: Prosody values adapted to semantic content
+## ⚠️ Limitations
+- Optimized primarily for Azure TTS voices (e.g., `fr-FR-HenriNeural`)
+- Requires input text with `<break/>` markers (use Stage 1 model for automatic prediction)
+- Currently supports break tags only (pitch/rate/volume via prosody wrapper)
+## 🔗 Resources
+- **Full Pipeline Code**: https://github.com/hi-paris/cascading_model
+- **Interactive Demo**: [Colab Notebook](https://colab.research.google.com/drive/1K3bcLHRfbSy9syWRZR6D0hyTb5lqivGi)
+- **Stage 1 Model**: [hi-paris/ssml-text2breaks-fr-lora](https://huggingface.co/hi-paris/ssml-text2breaks-fr-lora)
+## 📖 Paper
+This model is part of the work described in:
+[Improving French Synthetic Speech Quality via SSML Prosody Control](https://arxiv.org/abs/2508.17494)
+If you use this model, please cite the paper.
+```
+@inproceedings{ouali-etal-2025-improving,
+    title = "Improving {F}rench Synthetic Speech Quality via {SSML} Prosody Control",
+    author = "Ouali, Nassima Ould  and
+      Sani, Awais Hussain  and
+      Bueno, Ruben  and
+      Dauvet, Jonah  and
+      Horstmann, Tim Luka  and
+      Moulines, Eric",
+    editor = "Abbas, Mourad  and
+      Yousef, Tariq  and
+      Galke, Lukas",
+    booktitle = "Proceedings of the 8th International Conference on Natural Language and Speech Processing (ICNLSP-2025)",
+    month = aug,
+    year = "2025",
+    address = "Southern Denmark University, Odense, Denmark",
+    publisher = "Association for Computational Linguistics",
+    url = "https://aclanthology.org/2025.icnlsp-1.30/",
+    pages = "302--314"
+}
+```
+## 📜 License
+Apache 2.0 License (same as the base Qwen2.5-7B model)

adapter_config.json ADDED Viewed

	@@ -0,0 +1,39 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen2.5-7B",
+  "bias": "none",
+  "corda_config": null,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 16,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj",
+    "k_proj",
+    "up_proj",
+    "down_proj",
+    "o_proj",
+    "v_proj",
+    "gate_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_dora": false,
+  "use_rslora": false
+}

adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:093bd69bafa2916e41174d18d4c3c47103f89e06d93758af20b4d42e08848a08
+size 80792096

notebook.ipynb ADDED Viewed

	@@ -0,0 +1,378 @@

+{
+  "cells": [
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": [
+        "# French SSML Cascade Models Demo\n",
+        "\n",
+        "<img src=\"https://www.hi-paris.fr/wp-content/uploads/2020/09/logo-hi-paris-retina.png\" alt=\"Hi! Paris\" width=\"200\"/>\n",
+        "\n",
+        "**Interactive demonstration of French SSML cascade models for improved text-to-speech prosody control.**\n",
+        "\n",
+        "This notebook demonstrates the complete pipeline from plain French text to rich SSML markup with prosody control.\n",
+        "\n",
+        "## 🧩 Pipeline Overview\n",
+        "\n",
+        "1. **Text-to-Breaks**: Predicts natural pause locations  \n",
+        "2. **Breaks-to-SSML**: Adds prosody control (pitch, rate, volume) and precise timing\n",
+        "\n",
+        "📄 **Paper**: *Improving Synthetic Speech Quality via SSML Prosody Control* (ICNLSP 2025)  \n",
+        "🔗 **Demo & Audio Samples**: https://horstmann.tech/ssml-prosody-control/  \n",
+        "📚 **Models**: [hi-paris/ssml-text2breaks-fr-lora](https://huggingface.co/hi-paris/ssml-text2breaks-fr-lora) • [hi-paris/ssml-breaks2ssml-fr-lora](https://huggingface.co/hi-paris/ssml-breaks2ssml-fr-lora)\n",
+        "\n",
+        "---"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": [
+        "## 🚀 Setup\n",
+        "\n",
+        "### Step 1: Mount Google Drive"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 34,
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "a1jNj9uK7EoL",
+        "outputId": "76624289-061f-4700-e397-50da9da9ee6d"
+      },
+      "outputs": [
+        {
+          "name": "stdout",
+          "output_type": "stream",
+          "text": [
+            "Mounted at /content/drive\n"
+          ]
+        }
+      ],
+      "source": [
+        "from google.colab import drive\n",
+        "drive.mount('/content/drive', force_remount=True)"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": [
+        "### Step 2: Clone Repository"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 35,
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "eE3iUaX_7OLG",
+        "outputId": "d621b296-b12f-489a-bc1f-c7240c21646b"
+      },
+      "outputs": [
+        {
+          "name": "stderr",
+          "output_type": "stream",
+          "text": [
+            "shell-init: error retrieving current directory: getcwd: cannot access parent directories: No such file or directory\n",
+            "chdir: error retrieving current directory: getcwd: cannot access parent directories: No such file or directory\n",
+            "Cloning into 'cascading_model'...\n"
+          ]
+        }
+      ],
+      "source": [
+        "%%bash\n",
+        "cd /content/drive/MyDrive/\n",
+        "git clone https://github.com/TimLukaHorstmann/cascading_model.git"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 36,
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "vItNbMvh7ZNL",
+        "outputId": "31a31144-1261-4427-9d2e-089ae17689b2"
+      },
+      "outputs": [
+        {
+          "name": "stdout",
+          "output_type": "stream",
+          "text": [
+            "/content/drive/MyDrive/cascading_model\n"
+          ]
+        }
+      ],
+      "source": [
+        "%cd /content/drive/MyDrive/cascading_model/\n"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 37,
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "JdeuCOX_7kae",
+        "outputId": "f8bad5e1-92d0-4531-fbe0-ca2f29a8efd8"
+      },
+      "outputs": [
+        {
+          "name": "stdout",
+          "output_type": "stream",
+          "text": [
+            "breaks2ssml_inference.py\n",
+            "demo.py\n",
+            "empty_ssml_creation.py\n",
+            "__init__.py\n",
+            "pyproject.toml\n",
+            "README.md\n",
+            "requirements.txt\n",
+            "shared_models.py\n",
+            "test_models.py\n",
+            "text2breaks_inference.py\n"
+          ]
+        }
+      ],
+      "source": [
+        "%%bash\n",
+        "ls"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": [
+        "## 🧪 Testing & Demo\n",
+        "\n",
+        "### Step 3: Verify Installation"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 38,
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "eaBx_eh-819B",
+        "outputId": "2c55f4fa-f17e-49b8-b032-74d670dcd34a"
+      },
+      "outputs": [
+        {
+          "name": "stdout",
+          "output_type": "stream",
+          "text": [
+            "2025-08-06 12:36:48.453347: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:477] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
+            "WARNING: All log messages before absl::InitializeLog() is called are written to STDERR\n",
+            "E0000 00:00:1754483808.475278   35366 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
+            "E0000 00:00:1754483808.481612   35366 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
+            "============================================================\n",
+            "🧪 French SSML Models - Test Suite\n",
+            "============================================================\n",
+            "🔍 Testing imports...\n",
+            "   ✅ PyTorch 2.5.1+cu121\n",
+            "   ✅ Transformers 4.54.0\n",
+            "   ✅ PEFT 0.16.0\n",
+            "   ✅ All imports successful!\n",
+            "\n",
+            "🔧 Testing model loading...\n",
+            "   Loading text2breaks model...\n",
+            "Loading checkpoint shards: 100% 4/4 [01:33<00:00, 23.46s/it]\n",
+            "   ✅ Text2breaks model loaded\n",
+            "   Loading breaks2ssml model...\n",
+            "   ✅ Breaks2ssml model loaded\n",
+            "   ✅ All models loaded successfully!\n",
+            "\n",
+            "🧪 Testing inference...\n",
+            "   Input: Bonjour comment allez-vous ?\n",
+            "   Testing text2breaks...\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "   Step 1 result: Bonjour comment allez-vous ?<break/>\n",
+            "   Testing breaks2ssml...\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "   Step 2 result: <prosody pitch=\"+0.64%\" rate=\"-1.92%\" volume=\"-10.00%\">\n",
+            "    Bonjour comment allez-vous ?\n",
+            "  </prosody>\n",
+            "  <break time=\"500ms\"/>\n",
+            "   ✅ Inference test successful!\n",
+            "\n",
+            "🔗 Testing full cascade...\n",
+            "   Input: Bonsoir comment ça va ?\n",
+            "   Cascade result: <prosody pitch=\"+0.64%\" rate=\"-1.92%\" volume=\"-10.00%\">\n",
+            "    Bonsoir comment ça va ?\n",
+            "  </prosody>\n",
+            "  <break time=\"500ms\"/>\n",
+            "   ✅ Cascade test successful!\n",
+            "\n",
+            "============================================================\n",
+            "🎉 All tests passed! The models are working correctly.\n",
+            "============================================================\n",
+            "\n",
+            "You can now use:\n",
+            "  - python demo.py (for examples)\n",
+            "  - python demo.py --interactive (for interactive mode)\n",
+            "  - python text2breaks_inference.py --interactive\n",
+            "  - python breaks2ssml_inference.py --interactive\n"
+          ]
+        }
+      ],
+      "source": [
+        "!python test_models.py"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": [
+        "### Step 4: Interactive Demo\n",
+        "\n",
+        "Run the interactive demo to test the models with your own French text:"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 29,
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "ZIeUY9atUhvV",
+        "outputId": "581f1395-fa70-424f-9c66-50b5e44547c3"
+      },
+      "outputs": [
+        {
+          "name": "stdout",
+          "output_type": "stream",
+          "text": [
+            "2025-08-06 12:21:35.541051: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:477] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
+            "WARNING: All log messages before absl::InitializeLog() is called are written to STDERR\n",
+            "E0000 00:00:1754482895.561958   31169 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
+            "E0000 00:00:1754482895.568312   31169 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
+            "================================================================================\n",
+            "Interactive French SSML Cascade\n",
+            "================================================================================\n",
+            "\n",
+            "Choose mode:\n",
+            "1. Full cascade (text → breaks → SSML)\n",
+            "2. Text to breaks only\n",
+            "3. Breaks to SSML only\n",
+            "\n",
+            "Select mode (1-3): 1\n",
+            "\n",
+            "Initializing models...\n",
+            "Loading checkpoint shards: 100% 4/4 [01:30<00:00, 22.70s/it]\n",
+            "Models loaded successfully!\n",
+            "\n",
+            "Enter French text (empty line to exit):\n",
+            "\n",
+            "> Je suis Luka.\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "Output: <prosody pitch=\"+0.64%\" rate=\"-1.92%\" volume=\"-10.00%\">\n",
+            "    Je suis Luka.\n",
+            "  </prosody>\n",
+            "  <break time=\"500ms\"/>\n",
+            "Time: 6.55s\n",
+            "\n",
+            "> Trés bien.\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "Output: <prosody pitch=\"+0.64%\" rate=\"-1.92%\" volume=\"-10.00%\">\n",
+            "    Trés bien.\n",
+            "  </prosody>\n",
+            "  <break time=\"500ms\"/>\n",
+            "Time: 5.64s\n",
+            "\n",
+            "> Je suis Bertrand Perier. Je suis avocat et vous écoutez ma masterclass.\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details.\n",
+            "Output: <prosody pitch=\"+0.64%\" rate=\"-1.92%\" volume=\"-10.00%\">\n",
+            "    Je suis Bertrand Perier.\n",
+            "  </prosody>\n",
+            "  <break time=\"500ms\"/>\n",
+            "\n",
+            "  <prosody pitch=\"+3.78%\" rate=\"-1.29%\" volume=\"-10.00%\">\n",
+            "    Je suis avocat et vous écoutez ma masterclass.\n",
+            "  </prosody>\n",
+            "  <break time=\"500ms\"/>\n",
+            "Time: 12.11s\n",
+            "\n",
+            "> Exception ignored in: <module 'threading' from '/usr/lib/python3.11/threading.py'>\n",
+            "Traceback (most recent call last):\n",
+            "  File \"/usr/lib/python3.11/threading.py\", line 1541, in _shutdown\n",
+            "    def _shutdown():\n",
+            "    \n",
+            "KeyboardInterrupt: \n"
+          ]
+        }
+      ],
+      "source": [
+        "!python demo.py --interactive"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": [
+        "## 🎯 Example Usage\n",
+        "\n",
+        "```python\n",
+        "from breaks2ssml_inference import CascadedInference\n",
+        "\n",
+        "# Initialize the full cascade\n",
+        "cascade = CascadedInference()\n",
+        "\n",
+        "# Convert plain French text to SSML\n",
+        "text = \"Bonjour comment allez-vous aujourd'hui ?\"\n",
+        "result = cascade.predict(text)\n",
+        "print(result)\n",
+        "```\n",
+        "\n",
+        "**Expected Output:**\n",
+        "```xml\n",
+        "<prosody pitch=\"+2.5%\" rate=\"-1.2%\" volume=\"-5.0%\">Bonjour comment allez-vous aujourd'hui ?</prosody><break time=\"300ms\"/>\n",
+        "```\n",
+        "\n",
+        "## 📚 Resources\n",
+        "\n",
+        "- **Audio Demos**: https://horstmann.tech/ssml-prosody-control/\n",
+        "- **GitHub Repository**: https://github.com/TimLukaHorstmann/cascading_model\n",
+        "- **Stage 1 Model**: https://huggingface.co/hi-paris/ssml-text2breaks-fr-lora\n",
+        "- **Stage 2 Model**: https://huggingface.co/hi-paris/ssml-breaks2ssml-fr-lora\n",
+        "\n",
+        "---\n",
+        "*Hi! Paris - Interdisciplinary Research Institute for Artificial Intelligence*"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {},
+      "source": []
+    }
+  ],
+  "metadata": {
+    "accelerator": "GPU",
+    "colab": {
+      "gpuType": "T4",
+      "provenance": []
+    },
+    "kernelspec": {
+      "display_name": "Python 3",
+      "name": "python3"
+    },
+    "language_info": {
+      "name": "python"
+    }
+  },
+  "nbformat": 4,
+  "nbformat_minor": 0
+}