diff --git a/Paper2Video/LICENSE b/Paper2Video/LICENSE deleted file mode 100644 index dd76e7908a4ba73fe1bd0cf725a597bed9352f2a..0000000000000000000000000000000000000000 --- a/Paper2Video/LICENSE +++ /dev/null @@ -1,21 +0,0 @@ -MIT License - -Copyright (c) 2025 Show Lab - -Permission is hereby granted, free of charge, to any person obtaining a copy -of this software and associated documentation files (the "Software"), to deal -in the Software without restriction, including without limitation the rights -to use, copy, modify, merge, publish, distribute, sublicense, and/or sell -copies of the Software, and to permit persons to whom the Software is -furnished to do so, subject to the following conditions: - -The above copyright notice and this permission notice shall be included in all -copies or substantial portions of the Software. - -THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR -IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, -FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE -AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER -LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, -OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE -SOFTWARE. diff --git a/Paper2Video/README-CN.md b/Paper2Video/README-CN.md deleted file mode 100644 index 1a8106ac91d695af96e07bd92b2b3f98bd2206c3..0000000000000000000000000000000000000000 --- a/Paper2Video/README-CN.md +++ /dev/null @@ -1,248 +0,0 @@ -# Paper2Video - -

- English | 简体中文 -

- - -

- Paper2Video: 从学术论文自动生成演讲视频 -
- - -

- Zeyu Zhu*, - Kevin Qinghong Lin*, - Mike Zheng Shou
- 新加坡国立大学 Show Lab -

- - -

📄 论文   |   - 🤗 Daily Paper   |   -  📊 数据集   |   -  🌐 项目主页   |   -  💬 推特 -

- -- **输入:** 一篇论文 ➕ 一张图像 ➕ 一段音频 - -| 论文 | 图像 | 音频 | -|--------|--------|--------| -|
[🔗 论文链接](https://arxiv.org/pdf/1509.01626) |
Hinton的图像|
[🔗 音频样本](https://github.com/showlab/Paper2Video/blob/page/assets/hinton/ref_audio_10.wav) | - - -- **输出:** 演讲视频 - - - -https://github.com/user-attachments/assets/39221a9a-48cb-4e20-9d1c-080a5d8379c4 - - - - -查看更多生成结果 [🌐 project page](https://showlab.github.io/Paper2Video/). - -## 🔥 Update -- [x] [2025.10.11] 我们的工作在[YC Hacker News](https://news.ycombinator.com/item?id=45553701)上受到关注. -- [x] [2025.10.9] 感谢AK在[Twitter](https://x.com/_akhaliq/status/1976099830004072849)上分享我们的工作! -- [x] [2025.10.9] 我们的工作被 [Medium](https://medium.com/@dataism/how-ai-learned-to-make-scientific-videos-from-slides-to-a-talking-head-0d807e491b27)报道. -- [x] [2025.10.8] 下方查看我们的demo视频! -- [x] [2025.10.7] 我们发布了 [arXiv 论文](https://arxiv.org/abs/2510.05096). -- [x] [2025.10.6] 我们发布了 [代码](https://github.com/showlab/Paper2Video) 和 [数据集](https://huggingface.co/datasets/ZaynZhu/Paper2Video). -- [x] [2025.9.28] Paper2Video 已经被 **Scaling Environments for Agents Workshop([SEA](https://sea-workshop.github.io/)) at NeurIPS 2025** 接受. - - -https://github.com/user-attachments/assets/a655e3c7-9d76-4c48-b946-1068fdb6cdd9 - - - - ---- - -### Table of Contents -- [🌟 项目总览](#-项目总览) -- [🚀 快速上手: PaperTalker](#-快速上手-PaperTalker) - - [1. 环境配置](#1-环境配置) - - [2. 大语言模型配置](#2-大语言模型配置) - - [3. 推理](#3-推理) -- [📊 评价指标: Paper2Video](#-评价指标-Paper2Video) -- [😼 乐趣: Paper2Video 生成 Paper2Video 演讲视频](#-乐趣-Paper2Video生成Paper2Video演讲视频) -- [🙏 致谢](#-致谢) -- [📌 引用](#-引用) ---- - -## 🌟 项目总览 -

- Overview -

- -这项工作解决了学术演讲的两个核心问题: - -- **左边: 如何根据论文制作学术演讲?** - *PaperTalker* — 集成**幻灯片**、**字幕**、**光标**、**语音合成**和**演讲者视频渲染**的多智能体。 - -- **右边: 如何评估学术演讲视频?** - *Paper2Video* — 一个具有精心设计的指标来评估演示质量的基准。 - - ---- - -## 🚀 尝试 PaperTalker 为你的论文制作演讲视频 ! -

- Approach -

- -### 1. 环境配置 -准备Python环境: -```bash -cd src -conda create -n p2v python=3.10 -conda activate p2v -pip install -r requirements.txt -conda install -c conda-forge tectonic -``` -下载所依赖的代码，并按照[Hallo2](https://github.com/fudan-generative-vision/hallo2)中的说明下载模型权重。 -```bash -git clone https://github.com/fudan-generative-vision/hallo2.git -``` -您需要**单独准备用于 talking-head generation 的环境**，以避免潜在的软件包冲突，具体请参考 Hallo2。安装完成后，使用 `which python` 命令获取 Python 环境路径。 -```bash -cd hallo2 -conda create -n hallo python=3.10 -conda activate hallo -pip install -r requirements.txt -``` - -### 2. 大语言模型配置 -在终端配置您的**API 凭证**: -```bash -export GEMINI_API_KEY="your_gemini_key_here" -export OPENAI_API_KEY="your_openai_key_here" -``` -最佳实践是针对 LLM 和 VLM 使用 **GPT-4.1** 或 **Gemini-2.5-Pro**。我们也支持本地部署开源模型(例如 Qwen)，详情请参阅 Paper2Poster。 - -### 3. 推理 -脚本 `pipeline.py` 提供了一个自动化的学术演示视频生成流程。它以 **LaTeX 论文素材** 和 **参考图像/音频** 作为输入，并经过多个子模块(幻灯片 → 字幕 → 语音 → 光标 → 头部特写)生成完整的演示视频。⚡ 运行此流程的最低推荐 GPU 为 **NVIDIA A6000**，显存 48 GB。 - -#### 示例用法 - -运行以下命令来启动完整生成: - -```bash -python pipeline.py \ - --model_name_t gpt-4.1 \ - --model_name_v gpt-4.1 \ - --model_name_talking hallo2 \ - --result_dir /path/to/output \ - --paper_latex_root /path/to/latex_proj \ - --ref_img /path/to/ref_img.png \ - --ref_audio /path/to/ref_audio.wav \ - --talking_head_env /path/to/hallo2_env \ - --gpu_list [0,1,2,3,4,5,6,7] -``` - -| 参数名 | 类型 | 默认值 | 说明 | -|----------|------|---------|-------------| -| `--model_name_t` | `str` | `gpt-4.1` | 文本大语言模型(LLM) | -| `--model_name_v` | `str` | `gpt-4.1` | 视觉语言模型(VLM) | -| `--model_name_talking` | `str` | `hallo2` | Talking Head 模型。目前仅支持 **hallo2** | -| `--result_dir` | `str` | `/path/to/output` | 输出目录(包括幻灯片、字幕、视频等) | -| `--paper_latex_root` | `str` | `/path/to/latex_proj` | 论文 LaTeX 项目的根目录 | -| `--ref_img` | `str` | `/path/to/ref_img.png` | 参考图像(必须为**正方形**人像) | -| `--ref_audio` | `str` | `/path/to/ref_audio.wav` | 参考音频(建议时长约为 10 秒) | -| `--ref_text` | `str` | `None` | 可选参考文本(用于字幕风格指导) | -| `--beamer_templete_prompt` | 
`str` | `None` | 可选参考文本(用于幻灯片风格指导) | -| `--gpu_list` | `list[int]` | `""` | GPU 列表,用于并行执行(适用于**光标生成**与 **Talking Head 渲染**) | -| `--if_tree_search` | `bool` | `True` | 是否启用树搜索(用于幻灯片布局优化) | -| `--stage` | `str` | `"[0]"` | 需要运行的阶段(例如 `[0]` 表示完整流程,`[1,2,3]` 表示部分阶段) | -| `--talking_head_env` | `str` | `/path/to/hallo2_env` | Talking Head 生成的 Python 环境路径 | ---- - -## 📊 评价指标: Paper2Video -
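The bracketed values passed to `--gpu_list` and `--stage` arrive as plain strings on the command line. Below is a minimal sketch of how such values can be turned into Python lists; the helper name is illustrative and `pipeline.py`'s actual parsing may differ:

```python
import ast

def parse_bracketed_list(flag_value: str) -> list:
    """Parse a CLI value such as "[0,1,2,3]" into a list of ints.

    Illustrative helper, not the repo's actual implementation.
    """
    parsed = ast.literal_eval(flag_value)
    if isinstance(parsed, int):  # tolerate a bare value such as "0"
        return [parsed]
    return [int(x) for x in parsed]

gpu_list = parse_bracketed_list("[0,1,2,3,4,5,6,7]")
stages = parse_bracketed_list("[0]")
print(gpu_list, stages)  # → [0, 1, 2, 3, 4, 5, 6, 7] [0]
```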

- Metrics -

- -与自然视频生成不同，学术演示视频发挥着高度专业化的作用：它们不仅关乎视觉保真度，更关乎**学术交流**。这使得直接应用视频合成中的传统指标(例如 FVD、IS 或基于 CLIP 的相似度)变得困难。相反，它们的价值在于它们如何有效地**传播研究成果**并**提升学术知名度**。从这个角度来看，我们认为，高质量的学术演示视频应该从两个互补的维度进行评判: -#### 对于观众 -- 视频应**忠实传达论文的核心思想**。 -- 视频应**易于不同受众观看**。 - -#### 对于作者 -- 视频应**突出作者的智力贡献和身份**。 -- 视频应**提升作品的知名度和影响力**。 - -为了实现这些目标，我们引入了专门为学术演示视频设计的评估指标：Meta Similarity, PresentArena, PresentQuiz, IP Memory. - -### 运行评价 -- 准备环境: -```bash -cd src/evaluation -conda create -n p2v_e python=3.10 -conda activate p2v_e -pip install -r requirements.txt -``` -- 对于 Meta Similarity 和 PresentArena: -```bash -python MetaSim_audio.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -python MetaSim_content.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -``` -```bash -python PresentArena.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -``` -- 对于**PresentQuiz**，首先基于论文生成问题并使用 Gemini 进行评估: -```bash -cd PresentQuiz -python create_paper_questions.py --paper_folder /path/to/data -python PresentQuiz.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -``` - -- 对于**IP Memory**，首先从生成的视频中生成问题对，然后使用 Gemini 进行评估: -```bash -cd IPMemory -python construct.py -python ip_qa.py -``` -更多详情请查看代码! - -👉 Paper2Video 数据集可在以下网址获取: -[HuggingFace](https://huggingface.co/datasets/ZaynZhu/Paper2Video) - ---- - -## 😼 乐趣: Paper2Video 生成 Paper2Video 演讲视频 -查看 **Paper2Video 生成 Paper2Video 演讲视频**: - -https://github.com/user-attachments/assets/ff58f4d8-8376-4e12-b967-711118adf3c4 - -## 🙏 致谢 - -* 数据集中演示视频的来源是 SlidesLive 和 YouTube。 -* 感谢所有为制作演示视频付出辛勤努力的作者! -* 感谢 [CAMEL](https://github.com/camel-ai/camel) 开源了组织良好的多智能体框架代码库。 -* 感谢 [Hallo2](https://github.com/fudan-generative-vision/hallo2.git) 和 [Paper2Poster](https://github.com/Paper2Poster/Paper2Poster.git) 作者开源代码。 -* 感谢 [Wei Jia](https://github.com/weeadd) 在数据收集和基线实现方面所做的努力。我们也感谢所有参与用户调研的参与者。 -* 感谢所有 **Show Lab @ NUS** 成员的支持! 
- - - ---- - -## 📌 引用 - - -如果我们的工作对您有帮助,欢迎引用我们的工作: - -```bibtex -@misc{paper2video, - title={Paper2Video: Automatic Video Generation from Scientific Papers}, - author={Zeyu Zhu and Kevin Qinghong Lin and Mike Zheng Shou}, - year={2025}, - eprint={2510.05096}, - archivePrefix={arXiv}, - primaryClass={cs.CV}, - url={https://arxiv.org/abs/2510.05096}, -} -``` diff --git a/Paper2Video/README.md b/Paper2Video/README.md deleted file mode 100644 index 258f23247b73bc980a3d0d67dcbcc8cd9d5bc45c..0000000000000000000000000000000000000000 --- a/Paper2Video/README.md +++ /dev/null @@ -1,251 +0,0 @@ -# Paper2Video - -

- English | 简体中文 -

- - -

- Paper2Video: Automatic Video Generation from Scientific Papers -
-从学术论文自动生成演讲视频 -

- -

- Zeyu Zhu*, - Kevin Qinghong Lin*, - Mike Zheng Shou
- Show Lab, National University of Singapore -

- - -

📄 Paper   |   - 🤗 Daily Paper   |   -  📊 Dataset   |   -  🌐 Project Website   |   -  💬 X (Twitter) -

- -- **Input:** a paper ➕ an image ➕ an audio clip - -| Paper | Image | Audio | -|--------|--------|--------| -|
[🔗 Paper link](https://arxiv.org/pdf/1509.01626) |
Hinton's photo|
[🔗 Audio sample](https://github.com/showlab/Paper2Video/blob/page/assets/hinton/ref_audio_10.wav) | - - -- **Output:** a presentation video - - - -https://github.com/user-attachments/assets/39221a9a-48cb-4e20-9d1c-080a5d8379c4 - - - - -Check out more examples at [🌐 project page](https://showlab.github.io/Paper2Video/). - -## 🔥 Update -- [x] [2025.10.11] Our work receives attention on [YC Hacker News](https://news.ycombinator.com/item?id=45553701). -- [x] [2025.10.9] Thanks AK for sharing our work on [Twitter](https://x.com/_akhaliq/status/1976099830004072849)! -- [x] [2025.10.9] Our work is reported by [Medium](https://medium.com/@dataism/how-ai-learned-to-make-scientific-videos-from-slides-to-a-talking-head-0d807e491b27). -- [x] [2025.10.8] Check out our demo video below! -- [x] [2025.10.7] We release the [arxiv paper](https://arxiv.org/abs/2510.05096). -- [x] [2025.10.6] We release the [code](https://github.com/showlab/Paper2Video) and [dataset](https://huggingface.co/datasets/ZaynZhu/Paper2Video). -- [x] [2025.9.28] Paper2Video has been accepted to the **Scaling Environments for Agents Workshop([SEA](https://sea-workshop.github.io/)) at NeurIPS 2025**. - - -https://github.com/user-attachments/assets/a655e3c7-9d76-4c48-b946-1068fdb6cdd9 - - - - ---- - -### Table of Contents -- [🌟 Overview](#-overview) -- [🚀 Quick Start: PaperTalker](#-try-papertalker-for-your-paper-) - - [1. Requirements](#1-requirements) - - [2. Configure LLMs](#2-configure-llms) - - [3. Inference](#3-inference) -- [📊 Evaluation: Paper2Video](#-evaluation-paper2video) -- [😼 Fun: Paper2Video for Paper2Video](#-fun-paper2video-for-paper2video) -- [🙏 Acknowledgements](#-acknowledgements) -- [📌 Citation](#-citation) - ---- - -## 🌟 Overview -

- Overview -

- -This work solves two core problems for academic presentations: - -- **Left: How to create a presentation video from a paper?** - *PaperTalker* — an agent that integrates **slides**, **subtitling**, **cursor grounding**, **speech synthesis**, and **talking-head video rendering**. - -- **Right: How to evaluate a presentation video?** - *Paper2Video* — a benchmark with well-designed metrics to evaluate presentation quality. - - ---- - -## 🚀 Try PaperTalker for your Paper! -

- Approach -

- -### 1. Requirements -Prepare the environment: -```bash -cd src -conda create -n p2v python=3.10 -conda activate p2v -pip install -r requirements.txt -conda install -c conda-forge tectonic -``` -Download the dependent code and follow the instructions in **[Hallo2](https://github.com/fudan-generative-vision/hallo2)** to download the model weights. -```bash -git clone https://github.com/fudan-generative-vision/hallo2.git -``` -You need to **prepare a separate environment for talking-head generation** to avoid potential package conflicts; please refer to Hallo2. After installing, run `which python` to get the Python environment path. -```bash -cd hallo2 -conda create -n hallo python=3.10 -conda activate hallo -pip install -r requirements.txt -``` - -### 2. Configure LLMs -Export your **API credentials**: -```bash -export GEMINI_API_KEY="your_gemini_key_here" -export OPENAI_API_KEY="your_openai_key_here" -``` -The best practice is to use **GPT-4.1** or **Gemini-2.5-Pro** for both the LLM and the VLM. We also support locally deployed open-source models (e.g., Qwen); see Paper2Poster for details. - -### 3. Inference -The script `pipeline.py` provides an automated pipeline for generating academic presentation videos. It takes **LaTeX paper sources** together with a **reference image/audio** as input, and runs through multiple sub-modules (Slides → Subtitles → Speech → Cursor → Talking Head) to produce a complete presentation video. ⚡ The minimum recommended GPU for running this pipeline is an **NVIDIA A6000** with 48 GB of VRAM.
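The pipeline later hands the talking-head stage to the separate `hallo` environment via the interpreter path you recorded with `which python`. A minimal sketch of invoking a snippet under a specific interpreter (function name and snippet are illustrative, not the repo's actual dispatch code):

```python
import subprocess
import sys

def run_in_env(python_bin: str, snippet: str) -> str:
    """Run a short Python snippet under a specific interpreter and return
    its stdout. In practice, python_bin would be the path printed by
    `which python` inside the hallo conda env (--talking_head_env)."""
    proc = subprocess.run(
        [python_bin, "-c", snippet],
        capture_output=True, text=True, check=True,
    )
    return proc.stdout.strip()

# Stand-in demo using the current interpreter:
print(run_in_env(sys.executable, "print('talking-head env reachable')"))
```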
- -#### Example Usage - -Run the following command to launch a full generation: - -```bash -python pipeline.py \ - --model_name_t gpt-4.1 \ - --model_name_v gpt-4.1 \ - --model_name_talking hallo2 \ - --result_dir /path/to/output \ - --paper_latex_root /path/to/latex_proj \ - --ref_img /path/to/ref_img.png \ - --ref_audio /path/to/ref_audio.wav \ - --talking_head_env /path/to/hallo2_env \ - --gpu_list [0,1,2,3,4,5,6,7] -``` - -| Argument | Type | Default | Description | -|----------|------|---------|-------------| -| `--model_name_t` | `str` | `gpt-4.1` | LLM | -| `--model_name_v` | `str` | `gpt-4.1` | VLM | -| `--model_name_talking` | `str` | `hallo2` | Talking Head model. Currently only **hallo2** is supported | -| `--result_dir` | `str` | `/path/to/output` | Output directory (slides, subtitles, videos, etc.) | -| `--paper_latex_root` | `str` | `/path/to/latex_proj` | Root directory of the LaTeX paper project | -| `--ref_img` | `str` | `/path/to/ref_img.png` | Reference image (must be **square** portrait) | -| `--ref_audio` | `str` | `/path/to/ref_audio.wav` | Reference audio (recommended: ~10s) | -| `--ref_text` | `str` | `None` | Optional reference text (for style guidance for subtitles) | -| `--beamer_templete_prompt` | `str` | `None` | Optional reference text (for style guidance for slides) | -| `--gpu_list` | `list[int]` | `""` | GPU list for parallel execution (used in **cursor generation** and **Talking Head rendering**) | -| `--if_tree_search` | `bool` | `True` | Whether to enable tree search for slide layout refinement | -| `--stage` | `str` | `"[0]"` | Pipeline stages to run (e.g., `[0]` full pipeline, `[1,2,3]` partial stages) | -| `--talking_head_env` | `str` | `/path/to/hallo2_env` | python environment path for talking-head generation | ---- - -## 📊 Evaluation: Paper2Video -

- Metrics -

- -Unlike natural video generation, academic presentation videos serve a highly specialized role: they are not merely about visual fidelity but about **communicating scholarship**. This makes it difficult to directly apply conventional metrics from video synthesis (e.g., FVD, IS, or CLIP-based similarity). Instead, their value lies in how well they **disseminate research** and **amplify scholarly visibility**. From this perspective, we argue that a high-quality academic presentation video should be judged along two complementary dimensions: -#### For the Audience -- The video is expected to **faithfully convey the paper’s core ideas**. -- It should remain **accessible to diverse audiences**. - -#### For the Author -- The video should **foreground the authors’ intellectual contribution and identity**. -- It should **enhance the work’s visibility and impact**. - -To capture these goals, we introduce evaluation metrics specifically designed for academic presentation videos: Meta Similarity, PresentArena, PresentQuiz, and IP Memory.
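Each evaluation script stores its per-sample results as a JSON list of `{"data_idx": ..., "score": ...}` records. A small sketch (file name illustrative) of summarizing such a file into a mean and spread:

```python
import json
import statistics
from pathlib import Path

def summarize_scores(score_file: str) -> dict:
    """Aggregate a JSON list of {"data_idx": ..., "score": ...} records,
    as dumped by the evaluation scripts, into summary statistics."""
    records = json.loads(Path(score_file).read_text())
    scores = [r["score"] for r in records if r.get("score") is not None]
    return {
        "n": len(scores),
        "mean": statistics.mean(scores),
        "stdev": statistics.pstdev(scores),
    }

# Demo with a synthetic result file (illustrative name):
Path("audio_sim_demo.json").write_text(
    json.dumps([{"data_idx": 0, "score": 0.8}, {"data_idx": 1, "score": 0.6}])
)
print(summarize_scores("audio_sim_demo.json"))
```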
- -### Run Eval -- Prepare the environment: -```bash -cd src/evaluation -conda create -n p2v_e python=3.10 -conda activate p2v_e -pip install -r requirements.txt -``` -- For Meta Similarity and PresentArena: -```bash -python MetaSim_audio.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -python MetaSim_content.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -``` -```bash -python PresentArena.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -``` -- For **PresentQuiz**, first generate questions from the paper, then evaluate using Gemini: -```bash -cd PresentQuiz -python create_paper_questions.py --paper_folder /path/to/data -python PresentQuiz.py --r /path/to/result_dir --g /path/to/gt_dir --s /path/to/save_dir -``` - -- For **IP Memory**, first generate question pairs from the generated videos, then evaluate using Gemini: -```bash -cd IPMemory -python construct.py -python ip_qa.py -``` -See the code for more details! - -👉 The Paper2Video benchmark is available at: -[HuggingFace](https://huggingface.co/datasets/ZaynZhu/Paper2Video) - ---- - -## 😼 Fun: Paper2Video for Paper2Video -Check out **Paper2Video presenting Paper2Video**: - -https://github.com/user-attachments/assets/ff58f4d8-8376-4e12-b967-711118adf3c4 - -## 🙏 Acknowledgements - -* The sources of the presentation videos are SlidesLive and YouTube. -* We thank all the authors who spent great effort creating presentation videos! -* We thank [CAMEL](https://github.com/camel-ai/camel) for open-sourcing a well-organized multi-agent framework codebase. -* We thank the authors of [Hallo2](https://github.com/fudan-generative-vision/hallo2.git) and [Paper2Poster](https://github.com/Paper2Poster/Paper2Poster.git) for their open-sourced code. -* We thank [Wei Jia](https://github.com/weeadd) for his effort in collecting the data and implementing the baselines. We also thank all the participants involved in the human studies. -* We thank all the **Show Lab @ NUS** members for their support! 
- - - ---- - -## 📌 Citation - - -If you find our work useful, please cite: - -```bibtex -@misc{paper2video, - title={Paper2Video: Automatic Video Generation from Scientific Papers}, - author={Zeyu Zhu and Kevin Qinghong Lin and Mike Zheng Shou}, - year={2025}, - eprint={2510.05096}, - archivePrefix={arXiv}, - primaryClass={cs.CV}, - url={https://arxiv.org/abs/2510.05096}, -} -``` -[![Star History](https://api.star-history.com/svg?repos=showlab/Paper2Video&type=Date)](https://star-history.com/#showlab/Paper2Video&Date) diff --git a/Paper2Video/__init__.py b/Paper2Video/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/__init__.py b/Paper2Video/src/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/IPMemory/construct.py b/Paper2Video/src/evaluation/IPMemory/construct.py deleted file mode 100644 index 7389f8ff8c768e331c2657cbfd1e9a1ad9d0244a..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/IPMemory/construct.py +++ /dev/null @@ -1,69 +0,0 @@ -""" - construct question about Academic IP - input query: 4 video clips from 4 different paper presentation + query (image/audio) - input question: 4 understanding qa from corresponding paper - output task: choose the right question to ask -""" -import os, re -import json -import random -import itertools -from os import path -from typing import List -from pathlib import Path -from tqdm import tqdm - -def generate_combinations(total_num, comb_size): - return list(itertools.combinations(range(total_num), comb_size)) - -def generate_ip_task(vaild_data_name, num_qa_pair): - combs = list(itertools.combinations(range(len(vaild_data_name)), 4)) - combs = random.sample(combs, num_qa_pair) - - qa_list = [] - for comb in combs: - ## questions - question_list = [] - question_index = 
random.randint(1, 50) - for index in comb: - question_path = path.join(vaild_data_name[index][1], "4o-mini_qa.json") - with open(question_path, 'r') as f: question = json.load(f)["understanding"]["questions"] - question_list.append(question["Question {}".format(str(question_index))]["question"]) - ## query - query_list = [] - for index in comb: - ref_img_path = path.join(vaild_data_name[index][1], "ref_img.png") - ref_audio_path = path.join(vaild_data_name[index][1], "ref_audio.wav") - query_list.append((ref_img_path, ref_audio_path)) - ## qa - qa = {} - qa["videos"] = [] - for idx in range(len(comb)): - qa["videos"].append(vaild_data_name[comb[idx]][0]) - - qa["querys"] = query_list - qa["questions"] = question_list - qa_list.append(qa) - with open("ip_qa.json", 'w') as f: json.dump(qa_list, f, indent=4) - -_num_at_start = re.compile(r'^\s*["\']?(\d+)') -def sort_by_leading_number(paths: List[str]) -> List[str]: - def key(p: str): - name = Path(p).name - m = _num_at_start.match(name) - return (int(m.group(1)) if m else float('inf'), name) - return sorted(paths, key=key) - -if __name__ == "__main__": - num_qa_pair = 10 # number of sampled combinations, at most C(num_data, 4) - root_dir = "/path/to/result" - gt_dir = "/path/to/data" - - all_data_name = sort_by_leading_number(os.listdir(root_dir)) - all_groundtruth = sort_by_leading_number(os.listdir(gt_dir)) - vaild_data_name = [] - for data_idx in range(len(all_data_name)): - if path.basename(root_dir) == "paper2video": - video_result_1 = path.join(root_dir, all_data_name[data_idx], "3_merage.mp4") - video_result_2 = path.join(root_dir.replace("paper2video", "presentagent"), all_data_name[data_idx], "result.mp4") - # keep only samples whose generated videos exist for both systems - if path.exists(video_result_1) and path.exists(video_result_2): - vaild_data_name.append((all_data_name[data_idx], path.join(gt_dir, all_groundtruth[data_idx]))) - generate_ip_task(vaild_data_name, num_qa_pair) diff --git a/Paper2Video/src/evaluation/IPMemory/ip_qa.py b/Paper2Video/src/evaluation/IPMemory/ip_qa.py deleted file mode 100644 index 0c3d7a6c5f5b2f4cb4105739edbb3e2fe1a5346a..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/IPMemory/ip_qa.py +++ /dev/null @@ -1,142 
+0,0 @@ -import os -import re -import json -import time -import random -import argparse, pdb -from os import path -import google.generativeai as genai -from moviepy.editor import VideoFileClip -from camel.models import ModelFactory -from camel.types import ModelType, ModelPlatformType -from camel.configs import GeminiConfig -from typing import List -from pathlib import Path - - -genai.configure(api_key="") - -_num_at_start = re.compile(r'^\s*["\']?(\d+)') -def sort_by_leading_number(paths: List[str]) -> List[str]: - def key(p: str): - name = Path(p).name - m = _num_at_start.match(name) - return (int(m.group(1)) if m else float('inf'), name) - return sorted(paths, key=key) -dataset_path = "/path/to/data" -dataset_list = sort_by_leading_number(os.listdir(dataset_path)) - - -def eval_ip(root_path, clip_duration, model_list, prompt_path, question_path, test_type='image'): - tmp_dir = "tmp" - os.makedirs(tmp_dir, exist_ok=True) - gemini_model = genai.GenerativeModel("models/gemini-2.5-pro") - - with open(prompt_path, 'r') as f: prompt = f.readlines() - prompt = "\n".join(prompt) - with open(question_path, 'r') as f: questions = json.load(f) - - result_each_question = [] - for question in questions: - video_ids = question["videos"] - querys = question["querys"] - qs = question["questions"] - - ## get video clips - video_clips_path = {} - for model in model_list: video_clips_path[model] = [] - - start_p2v = None - for vid_id in video_ids: - tmp_dir_id = path.join(tmp_dir, str(vid_id)) - os.makedirs(tmp_dir_id, exist_ok=True) - for model in model_list: - if model == 'p2v': video_path = path.join(root_path, "paper2video", str(vid_id), '3_merage.mp4') - elif model == 'p2v-o': video_path = path.join(root_path, "paper2video_wo_presenter", str(vid_id), 'result.mp4') - elif model == 'veo3': video_path = path.join(root_path, "veo3", str(vid_id)+".mp4") - elif model == 'wan2.2': video_path = path.join(root_path, "wan2.2", str(int(vid_id)-1), "result.mp4") - elif model == 
'presentagent': video_path = path.join(root_path, "presentagent", str(vid_id), "result.mp4") - elif model == 'human-made': video_path = path.join(dataset_path, dataset_list[int(vid_id)-1], "gt_presentation_video.mp4") - - video = VideoFileClip(video_path) - # the two paper2video variants share one clip window so they are compared on the same segment - if model == 'p2v' or model == "p2v-o": - if start_p2v is None: - start_p2v = random.uniform(0, video.duration-clip_duration-1) - start = start_p2v - end = start_p2v + clip_duration - else: - start = random.uniform(0, video.duration-clip_duration-1) - end = start + clip_duration - - clip_save_path = path.join(tmp_dir_id, model+".mp4") - subclip = video.subclip(start, end) - subclip.write_videofile(clip_save_path, codec="libx264", audio_codec="aac") - video_clips_path[model].append(clip_save_path) - ## test for each model, 4 qas - result_each_model = {} - for model in model_list: - video_input = video_clips_path[model] - videos = upload_videos(video_input) - result_each_model[model] = [] - for idx, query in enumerate(querys): - if test_type == 'image': - query = query[0] - query_state = genai.upload_file(path=query, mime_type="image/png") - elif test_type == 'audio': - query = query[1] - query_state = genai.upload_file(path=query, mime_type="audio/wav") - - ori_idxs = [0, 1, 2, 3] - shuffled_idx = ori_idxs.copy() - random.shuffle(shuffled_idx) - mapping = {orig: shuffled for orig, shuffled in zip(ori_idxs, shuffled_idx)} - new_answer = mapping[idx] - new_qs = [qs[mapping[i]] for i in ori_idxs] - - contents = [prompt, "Here is the query", genai.get_file(query_state.name), "Here are the video clips"] - contents.extend(videos) - contents.extend(["Here are the questions"]) - contents.extend(new_qs) - - response = gemini_model.generate_content(contents) - match = re.search(r"My choice:\s*(\d+)", response.text) - choice_num = int(match.group(1)) - 1 if match else -1 # an unparsable judge reply counts as a wrong choice - if choice_num == new_answer: - 
result_each_model[model].append([query, new_qs, choice_num, new_answer, True]) - else: - result_each_model[model].append([query, new_qs, choice_num, new_answer, False]) - result_each_question.append(result_each_model) - print(result_each_question) - with open("ip_qa_result.json", 'w') as f: json.dump(result_each_question, f, indent=4) - -def upload_videos(video_list): - videos = video_list.copy() - for idx, value in enumerate(videos): - videos[idx] = genai.upload_file(path=value, mime_type="video/mp4") - while True: - flag = True - for idx, value in enumerate(videos): - file_state = genai.get_file(videos[idx].name) - if file_state.state.name != "ACTIVE": - flag = False - time.sleep(5) - print(f"waiting 5 seconds...") - break - if flag: break - for idx, value in enumerate(videos): - videos[idx] = genai.get_file(videos[idx].name) - return videos - -if __name__ == "__main__": - clip_duration = 4 - prompt_path = "./prompt/ip_qa.txt" - model_list = ["p2v", "p2v-o", "veo3", "wan2.2", "presentagent", "human-made"] - root_path = "/path/to/result" - question_path = "ip_qa.json" - eval_ip(root_path, clip_duration, model_list, prompt_path, question_path) \ No newline at end of file diff --git a/Paper2Video/src/evaluation/MetaSim_audio.py b/Paper2Video/src/evaluation/MetaSim_audio.py deleted file mode 100644 index 6ef3456099602b22d97e8276a59eba463533b6f4..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/MetaSim_audio.py +++ /dev/null @@ -1,102 +0,0 @@ -import os, re, json -import random -import argparse -import moviepy.editor as mp -from os import path -from pathlib import Path -from typing import List -from pyannote.audio import Audio -from pyannote.audio.pipelines.speaker_verification import PretrainedSpeakerEmbedding -from scipy.spatial.distance import cosine - - -def extract_random_audio_segment(video_path: str, output_wav_path: str, duration: float = 5.0): - print(video_path) - video = mp.VideoFileClip(video_path) - audio = video.audio - - 
total_duration = audio.duration - if duration >= total_duration: start_time = 0 - else: start_time = random.uniform(0, total_duration - duration) - - audio_subclip = audio.subclip(start_time, start_time + duration) - audio_subclip.write_audiofile(output_wav_path, codec='pcm_s16le', fps=16000) - -def compute_speaker_similarity(audio_path_1: str, audio_path_2: str, device: str = "cuda") -> float: - embedding_model = PretrainedSpeakerEmbedding("speechbrain/spkrec-ecapa-voxceleb", device=device) - audio_loader = Audio(sample_rate=16000) - - wav1, _ = audio_loader(audio_path_1) - wav2, _ = audio_loader(audio_path_2) - - wav1 = wav1[0:1].unsqueeze(0) - wav2 = wav2[0:1].unsqueeze(0) - - embedding1 = embedding_model(wav1) - embedding2 = embedding_model(wav2) - embedding1 = embedding1.reshape(embedding1.shape[1]) - embedding2 = embedding2.reshape(embedding2.shape[1]) - - similarity = 1 - cosine(embedding1, embedding2) - return similarity - - -def get_audio_sim_score(gen_video_path, gt_video_path): - extract_random_audio_segment(gen_video_path, gen_video_path.replace('.mp4', '.wav'), duration=5) - extract_random_audio_segment(gt_video_path, gt_video_path.replace('.mp4', '.wav'), duration=5) - similarity = compute_speaker_similarity(gen_video_path.replace('.mp4', '.wav'), - gt_video_path.replace('.mp4', '.wav')) - return similarity - -_num_at_start = re.compile(r'^\s*["\']?(\d+)') -def sort_by_leading_number(paths: List[str]) -> List[str]: - def key(p: str): - name = Path(p).name - m = _num_at_start.match(name) - return (int(m.group(1)) if m else float('inf'), name) - return sorted(paths, key=key) - -if __name__ == "__main__": - parser = argparse.ArgumentParser() - parser.add_argument("-r", "--result_dir", default="/path/to/result_dir") - parser.add_argument("-g", "--gt_dir", default="/path/to/gt_dir") - parser.add_argument("-s", "--save_dir", default="/path/to/save_dir") - args = parser.parse_args() - - ## load exist result if have - save_dir = args.save_dir - save_dir = 
path.join(save_dir, path.basename(args.result_dir)) - save_path = path.join(save_dir, "audio_sim.json") - os.makedirs(save_dir, exist_ok=True) - if path.exists(save_path): - with open(save_path, 'r') as f: audio_similarity_list = json.load(f) - else: audio_similarity_list = [] - - ## path - gt_dir, result_dir = args.gt_dir, args.result_dir - groundtruth_list = sort_by_leading_number([path.join(gt_dir, name) for name in os.listdir(gt_dir)]) - result_list = sort_by_leading_number([path.join(result_dir, name) for name in os.listdir(result_dir)]) - - for index in range(len(audio_similarity_list), 40): - if path.basename(args.result_dir) == "paper2video": - p2v_video_path = path.join(result_list[index], "3_merage.mp4") - elif path.basename(args.result_dir) == "veo3": - p2v_video_path = path.join(result_list[index]) - else: - p2v_video_path = path.join(result_list[index], "result.mp4") - if path.exists(p2v_video_path) is False: continue - gt_video_path = path.join(groundtruth_list[index], "gt_presentation_video.mp4") - if path.exists(gt_video_path) is False: continue - print(p2v_video_path, gt_video_path) - similarity = get_audio_sim_score(p2v_video_path, gt_video_path) - audio_similarity_list.append({ - "data_idx": index, - "score": similarity.item() - }) - print(audio_similarity_list) - with open(save_path, 'w') as f: json.dump(audio_similarity_list, f, indent=4) - - # import numpy as np - # avg = np.average(similarity_all) - # var = np.var(similarity_all) - # print(avg, var) \ No newline at end of file diff --git a/Paper2Video/src/evaluation/MetaSim_content.py b/Paper2Video/src/evaluation/MetaSim_content.py deleted file mode 100644 index f4913e0919a3777a334a9b741fe277efd9048e3a..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/MetaSim_content.py +++ /dev/null @@ -1,144 +0,0 @@ -import os, re, pdb, json -from PIL import Image -import pytesseract - -import whisperx -import argparse -import torch -import numpy as np -from os import path -from 
pathlib import Path -from typing import List -from camel.models import ModelFactory -from camel.types import ModelType, ModelPlatformType -from camel.configs import GeminiConfig - - -os.environ["GEMINI_API_KEY"] = "" -prompt_path = "./prompt/content_sim_score.txt" - -agent_config = { - "model_type": ModelType.GEMINI_2_5_FLASH, - "model_config": GeminiConfig().as_dict(), - "model_platform": ModelPlatformType.GEMINI,} -actor_model = ModelFactory.create( - model_platform=agent_config['model_platform'], - model_type=agent_config['model_type'], - model_config_dict=agent_config['model_config'],) - -def extract_slide_texts(slide_dir): - slide_texts = [] - for fname in sorted(os.listdir(slide_dir)): - if fname.lower().endswith(('.png', '.jpg', '.jpeg')): - img_path = os.path.join(slide_dir, fname) - text = pytesseract.image_to_string(Image.open(img_path)) - slide_texts.append(text.strip()) - return slide_texts - -def load_subtitles(sub_path): - with open(sub_path, "r") as f: - lines = f.readlines() - return [line.strip() for line in lines if line.strip()] - -def build_prompt(slides_1, subs_1, slides_2, subs_2): - prompt = ( - "Human Presentation:\n" - "Slides:\n" + "\n".join(slides_1) + "\n" - "Subtitles:\n" + "\n".join(subs_1) + "\n\n" - "Generated Presentation:\n" - "Slides:\n" + "\n".join(slides_2) + "\n" - "Subtitles:\n" + "\n".join(subs_2) + "\n\n") - return prompt - -def run_similarity_eval(slide_dir_1, slide_dir_2, sub_path_1, sub_path_2): - slides_1 = extract_slide_texts(slide_dir_1) - slides_2 = extract_slide_texts(slide_dir_2) - subs_1 = load_subtitles(sub_path_1) - subs_2 = load_subtitles(sub_path_2) - - with open(prompt_path, 'r') as f: prompt = f.readlines() - prompt = "\n".join(prompt) - prompt_q = build_prompt(slides_1, subs_1, slides_2, subs_2) - prompt = prompt + '\n' + prompt_q - - output = actor_model.run([{"role": "user", "content": prompt}]) - print("=== Similarity Evaluation ===\n") - print(output.choices[0].message.content) - return 
output.choices[0].message.content - -def extract_plain_subtitle_with_whisperx(video_path: str, output_path: str, model_name: str = "large-v3", language: str = "en"): - device = "cuda" if torch.cuda.is_available() else "cpu" - model = whisperx.load_model(model_name, device=device, language=language) - - audio = whisperx.load_audio(video_path) - result = model.transcribe(audio, batch_size=16) - - with open(output_path, "w") as f: - for seg in result["segments"]: - f.write(seg["text"].strip() + "\n") - -def extract_similarity_scores(text): - content_match = re.search(r"Content Similarity:\s*(\d+)/5", text) - if content_match: - content_score = int(content_match.group(1)) - return content_score - -_num_at_start = re.compile(r'^\s*["\']?(\d+)') -def sort_by_leading_number(paths: List[str]) -> List[str]: - def key(p: str): - name = Path(p).name - m = _num_at_start.match(name) - return (int(m.group(1)) if m else float('inf'), name) - return sorted(paths, key=key) - -if __name__ == "__main__": - parser = argparse.ArgumentParser() - parser.add_argument("-r", "--result_dir", default="/path/to/result_dir") - parser.add_argument("-g", "--gt_dir", default="/path/to/gt_dir") - parser.add_argument("-s", "--save_dir", default="/path/to/save_dir") - args = parser.parse_args() - - ## load exist result if have - save_dir = args.save_dir - save_dir = path.join(save_dir, path.basename(args.result_dir)) - save_path = path.join(save_dir, "content_sim.json") - os.makedirs(save_dir, exist_ok=True) - if path.exists(save_path): - with open(save_path, 'r') as f: content_sim_list = json.load(f) - else: content_sim_list = [] - - ## path - gt_dir, result_dir = args.gt_dir, args.result_dir - groundtruth_list = sort_by_leading_number([path.join(gt_dir, name) for name in os.listdir(gt_dir)]) - result_list = sort_by_leading_number([path.join(result_dir, name) for name in os.listdir(result_dir)]) - - ## eval - for index in range(25, 100): - # video -> subtitle - if path.basename(args.result_dir) == 
"paper2video": - p2v_video_path = path.join(result_list[index], "3_merage.mp4") - if path.exists(p2v_video_path) is False: continue - else: - p2v_video_path = path.join(result_list[index], "result.mp4") - if path.exists(p2v_video_path) is False: continue - gt_video_path = path.join(groundtruth_list[index], "gt_presentation_video.mp4") - extract_plain_subtitle_with_whisperx(gt_video_path, gt_video_path.replace(".mp4", "_sub.txt")) - extract_plain_subtitle_with_whisperx(p2v_video_path, p2v_video_path.replace(".mp4", "_sub.txt")) - - # slide dir - gt_slide_dir = path.join(groundtruth_list[index], "slide_imgs") - p2v_slide_dir = path.join(result_list[index], "slide_imgs") - - # eval - result = run_similarity_eval( - slide_dir_1=gt_slide_dir, - slide_dir_2=p2v_slide_dir, - sub_path_1=gt_video_path.replace(".mp4", "_sub.txt"), - sub_path_2=p2v_video_path.replace(".mp4", "_sub.txt")) - content_score = extract_similarity_scores(result) - content_sim_list.append({ - "data_idx": index, - "score": content_score - }) - - with open(save_path, 'w') as f: json.dump(content_sim_list, f) \ No newline at end of file diff --git a/Paper2Video/src/evaluation/PresentArena.py b/Paper2Video/src/evaluation/PresentArena.py deleted file mode 100644 index dfe79f7bbf87880ccae2b50525a8e9fa19b98255..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentArena.py +++ /dev/null @@ -1,106 +0,0 @@ -''' - Using VideoLLM (Gemini) as judger -''' -import os, re, json -import time -import argparse -import google.generativeai as genai -from os import path -from typing import List -from pathlib import Path -from tqdm import tqdm - - -genai.configure(api_key="") -def eval_gemini(gt_vid_path, gen_vid_path): - model = genai.GenerativeModel("models/gemini-2.5-pro") - gt_vid = genai.upload_file(path=gt_vid_path, mime_type="video/mp4") - gen_vid = genai.upload_file(path=gen_vid_path, mime_type="video/mp4") - while True: - refreshed_1 = genai.get_file(gt_vid.name) - refreshed_2 = 
genai.get_file(gen_vid.name) - if refreshed_1.state.name == "ACTIVE" and refreshed_2.state.name == "ACTIVE": break - elif refreshed_1.state.name == "FAILED" or refreshed_2.state.name == "FAILED": - #raise RuntimeError("❌ File processing failed.") - return None - else: - print("waiting 5 seconds...") - time.sleep(5) - - prompt_path = "./prompt/which_is_better.txt" - with open(prompt_path, 'r') as f: prompt = f.read() - print("Sending prompt to Gemini...") - response = model.generate_content([prompt, refreshed_1, refreshed_2]) - print("\n===== Evaluation Result =====") - print(response.text) - print("=============================\n") - - return response.text - -_num_at_start = re.compile(r'^\s*["\']?(\d+)') -def sort_by_leading_number(paths: List[str]) -> List[str]: - def key(p: str): - name = Path(p).name - m = _num_at_start.match(name) - return (int(m.group(1)) if m else float('inf'), name) - return sorted(paths, key=key) - -if __name__ == "__main__": - parser = argparse.ArgumentParser() - parser.add_argument("-r", "--result_dir", default="/path/to/result_dir") - parser.add_argument("-g", "--gt_dir", default="/path/to/gt_dir") - parser.add_argument("-s", "--save_dir", default="/path/to/save_dir") - args = parser.parse_args() - - ## load existing results if present - save_dir = path.join(args.save_dir, path.basename(args.result_dir)) - save_path = path.join(save_dir, "video_arena.json") - os.makedirs(save_dir, exist_ok=True) - if path.exists(save_path): - with open(save_path, 'r') as f: arena_score_list = json.load(f) - else: arena_score_list = [] - - ## path - gt_dir, result_dir = args.gt_dir, args.result_dir - groundtruth_list = sort_by_leading_number([path.join(gt_dir, name) for name in os.listdir(gt_dir)]) - result_list = sort_by_leading_number([path.join(result_dir, name) for name in 
os.listdir(result_dir)]) - - ## Generated vs. GT (1) - for index in tqdm(range(len(result_list))): - if path.basename(args.result_dir) == "paper2video": - test_video_path = path.join(result_list[index], "3_merage.mp4") - elif path.basename(args.result_dir) == 'veo3': - test_video_path = result_list[index] - else: - test_video_path = path.join(result_list[index], "result.mp4") - - if not path.exists(test_video_path): continue - gt_video_path = path.join(groundtruth_list[index], "gt_presentation_video.mp4") - if not path.exists(gt_video_path): - gt_video_path = path.join(groundtruth_list[index], "raw_video.mp4") - if not path.exists(gt_video_path): continue - result = eval_gemini(gt_video_path, test_video_path) - if result is None: continue - - pat = r"\[(?:A|B)\]" - m = re.findall(pat, result, flags=re.I) - score = 0 - if m and m[0][1] == "B": score += 1 - - result = eval_gemini(test_video_path, gt_video_path) - if result is None: continue - - m = re.findall(pat, result, flags=re.I) - if m and m[0][1] == "A": score += 1 - - arena_score_list.append({ - "data_idx": index, - "score": score/2 - }) - with open(save_path, 'w') as f: json.dump(arena_score_list, f, indent=4) diff --git a/Paper2Video/src/evaluation/PresentQuiz/PresentQuiz.py b/Paper2Video/src/evaluation/PresentQuiz/PresentQuiz.py deleted file mode 100644 index 4d3c0647843abe57897c4aee4a79ee5e650deee7..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/PresentQuiz.py +++ /dev/null @@ -1,264 +0,0 @@ -import random -import string -import yaml -import PIL -import tempfile -import io -import argparse -from os import path -from camel.models import ModelFactory -from math import ceil -from openai import OpenAI -from camel.messages import BaseMessage -from utils.src.model_utils import parse_pdf -from urllib.parse import unquote -from copy import deepcopy -from transformers import AutoTokenizer, AutoModelForCausalLM -from pytorch_fid.fid_score import 
compute_statistics_of_path -import pytorch_fid.fid_score as fid -from PIL import Image -from httpx import Timeout -from docling.document_converter import DocumentConverter, PdfFormatOption -import re -import shutil -import pytesseract -from utils.wei_utils import account_token -from camel.types import ModelPlatformType, ModelType -from marker.models import create_model_dict -from camel.configs import ChatGPTConfig -from camel.agents import ChatAgent -from jinja2 import Environment, StrictUndefined -from utils.src.utils import get_json_from_response -from pathlib import Path -from docling_core.types.doc import ImageRefMode, PictureItem, TableItem -from collections import defaultdict -from camel.configs import ChatGPTConfig, QwenConfig, VLLMConfig, OpenRouterConfig, GeminiConfig - -from docling.datamodel.base_models import InputFormat -from docling.datamodel.pipeline_options import PdfPipelineOptions -from docling.document_converter import DocumentConverter, PdfFormatOption - -import math -import base64 -import requests -from io import BytesIO -from PIL import Image - -import torch -import json -import os -import pickle as pkl -import numpy as np -from transformers import AltCLIPProcessor, AltCLIPModel -from pathlib import Path -from typing import List -from moviepy.editor import VideoFileClip - - -os.environ["GEMINI_API_KEY"] = "" - -def compute_accuracy(predicted, ground_truth, aspects): - """ - Parameters - ---------- - predicted : dict - {question: {'answer': , 'reference': ...}, ...} - ground_truth : dict - {question: '. full answer', ...} - aspects : dict - {question: '', ...} - - Returns - ------- - overall_accuracy : float - aspect_summary : dict - { - '': { - 'total': , # questions in this aspect - 'correct': , # correctly answered questions - 'accuracy': # correct / total (0–1) - }, - ... 
- } - """ - correct_global = 0 - total_global = len(ground_truth) - - total_by_aspect = defaultdict(int) - correct_by_aspect = defaultdict(int) - - for q, pred_info in predicted.items(): - letter_pred = pred_info['answer'] - aspect = aspects.get(q, 'Unknown') - total_by_aspect[aspect] += 1 - - if q in ground_truth: - letter_gt = ground_truth[q].split('.')[0].strip() - - if len(letter_pred) > 0: - letter_pred = letter_pred[0].upper() - if letter_pred == letter_gt: - correct_global += 1 - correct_by_aspect[aspect] += 1 - - overall_accuracy = correct_global / total_global if total_global else 0.0 - - # Build the per-aspect dictionary - aspect_summary = {} - for aspect, total in total_by_aspect.items(): - correct = correct_by_aspect[aspect] - acc = correct / total if total else 0.0 - aspect_summary[aspect] = { - 'total': total, - 'correct': correct, - 'accuracy': acc - } - - return overall_accuracy, aspect_summary - -def eval_qa_get_answer(video_input, questions, answers, aspects, agent_config, input_type='video'): - agent_name = f'answer_question_from_{input_type}' - with open(f"prompt/{agent_name}.yaml", "r") as f: config = yaml.safe_load(f) - - actor_model = ModelFactory.create( - model_platform=agent_config['model_platform'], - model_type=agent_config['model_type'], - model_config_dict=agent_config['model_config'],) - - actor_sys_msg = config['system_prompt'] - - actor_agent = ChatAgent(system_message=actor_sys_msg, model=actor_model, message_window_size=None,) - actor_agent.reset() - - jinja_env = Environment(undefined=StrictUndefined) - template = jinja_env.from_string(config["template"]) - with open(video_input, "rb") as f: video_bytes = f.read() - if input_type == 'video': - prompt = template.render(**{'questions': questions,}) - - clip = VideoFileClip(video_input) - duration = clip.duration - msg = BaseMessage.make_user_message( - role_name="User", - content=prompt+"The video length is {}, you should NOT reference the timesteps if it exceeds video 
length".format(str(duration)), - video_bytes=video_bytes, - video_detail="low") - response = actor_agent.step(msg) - agent_answers = get_json_from_response(response.msgs[0].content) - - input_token, output_token = account_token(response) - accuracy, aspect_accuracy = compute_accuracy(agent_answers, answers, aspects) - return accuracy, aspect_accuracy, agent_answers, input_token, output_token - -def run_qa_metric(question_path, video_path, result_path, test_model): - if test_model == "gemini": - agent_config = { - "model_type": ModelType.GEMINI_2_5_FLASH, - "model_config": GeminiConfig().as_dict(), - "model_platform": ModelPlatformType.GEMINI, - } - overall_qa_result = {"qa_result": {}} - - qa_dict = json.load(open(question_path, 'r')) - detail_qa, understanding_qa = qa_dict['detail'], qa_dict['understanding'] - input_token_all, output_token_all =0, 0 - detail_accuracy, detail_aspect_accuracy, detail_agent_answers, input_token, output_token = eval_qa_get_answer( - video_input=video_path, - questions=detail_qa['questions'], - answers=detail_qa['answers'], - aspects=detail_qa['aspects'], - agent_config=agent_config, - input_type='video') - input_token_all += input_token - output_token_all += output_token - understanding_accuracy, understanding_aspect_accuracy, understanding_agent_answers, input_token, output_token = eval_qa_get_answer( - video_input=video_path, - questions=understanding_qa['questions'], - answers=understanding_qa['answers'], - aspects=understanding_qa['aspects'], - agent_config=agent_config, - input_type='video') - input_token_all += input_token - output_token_all += output_token - overall_qa_result['qa_result'][test_model] = { - 'detail_accuracy': detail_accuracy, - 'detail_aspect_accuracy': detail_aspect_accuracy, - 'detail_agent_answers': detail_agent_answers, - 'understanding_accuracy': understanding_accuracy, - 'understanding_aspect_accuracy': understanding_aspect_accuracy, - 'understanding_agent_answers': understanding_agent_answers} - 
all_models_in_file = list(overall_qa_result['qa_result'].keys()) - detail_accs = [] - understanding_accs = [] - for m in all_models_in_file: - detail_accs.append(overall_qa_result['qa_result'][m]['detail_accuracy']) - understanding_accs.append(overall_qa_result['qa_result'][m]['understanding_accuracy']) - - avg_detail_accuracy = float(np.mean(detail_accs)) if detail_accs else 0.0 - avg_understanding_accuracy = float(np.mean(understanding_accs)) if understanding_accs else 0.0 - - overall_qa_result['avg_detail_accuracy'] = avg_detail_accuracy - overall_qa_result['avg_understanding_accuracy'] = avg_understanding_accuracy - - # Finally, overwrite the same JSON file with the updated results - with open(result_path, 'w') as f: json.dump(overall_qa_result, f, indent=4) - print(detail_accuracy, detail_aspect_accuracy, detail_agent_answers, input_token, output_token) - -_num_at_start = re.compile(r'^\s*["\']?(\d+)') -def sort_by_leading_number(paths: List[str]) -> List[str]: - def key(p: str): - name = Path(p).name - m = _num_at_start.match(name) - return (int(m.group(1)) if m else float('inf'), name) - return sorted(paths, key=key) - -if __name__ == "__main__": - parser = argparse.ArgumentParser() - parser.add_argument("-r", "--result_dir", default="/path/to/result") - parser.add_argument("-g", "--data_dir", default="/path/to/data") - parser.add_argument("-s", "--save_dir", default="/path/to/data") - args = parser.parse_args() - ## mkdirs - save_dir = path.join(args.save_dir, path.basename(args.result_dir)) - - save_path = path.join(save_dir, "qa_result") - os.makedirs(save_dir, exist_ok=True) - os.makedirs(save_path, exist_ok=True) - - ## run test - gt_dir, result_dir = args.data_dir, args.result_dir - groundtruth_list = sort_by_leading_number([path.join(gt_dir, name) for name in os.listdir(gt_dir)]) - if 
path.basename(args.result_dir) == "human_made": result_list = [] # from dataset - else: result_list = sort_by_leading_number([path.join(result_dir, name) for name in os.listdir(result_dir)]) - - start, end = 1, 100 - without_presenter_flag = False # assumed default: evaluate the full video with presenter; set True for the slides-only render - for index in range(start, end): - qa_json_path = path.join(groundtruth_list[index], "4o-mini_qa.json") - - ## paper2video - if path.basename(args.result_dir) == 'paper2video': - if not without_presenter_flag: - test_video_path = path.join(result_list[index], "3_merage.mp4") - else: - test_video_path = path.join(result_list[index], "1_merage.mp4") - if not path.exists(test_video_path): continue - ## human made as baseline - elif path.basename(args.result_dir) == 'human_made': - test_video_path = path.join(groundtruth_list[index], "gt_presentation_video.mp4") - if not path.exists(test_video_path): - test_video_path = path.join(groundtruth_list[index], "raw_video.mp4") - ## veo3 - elif path.basename(args.result_dir) == 'veo3': - test_video_path = result_list[index] - elif path.basename(args.result_dir) == 'wan2.1': - test_video_path = path.join(result_list[index], "result.mp4") - ## presentagent - else: - test_video_path = path.join(result_list[index], "result.mp4") - if not path.exists(test_video_path): continue - result_save_path = path.join(save_path, "qa_result_{}.json".format(index)) - print("start") - run_qa_metric(qa_json_path, test_video_path, result_save_path, 'gemini') \ No newline at end of file diff --git a/Paper2Video/src/evaluation/PresentQuiz/create_paper_questions.py b/Paper2Video/src/evaluation/PresentQuiz/create_paper_questions.py deleted file mode 100644 index 487793657d599beea560ac4bb89cf1000c64ecee..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/create_paper_questions.py +++ /dev/null @@ -1,47 +0,0 @@ -from utils.poster_eval_utils import * -import argparse -import os -import json - - -os.environ["OPENAI_API_KEY"] = "" - - -if __name__ == '__main__': - parser = 
argparse.ArgumentParser() - parser.add_argument('--paper_folder', type=str, default="path/to/data") - parser.add_argument('--model_name', type=str, default='4o') - args = parser.parse_args() - - paper_text = get_poster_text(os.path.join(args.paper_folder, 'pdf', 'paper.pdf')) - - if args.model_name == '4o': - model_type = ModelType.GPT_4O - elif args.model_name == 'o3': - model_type = ModelType.O3 - elif args.model_name == 'gemini': - model_type = ModelType.GEMINI_2_5_PRO - - detail_qa = get_questions(paper_text, 'detail', model_type) - understanding_qa = get_questions(paper_text, 'understanding', model_type) - - detail_q, detail_a, detail_aspects = get_answers_and_remove_answers(detail_qa) - understanding_q, understanding_a, understanding_aspects = get_answers_and_remove_answers(understanding_qa) - - final_qa = {} - detail_qa = { - 'questions': detail_q, - 'answers': detail_a, - 'aspects': detail_aspects, - } - - understanding_qa = { - 'questions': understanding_q, - 'answers': understanding_a, - 'aspects': understanding_aspects, - } - final_qa['detail'] = detail_qa - final_qa['understanding'] = understanding_qa - - with open(os.path.join(args.paper_folder, f'{args.model_name}_qa.json'), 'w') as f: - json.dump(final_qa, f, indent=4) \ No newline at end of file diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/abstract_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/abstract_backend.py deleted file mode 100644 index 
491330b36f71c364fe96695fcfaa3ab752bac1e2..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/abstract_backend.py +++ /dev/null @@ -1,63 +0,0 @@ -from abc import ABC, abstractmethod -from io import BytesIO -from pathlib import Path -from typing import TYPE_CHECKING, Set, Union - -from docling_core.types.doc import DoclingDocument - -if TYPE_CHECKING: - from docling.datamodel.base_models import InputFormat - from docling.datamodel.document import InputDocument - - -class AbstractDocumentBackend(ABC): - @abstractmethod - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - self.file = in_doc.file - self.path_or_stream = path_or_stream - self.document_hash = in_doc.document_hash - self.input_format = in_doc.format - - @abstractmethod - def is_valid(self) -> bool: - pass - - @classmethod - @abstractmethod - def supports_pagination(cls) -> bool: - pass - - def unload(self): - if isinstance(self.path_or_stream, BytesIO): - self.path_or_stream.close() - - self.path_or_stream = None - - @classmethod - @abstractmethod - def supported_formats(cls) -> Set["InputFormat"]: - pass - - -class PaginatedDocumentBackend(AbstractDocumentBackend): - """DeclarativeDocumentBackend. - - A declarative document backend is a backend that can transform to DoclingDocument - straight without a recognition pipeline. - """ - - @abstractmethod - def page_count(self) -> int: - pass - - -class DeclarativeDocumentBackend(AbstractDocumentBackend): - """DeclarativeDocumentBackend. - - A declarative document backend is a backend that can transform to DoclingDocument - straight without a recognition pipeline. 
- """ - - @abstractmethod - def convert(self) -> DoclingDocument: - pass diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/asciidoc_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/asciidoc_backend.py deleted file mode 100644 index 397bfc44b91666c24ee38b3191978698a923d0c3..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/asciidoc_backend.py +++ /dev/null @@ -1,430 +0,0 @@ -import logging -import re -from io import BytesIO -from pathlib import Path -from typing import Set, Union - -from docling_core.types.doc import ( - DocItemLabel, - DoclingDocument, - DocumentOrigin, - GroupItem, - GroupLabel, - ImageRef, - Size, - TableCell, - TableData, -) - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class AsciiDocBackend(DeclarativeDocumentBackend): - def __init__(self, in_doc: InputDocument, path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - - self.path_or_stream = path_or_stream - - try: - if isinstance(self.path_or_stream, BytesIO): - text_stream = self.path_or_stream.getvalue().decode("utf-8") - self.lines = text_stream.split("\n") - if isinstance(self.path_or_stream, Path): - with open(self.path_or_stream, "r", encoding="utf-8") as f: - self.lines = f.readlines() - self.valid = True - - except Exception as e: - raise RuntimeError( - f"Could not initialize AsciiDoc backend for file with hash {self.document_hash}." 
- ) from e - return - - def is_valid(self) -> bool: - return self.valid - - @classmethod - def supports_pagination(cls) -> bool: - return False - - def unload(self): - return - - @classmethod - def supported_formats(cls) -> Set[InputFormat]: - return {InputFormat.ASCIIDOC} - - def convert(self) -> DoclingDocument: - """ - Parses the ASCII into a structured document model. - """ - - origin = DocumentOrigin( - filename=self.file.name or "file", - mimetype="text/asciidoc", - binary_hash=self.document_hash, - ) - - doc = DoclingDocument(name=self.file.stem or "file", origin=origin) - - doc = self._parse(doc) - - return doc - - def _parse(self, doc: DoclingDocument): - """ - Main function that orchestrates the parsing by yielding components: - title, section headers, text, lists, and tables. - """ - - content = "" - - in_list = False - in_table = False - - text_data: list[str] = [] - table_data: list[str] = [] - caption_data: list[str] = [] - - # parents: dict[int, Union[DocItem, GroupItem, None]] = {} - parents: dict[int, Union[GroupItem, None]] = {} - # indents: dict[int, Union[DocItem, GroupItem, None]] = {} - indents: dict[int, Union[GroupItem, None]] = {} - - for i in range(0, 10): - parents[i] = None - indents[i] = None - - for line in self.lines: - # line = line.strip() - - # Title - if self._is_title(line): - item = self._parse_title(line) - level = item["level"] - - parents[level] = doc.add_text( - text=item["text"], label=DocItemLabel.TITLE - ) - - # Section headers - elif self._is_section_header(line): - item = self._parse_section_header(line) - level = item["level"] - - parents[level] = doc.add_heading( - text=item["text"], level=item["level"], parent=parents[level - 1] - ) - for k, v in parents.items(): - if k > level: - parents[k] = None - - # Lists - elif self._is_list_item(line): - - _log.debug(f"line: {line}") - item = self._parse_list_item(line) - _log.debug(f"parsed list-item: {item}") - - level = self._get_current_level(parents) - - if not in_list: - 
in_list = True - - parents[level + 1] = doc.add_group( - parent=parents[level], name="list", label=GroupLabel.LIST - ) - indents[level + 1] = item["indent"] - - elif in_list and item["indent"] > indents[level]: - parents[level + 1] = doc.add_group( - parent=parents[level], name="list", label=GroupLabel.LIST - ) - indents[level + 1] = item["indent"] - - elif in_list and item["indent"] < indents[level]: - - # print(item["indent"], " => ", indents[level]) - while item["indent"] < indents[level]: - # print(item["indent"], " => ", indents[level]) - parents[level] = None - indents[level] = None - level -= 1 - - doc.add_list_item( - item["text"], parent=self._get_current_parent(parents) - ) - - elif in_list and not self._is_list_item(line): - in_list = False - - level = self._get_current_level(parents) - parents[level] = None - - # Tables - elif line.strip() == "|===" and not in_table: # start of table - in_table = True - - elif self._is_table_line(line): # within a table - in_table = True - table_data.append(self._parse_table_line(line)) - - elif in_table and ( - (not self._is_table_line(line)) or line.strip() == "|===" - ): # end of table - - caption = None - if len(caption_data) > 0: - caption = doc.add_text( - text=" ".join(caption_data), label=DocItemLabel.CAPTION - ) - - caption_data = [] - - data = self._populate_table_as_grid(table_data) - doc.add_table( - data=data, parent=self._get_current_parent(parents), caption=caption - ) - - in_table = False - table_data = [] - - # Picture - elif self._is_picture(line): - - caption = None - if len(caption_data) > 0: - caption = doc.add_text( - text=" ".join(caption_data), label=DocItemLabel.CAPTION - ) - - caption_data = [] - - item = self._parse_picture(line) - - size = None - if "width" in item and "height" in item: - size = Size(width=int(item["width"]), height=int(item["height"])) - - uri = None - if ( - "uri" in item - and not item["uri"].startswith("http") - and item["uri"].startswith("//") - ): - uri = "file:" + 
item["uri"] - elif ( - "uri" in item - and not item["uri"].startswith("http") - and item["uri"].startswith("/") - ): - uri = "file:/" + item["uri"] - elif "uri" in item and not item["uri"].startswith("http"): - uri = "file://" + item["uri"] - - image = ImageRef(mimetype="image/png", size=size, dpi=70, uri=uri) - doc.add_picture(image=image, caption=caption) - - # Caption - elif self._is_caption(line) and len(caption_data) == 0: - item = self._parse_caption(line) - caption_data.append(item["text"]) - - elif ( - len(line.strip()) > 0 and len(caption_data) > 0 - ): # allow multiline captions - item = self._parse_text(line) - caption_data.append(item["text"]) - - # Plain text - elif len(line.strip()) == 0 and len(text_data) > 0: - doc.add_text( - text=" ".join(text_data), - label=DocItemLabel.PARAGRAPH, - parent=self._get_current_parent(parents), - ) - text_data = [] - - elif len(line.strip()) > 0: # allow multiline texts - - item = self._parse_text(line) - text_data.append(item["text"]) - - if len(text_data) > 0: - doc.add_text( - text=" ".join(text_data), - label=DocItemLabel.PARAGRAPH, - parent=self._get_current_parent(parents), - ) - text_data = [] - - if in_table and len(table_data) > 0: - data = self._populate_table_as_grid(table_data) - doc.add_table(data=data, parent=self._get_current_parent(parents)) - - in_table = False - table_data = [] - - return doc - - def _get_current_level(self, parents): - for k, v in parents.items(): - if v == None and k > 0: - return k - 1 - - return 0 - - def _get_current_parent(self, parents): - for k, v in parents.items(): - if v == None and k > 0: - return parents[k - 1] - - return None - - # ========= Title - def _is_title(self, line): - return re.match(r"^= ", line) - - def _parse_title(self, line): - return {"type": "title", "text": line[2:].strip(), "level": 0} - - # ========= Section headers - def _is_section_header(self, line): - return re.match(r"^==+", line) - - def _parse_section_header(self, line): - match = 
re.match(r"^(=+)\s+(.*)", line) - - marker = match.group(1) # The list marker (e.g., "*", "-", "1.") - text = match.group(2) # The actual text of the list item - - header_level = marker.count("=") # number of '=' represents level - return { - "type": "header", - "level": header_level - 1, - "text": text.strip(), - } - - # ========= Lists - def _is_list_item(self, line): - return re.match(r"^(\s)*(\*|-|\d+\.|\w+\.) ", line) - - def _parse_list_item(self, line): - """Extract the item marker (number or bullet symbol) and the text of the item.""" - - match = re.match(r"^(\s*)(\*|-|\d+\.)\s+(.*)", line) - if match: - indent = match.group(1) - marker = match.group(2) # The list marker (e.g., "*", "-", "1.") - text = match.group(3) # The actual text of the list item - - if marker == "*" or marker == "-": - return { - "type": "list_item", - "marker": marker, - "text": text.strip(), - "numbered": False, - "indent": 0 if indent == None else len(indent), - } - else: - return { - "type": "list_item", - "marker": marker, - "text": text.strip(), - "numbered": True, - "indent": 0 if indent == None else len(indent), - } - else: - # Fallback if no match - return { - "type": "list_item", - "marker": "-", - "text": line, - "numbered": False, - "indent": 0, - } - - # ========= Tables - def _is_table_line(self, line): - return re.match(r"^\|.*\|", line) - - def _parse_table_line(self, line): - # Split table cells and trim extra spaces - return [cell.strip() for cell in line.split("|") if cell.strip()] - - def _populate_table_as_grid(self, table_data): - - num_rows = len(table_data) - - # Adjust the table data into a grid format - num_cols = max(len(row) for row in table_data) - - data = TableData(num_rows=num_rows, num_cols=num_cols, table_cells=[]) - for row_idx, row in enumerate(table_data): - # Pad rows with empty strings to match column count - # grid.append(row + [''] * (max_cols - len(row))) - - for col_idx, text in enumerate(row): - row_span = 1 - col_span = 1 - - cell = 
TableCell( - text=text, - row_span=row_span, - col_span=col_span, - start_row_offset_idx=row_idx, - end_row_offset_idx=row_idx + row_span, - start_col_offset_idx=col_idx, - end_col_offset_idx=col_idx + col_span, - col_header=False, - row_header=False, - ) - data.table_cells.append(cell) - - return data - - # ========= Pictures - def _is_picture(self, line): - return re.match(r"^image::", line) - - def _parse_picture(self, line): - """ - Parse an image macro, extracting its path and attributes. - Syntax: image::path/to/image.png[Alt Text, width=200, height=150, align=center] - """ - mtch = re.match(r"^image::(.+)\[(.*)\]$", line) - if mtch: - picture_path = mtch.group(1).strip() - attributes = mtch.group(2).split(",") - picture_info = {"type": "picture", "uri": picture_path} - - # Extract optional attributes (alt text, width, height, alignment) - if attributes: - picture_info["alt"] = attributes[0].strip() if attributes[0] else "" - for attr in attributes[1:]: - key, value = attr.split("=") - picture_info[key.strip()] = value.strip() - - return picture_info - - return {"type": "picture", "uri": line} - - # ========= Captions - def _is_caption(self, line): - return re.match(r"^\.(.+)", line) - - def _parse_caption(self, line): - mtch = re.match(r"^\.(.+)", line) - if mtch: - text = mtch.group(1) - return {"type": "caption", "text": text} - - return {"type": "caption", "text": ""} - - # ========= Plain text - def _parse_text(self, line): - return {"type": "text", "text": line.strip()} diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/docling_parse_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/docling_parse_backend.py deleted file mode 100644 index 6d22127bbfcb129791cf43fbfa749e0e437ff58a..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/docling_parse_backend.py +++ /dev/null @@ -1,227 +0,0 @@ -import logging -import random -from io import BytesIO -from pathlib import Path 
-from typing import Iterable, List, Optional, Union - -import pypdfium2 as pdfium -from docling_core.types.doc import BoundingBox, CoordOrigin, Size -from docling_parse.pdf_parsers import pdf_parser_v1 -from PIL import Image, ImageDraw -from pypdfium2 import PdfPage - -from docling.backend.pdf_backend import PdfDocumentBackend, PdfPageBackend -from docling.datamodel.base_models import Cell -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class DoclingParsePageBackend(PdfPageBackend): - def __init__( - self, parser: pdf_parser_v1, document_hash: str, page_no: int, page_obj: PdfPage - ): - self._ppage = page_obj - parsed_page = parser.parse_pdf_from_key_on_page(document_hash, page_no) - - self.valid = "pages" in parsed_page - if self.valid: - self._dpage = parsed_page["pages"][0] - else: - _log.info( - f"An error occurred when loading page {page_no} of document {document_hash}." - ) - - def is_valid(self) -> bool: - return self.valid - - def get_text_in_rect(self, bbox: BoundingBox) -> str: - if not self.valid: - return "" - # Find intersecting cells on the page - text_piece = "" - page_size = self.get_size() - parser_width = self._dpage["width"] - parser_height = self._dpage["height"] - - scale = ( - 1 # FIX - Replace with param in get_text_in_rect across backends (optional) - ) - - for i in range(len(self._dpage["cells"])): - rect = self._dpage["cells"][i]["box"]["device"] - x0, y0, x1, y1 = rect - cell_bbox = BoundingBox( - l=x0 * scale * page_size.width / parser_width, - b=y0 * scale * page_size.height / parser_height, - r=x1 * scale * page_size.width / parser_width, - t=y1 * scale * page_size.height / parser_height, - coord_origin=CoordOrigin.BOTTOMLEFT, - ).to_top_left_origin(page_height=page_size.height * scale) - - overlap_frac = cell_bbox.intersection_area_with(bbox) / cell_bbox.area() - - if overlap_frac > 0.5: - if len(text_piece) > 0: - text_piece += " " - text_piece += 
self._dpage["cells"][i]["content"]["rnormalized"] - - return text_piece - - def get_text_cells(self) -> Iterable[Cell]: - cells: List[Cell] = [] - cell_counter = 0 - - if not self.valid: - return cells - - page_size = self.get_size() - - parser_width = self._dpage["width"] - parser_height = self._dpage["height"] - - for i in range(len(self._dpage["cells"])): - rect = self._dpage["cells"][i]["box"]["device"] - x0, y0, x1, y1 = rect - - if x1 < x0: - x0, x1 = x1, x0 - if y1 < y0: - y0, y1 = y1, y0 - - text_piece = self._dpage["cells"][i]["content"]["rnormalized"] - cells.append( - Cell( - id=cell_counter, - text=text_piece, - bbox=BoundingBox( - # l=x0, b=y0, r=x1, t=y1, - l=x0 * page_size.width / parser_width, - b=y0 * page_size.height / parser_height, - r=x1 * page_size.width / parser_width, - t=y1 * page_size.height / parser_height, - coord_origin=CoordOrigin.BOTTOMLEFT, - ).to_top_left_origin(page_size.height), - ) - ) - cell_counter += 1 - - def draw_clusters_and_cells(): - image = ( - self.get_page_image() - ) # make new image to avoid drawing on the saved ones - draw = ImageDraw.Draw(image) - for c in cells: - x0, y0, x1, y1 = c.bbox.as_tuple() - cell_color = ( - random.randint(30, 140), - random.randint(30, 140), - random.randint(30, 140), - ) - draw.rectangle([(x0, y0), (x1, y1)], outline=cell_color) - image.show() - - # before merge: - # draw_clusters_and_cells() - - # cells = merge_horizontal_cells(cells) - - # after merge: - # draw_clusters_and_cells() - - return cells - - def get_bitmap_rects(self, scale: float = 1) -> Iterable[BoundingBox]: - AREA_THRESHOLD = 0 # 32 * 32 - - for i in range(len(self._dpage["images"])): - bitmap = self._dpage["images"][i] - cropbox = BoundingBox.from_tuple( - bitmap["box"], origin=CoordOrigin.BOTTOMLEFT - ).to_top_left_origin(self.get_size().height) - - if cropbox.area() > AREA_THRESHOLD: - cropbox = cropbox.scaled(scale=scale) - - yield cropbox - - def get_page_image( - self, scale: float = 1, cropbox: 
Optional[BoundingBox] = None - ) -> Image.Image: - - page_size = self.get_size() - - if not cropbox: - cropbox = BoundingBox( - l=0, - r=page_size.width, - t=0, - b=page_size.height, - coord_origin=CoordOrigin.TOPLEFT, - ) - padbox = BoundingBox( - l=0, r=0, t=0, b=0, coord_origin=CoordOrigin.BOTTOMLEFT - ) - else: - padbox = cropbox.to_bottom_left_origin(page_size.height).model_copy() - padbox.r = page_size.width - padbox.r - padbox.t = page_size.height - padbox.t - - image = ( - self._ppage.render( - scale=scale * 1.5, - rotation=0, # no additional rotation - crop=padbox.as_tuple(), - ) - .to_pil() - .resize(size=(round(cropbox.width * scale), round(cropbox.height * scale))) - ) # We resize the image from 1.5x the given scale to make it sharper. - - return image - - def get_size(self) -> Size: - return Size(width=self._ppage.get_width(), height=self._ppage.get_height()) - - def unload(self): - self._ppage = None - self._dpage = None - - -class DoclingParseDocumentBackend(PdfDocumentBackend): - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - - self._pdoc = pdfium.PdfDocument(self.path_or_stream) - self.parser = pdf_parser_v1() - - success = False - if isinstance(self.path_or_stream, BytesIO): - success = self.parser.load_document_from_bytesio( - self.document_hash, self.path_or_stream - ) - elif isinstance(self.path_or_stream, Path): - success = self.parser.load_document( - self.document_hash, str(self.path_or_stream) - ) - - if not success: - raise RuntimeError( - f"docling-parse could not load document with hash {self.document_hash}." 
- ) - - def page_count(self) -> int: - return len(self._pdoc) # To be replaced with docling-parse API - - def load_page(self, page_no: int) -> DoclingParsePageBackend: - return DoclingParsePageBackend( - self.parser, self.document_hash, page_no, self._pdoc[page_no] - ) - - def is_valid(self) -> bool: - return self.page_count() > 0 - - def unload(self): - super().unload() - self.parser.unload_document(self.document_hash) - self._pdoc.close() - self._pdoc = None diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/docling_parse_v2_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/docling_parse_v2_backend.py deleted file mode 100644 index 27a368f92e11a26041a701012b96f875544385f0..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/docling_parse_v2_backend.py +++ /dev/null @@ -1,250 +0,0 @@ -import logging -import random -from io import BytesIO -from pathlib import Path -from typing import TYPE_CHECKING, Iterable, List, Optional, Union - -import pypdfium2 as pdfium -from docling_core.types.doc import BoundingBox, CoordOrigin -from docling_parse.pdf_parsers import pdf_parser_v2 -from PIL import Image, ImageDraw -from pypdfium2 import PdfPage - -from docling.backend.pdf_backend import PdfDocumentBackend, PdfPageBackend -from docling.datamodel.base_models import Cell, Size - -if TYPE_CHECKING: - from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class DoclingParseV2PageBackend(PdfPageBackend): - def __init__( - self, parser: pdf_parser_v2, document_hash: str, page_no: int, page_obj: PdfPage - ): - self._ppage = page_obj - parsed_page = parser.parse_pdf_from_key_on_page(document_hash, page_no) - - self.valid = "pages" in parsed_page and len(parsed_page["pages"]) == 1 - if self.valid: - self._dpage = parsed_page["pages"][0] - else: - _log.info( - f"An error occurred when loading page {page_no} of document {document_hash}." 
- ) - - def is_valid(self) -> bool: - return self.valid - - def get_text_in_rect(self, bbox: BoundingBox) -> str: - if not self.valid: - return "" - # Find intersecting cells on the page - text_piece = "" - page_size = self.get_size() - - parser_width = self._dpage["sanitized"]["dimension"]["width"] - parser_height = self._dpage["sanitized"]["dimension"]["height"] - - scale = ( - 1 # FIX - Replace with param in get_text_in_rect across backends (optional) - ) - - cells_data = self._dpage["sanitized"]["cells"]["data"] - cells_header = self._dpage["sanitized"]["cells"]["header"] - - for i, cell_data in enumerate(cells_data): - x0 = cell_data[cells_header.index("x0")] - y0 = cell_data[cells_header.index("y0")] - x1 = cell_data[cells_header.index("x1")] - y1 = cell_data[cells_header.index("y1")] - - cell_bbox = BoundingBox( - l=x0 * scale * page_size.width / parser_width, - b=y0 * scale * page_size.height / parser_height, - r=x1 * scale * page_size.width / parser_width, - t=y1 * scale * page_size.height / parser_height, - coord_origin=CoordOrigin.BOTTOMLEFT, - ).to_top_left_origin(page_height=page_size.height * scale) - - overlap_frac = cell_bbox.intersection_area_with(bbox) / cell_bbox.area() - - if overlap_frac > 0.5: - if len(text_piece) > 0: - text_piece += " " - text_piece += cell_data[cells_header.index("text")] - - return text_piece - - def get_text_cells(self) -> Iterable[Cell]: - cells: List[Cell] = [] - cell_counter = 0 - - if not self.valid: - return cells - - page_size = self.get_size() - - parser_width = self._dpage["sanitized"]["dimension"]["width"] - parser_height = self._dpage["sanitized"]["dimension"]["height"] - - cells_data = self._dpage["sanitized"]["cells"]["data"] - cells_header = self._dpage["sanitized"]["cells"]["header"] - - for i, cell_data in enumerate(cells_data): - x0 = cell_data[cells_header.index("x0")] - y0 = cell_data[cells_header.index("y0")] - x1 = cell_data[cells_header.index("x1")] - y1 = cell_data[cells_header.index("y1")] - - if x1 
< x0: - x0, x1 = x1, x0 - if y1 < y0: - y0, y1 = y1, y0 - - text_piece = cell_data[cells_header.index("text")] - cells.append( - Cell( - id=cell_counter, - text=text_piece, - bbox=BoundingBox( - # l=x0, b=y0, r=x1, t=y1, - l=x0 * page_size.width / parser_width, - b=y0 * page_size.height / parser_height, - r=x1 * page_size.width / parser_width, - t=y1 * page_size.height / parser_height, - coord_origin=CoordOrigin.BOTTOMLEFT, - ).to_top_left_origin(page_size.height), - ) - ) - cell_counter += 1 - - def draw_clusters_and_cells(): - image = ( - self.get_page_image() - ) # make new image to avoid drawing on the saved ones - draw = ImageDraw.Draw(image) - for c in cells: - x0, y0, x1, y1 = c.bbox.as_tuple() - cell_color = ( - random.randint(30, 140), - random.randint(30, 140), - random.randint(30, 140), - ) - draw.rectangle([(x0, y0), (x1, y1)], outline=cell_color) - image.show() - - # draw_clusters_and_cells() - - return cells - - def get_bitmap_rects(self, scale: float = 1) -> Iterable[BoundingBox]: - AREA_THRESHOLD = 0 # 32 * 32 - - images = self._dpage["sanitized"]["images"]["data"] - images_header = self._dpage["sanitized"]["images"]["header"] - - for row in images: - x0 = row[images_header.index("x0")] - y0 = row[images_header.index("y0")] - x1 = row[images_header.index("x1")] - y1 = row[images_header.index("y1")] - - cropbox = BoundingBox.from_tuple( - (x0, y0, x1, y1), origin=CoordOrigin.BOTTOMLEFT - ).to_top_left_origin(self.get_size().height) - - if cropbox.area() > AREA_THRESHOLD: - cropbox = cropbox.scaled(scale=scale) - - yield cropbox - - def get_page_image( - self, scale: float = 1, cropbox: Optional[BoundingBox] = None - ) -> Image.Image: - - page_size = self.get_size() - - if not cropbox: - cropbox = BoundingBox( - l=0, - r=page_size.width, - t=0, - b=page_size.height, - coord_origin=CoordOrigin.TOPLEFT, - ) - padbox = BoundingBox( - l=0, r=0, t=0, b=0, coord_origin=CoordOrigin.BOTTOMLEFT - ) - else: - padbox = 
cropbox.to_bottom_left_origin(page_size.height).model_copy() - padbox.r = page_size.width - padbox.r - padbox.t = page_size.height - padbox.t - - image = ( - self._ppage.render( - scale=scale * 1.5, - rotation=0, # no additional rotation - crop=padbox.as_tuple(), - ) - .to_pil() - .resize(size=(round(cropbox.width * scale), round(cropbox.height * scale))) - ) # We resize the image from 1.5x the given scale to make it sharper. - - return image - - def get_size(self) -> Size: - return Size(width=self._ppage.get_width(), height=self._ppage.get_height()) - - def unload(self): - self._ppage = None - self._dpage = None - - -class DoclingParseV2DocumentBackend(PdfDocumentBackend): - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - - self._pdoc = pdfium.PdfDocument(self.path_or_stream) - self.parser = pdf_parser_v2("fatal") - - success = False - if isinstance(self.path_or_stream, BytesIO): - success = self.parser.load_document_from_bytesio( - self.document_hash, self.path_or_stream - ) - elif isinstance(self.path_or_stream, Path): - success = self.parser.load_document( - self.document_hash, str(self.path_or_stream) - ) - - if not success: - raise RuntimeError( - f"docling-parse v2 could not load document {self.document_hash}." 
- ) - - def page_count(self) -> int: - # return len(self._pdoc) # To be replaced with docling-parse API - - len_1 = len(self._pdoc) - len_2 = self.parser.number_of_pages(self.document_hash) - - if len_1 != len_2: - _log.error(f"Inconsistent number of pages: {len_1}!={len_2}") - - return len_2 - - def load_page(self, page_no: int) -> DoclingParseV2PageBackend: - return DoclingParseV2PageBackend( - self.parser, self.document_hash, page_no, self._pdoc[page_no] - ) - - def is_valid(self) -> bool: - return self.page_count() > 0 - - def unload(self): - super().unload() - self.parser.unload_document(self.document_hash) - self._pdoc.close() - self._pdoc = None diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/html_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/html_backend.py deleted file mode 100644 index 286dfbfaedbfee4c058a70c86a2f1520712f7b69..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/html_backend.py +++ /dev/null @@ -1,442 +0,0 @@ -import logging -from io import BytesIO -from pathlib import Path -from typing import Optional, Set, Union - -from bs4 import BeautifulSoup, Tag -from docling_core.types.doc import ( - DocItemLabel, - DoclingDocument, - DocumentOrigin, - GroupLabel, - TableCell, - TableData, -) - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class HTMLDocumentBackend(DeclarativeDocumentBackend): - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - _log.debug("About to init HTML backend...") - self.soup: Optional[Tag] = None - # HTML file: - self.path_or_stream = path_or_stream - # Initialise the parents for the hierarchy - self.max_levels = 10 - self.level = 0 - self.parents = {} # type: ignore - for i in 
range(0, self.max_levels): - self.parents[i] = None - self.labels = {} # type: ignore - - try: - if isinstance(self.path_or_stream, BytesIO): - text_stream = self.path_or_stream.getvalue() - self.soup = BeautifulSoup(text_stream, "html.parser") - if isinstance(self.path_or_stream, Path): - with open(self.path_or_stream, "rb") as f: - html_content = f.read() - self.soup = BeautifulSoup(html_content, "html.parser") - except Exception as e: - raise RuntimeError( - f"Could not initialize HTML backend for file with hash {self.document_hash}." - ) from e - - def is_valid(self) -> bool: - return self.soup is not None - - @classmethod - def supports_pagination(cls) -> bool: - return False - - def unload(self): - if isinstance(self.path_or_stream, BytesIO): - self.path_or_stream.close() - - self.path_or_stream = None - - @classmethod - def supported_formats(cls) -> Set[InputFormat]: - return {InputFormat.HTML} - - def convert(self) -> DoclingDocument: - # access self.path_or_stream to load stuff - origin = DocumentOrigin( - filename=self.file.name or "file", - mimetype="text/html", - binary_hash=self.document_hash, - ) - - doc = DoclingDocument(name=self.file.stem or "file", origin=origin) - _log.debug("Trying to convert HTML...") - - if self.is_valid(): - assert self.soup is not None - content = self.soup.body or self.soup - # Replace
tags with newline characters - for br in content.find_all("br"): - br.replace_with("\n") - doc = self.walk(content, doc) - else: - raise RuntimeError( - f"Cannot convert doc with {self.document_hash} because the backend failed to init." - ) - return doc - - def walk(self, element: Tag, doc: DoclingDocument): - try: - # Iterate over elements in the body of the document - for idx, element in enumerate(element.children): - try: - self.analyse_element(element, idx, doc) - except Exception as exc_child: - - _log.error(" -> error treating child: ", exc_child) - _log.error(" => element: ", element, "\n") - raise exc_child - - except Exception as exc: - pass - - return doc - - def analyse_element(self, element: Tag, idx: int, doc: DoclingDocument): - """ - if element.name!=None: - _log.debug("\t"*self.level, idx, "\t", f"{element.name} ({self.level})") - """ - - if element.name in self.labels: - self.labels[element.name] += 1 - else: - self.labels[element.name] = 1 - - if element.name in ["h1", "h2", "h3", "h4", "h5", "h6"]: - self.handle_header(element, idx, doc) - elif element.name in ["p"]: - self.handle_paragraph(element, idx, doc) - elif element.name in ["pre"]: - self.handle_code(element, idx, doc) - elif element.name in ["ul", "ol"]: - self.handle_list(element, idx, doc) - elif element.name in ["li"]: - self.handle_listitem(element, idx, doc) - elif element.name == "table": - self.handle_table(element, idx, doc) - elif element.name == "figure": - self.handle_figure(element, idx, doc) - elif element.name == "img": - self.handle_image(element, idx, doc) - else: - self.walk(element, doc) - - def get_direct_text(self, item: Tag): - """Get the direct text of the
  • element (ignoring nested lists).""" - text = item.find(string=True, recursive=False) - if isinstance(text, str): - return text.strip() - - return "" - - # Function to recursively extract text from all child nodes - def extract_text_recursively(self, item: Tag): - result = [] - - if isinstance(item, str): - return [item] - - if item.name not in ["ul", "ol"]: - try: - # Iterate over the children (and their text and tails) - for child in item: - try: - # Recursively get the child's text content - result.extend(self.extract_text_recursively(child)) - except: - pass - except: - _log.warn("item has no children") - pass - - return "".join(result) + " " - - def handle_header(self, element: Tag, idx: int, doc: DoclingDocument): - """Handles header tags (h1, h2, etc.).""" - hlevel = int(element.name.replace("h", "")) - slevel = hlevel - 1 - - label = DocItemLabel.SECTION_HEADER - text = element.text.strip() - - if hlevel == 1: - for key, val in self.parents.items(): - self.parents[key] = None - - self.level = 1 - self.parents[self.level] = doc.add_text( - parent=self.parents[0], label=DocItemLabel.TITLE, text=text - ) - else: - if hlevel > self.level: - - # add invisible group - for i in range(self.level + 1, hlevel): - self.parents[i] = doc.add_group( - name=f"header-{i}", - label=GroupLabel.SECTION, - parent=self.parents[i - 1], - ) - self.level = hlevel - - elif hlevel < self.level: - - # remove the tail - for key, val in self.parents.items(): - if key > hlevel: - self.parents[key] = None - self.level = hlevel - - self.parents[hlevel] = doc.add_heading( - parent=self.parents[hlevel - 1], - text=text, - level=hlevel, - ) - - def handle_code(self, element: Tag, idx: int, doc: DoclingDocument): - """Handles monospace code snippets (pre).""" - if element.text is None: - return - text = element.text.strip() - label = DocItemLabel.CODE - if len(text) == 0: - return - doc.add_code(parent=self.parents[self.level], text=text) - - def handle_paragraph(self, element: Tag, idx: 
int, doc: DoclingDocument): - """Handles paragraph tags (p).""" - if element.text is None: - return - text = element.text.strip() - label = DocItemLabel.PARAGRAPH - if len(text) == 0: - return - doc.add_text(parent=self.parents[self.level], label=label, text=text) - - def handle_list(self, element: Tag, idx: int, doc: DoclingDocument): - """Handles list tags (ul, ol) and their list items.""" - - if element.name == "ul": - # create a list group - self.parents[self.level + 1] = doc.add_group( - parent=self.parents[self.level], name="list", label=GroupLabel.LIST - ) - elif element.name == "ol": - # create a list group - self.parents[self.level + 1] = doc.add_group( - parent=self.parents[self.level], - name="ordered list", - label=GroupLabel.ORDERED_LIST, - ) - self.level += 1 - - self.walk(element, doc) - - self.parents[self.level + 1] = None - self.level -= 1 - - def handle_listitem(self, element: Tag, idx: int, doc: DoclingDocument): - """Handles listitem tags (li).""" - nested_lists = element.find(["ul", "ol"]) - - parent_list_label = self.parents[self.level].label - index_in_list = len(self.parents[self.level].children) + 1 - - if nested_lists: - name = element.name - # Text in list item can be hidden within hierarchy, hence - # we need to extract it recursively - text = self.extract_text_recursively(element) - # Flatten text, remove break lines: - text = text.replace("\n", "").replace("\r", "") - text = " ".join(text.split()).strip() - - marker = "" - enumerated = False - if parent_list_label == GroupLabel.ORDERED_LIST: - marker = str(index_in_list) - enumerated = True - - if len(text) > 0: - # create a list-item - self.parents[self.level + 1] = doc.add_list_item( - text=text, - enumerated=enumerated, - marker=marker, - parent=self.parents[self.level], - ) - self.level += 1 - - self.walk(element, doc) - - self.parents[self.level + 1] = None - self.level -= 1 - - elif isinstance(element.text, str): - text = element.text.strip() - - marker = "" - enumerated = False 
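The list handling above tracks nesting through a level counter and a parents map; the same flattening idea can be sketched standalone on plain Python data (the pair-based item encoding and four-space indent are illustrative assumptions, not the backend's types):

```python
def flatten_list(items, ordered=False, level=0):
    # Each item is either a string or a (text, sublist) pair; nested
    # sublists gain one extra level of indentation, mirroring the
    # level bookkeeping in handle_list/handle_listitem above.
    lines = []
    for i, item in enumerate(items, 1):
        text, sub = item if isinstance(item, tuple) else (item, None)
        marker = f"{i}." if ordered else "*"
        lines.append(f"{'    ' * level}{marker} {text}")
        if sub:
            lines.extend(flatten_list(sub, ordered=ordered, level=level + 1))
    return lines
```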
- if parent_list_label == GroupLabel.ORDERED_LIST: - marker = f"{str(index_in_list)}." - enumerated = True - doc.add_list_item( - text=text, - enumerated=enumerated, - marker=marker, - parent=self.parents[self.level], - ) - else: - _log.warn("list-item has no text: ", element) - - def handle_table(self, element: Tag, idx: int, doc: DoclingDocument): - """Handles table tags.""" - - nested_tables = element.find("table") - if nested_tables is not None: - _log.warn("detected nested tables: skipping for now") - return - - # Count the number of rows (number of elements) - num_rows = len(element.find_all("tr")) - - # Find the number of columns (taking into account colspan) - num_cols = 0 - for row in element.find_all("tr"): - col_count = 0 - for cell in row.find_all(["td", "th"]): - colspan = int(cell.get("colspan", 1)) - col_count += colspan - num_cols = max(num_cols, col_count) - - grid = [[None for _ in range(num_cols)] for _ in range(num_rows)] - - data = TableData(num_rows=num_rows, num_cols=num_cols, table_cells=[]) - - # Iterate over the rows in the table - for row_idx, row in enumerate(element.find_all("tr")): - - # For each row, find all the column cells (both and ) - cells = row.find_all(["td", "th"]) - - # Check if each cell in the row is a header -> means it is a column header - col_header = True - for j, html_cell in enumerate(cells): - if html_cell.name == "td": - col_header = False - - col_idx = 0 - # Extract and print the text content of each cell - for _, html_cell in enumerate(cells): - - text = html_cell.text - try: - text = self.extract_table_cell_text(html_cell) - except Exception as exc: - _log.warn("exception: ", exc) - exit(-1) - - # label = html_cell.name - - col_span = int(html_cell.get("colspan", 1)) - row_span = int(html_cell.get("rowspan", 1)) - - while grid[row_idx][col_idx] is not None: - col_idx += 1 - for r in range(row_span): - for c in range(col_span): - grid[row_idx + r][col_idx + c] = text - - cell = TableCell( - text=text, - 
row_span=row_span, - col_span=col_span, - start_row_offset_idx=row_idx, - end_row_offset_idx=row_idx + row_span, - start_col_offset_idx=col_idx, - end_col_offset_idx=col_idx + col_span, - col_header=col_header, - row_header=((not col_header) and html_cell.name == "th"), - ) - data.table_cells.append(cell) - - doc.add_table(data=data, parent=self.parents[self.level]) - - def get_list_text(self, list_element: Tag, level=0): - """Recursively extract text from
<ul> or <ol>
        with proper indentation.""" - result = [] - bullet_char = "*" # Default bullet character for unordered lists - - if list_element.name == "ol": # For ordered lists, use numbers - for i, li in enumerate(list_element.find_all("li", recursive=False), 1): - # Add numbering for ordered lists - result.append(f"{' ' * level}{i}. {li.get_text(strip=True)}") - # Handle nested lists - nested_list = li.find(["ul", "ol"]) - if nested_list: - result.extend(self.get_list_text(nested_list, level + 1)) - elif list_element.name == "ul": # For unordered lists, use bullet points - for li in list_element.find_all("li", recursive=False): - # Add bullet points for unordered lists - result.append( - f"{' ' * level}{bullet_char} {li.get_text(strip=True)}" - ) - # Handle nested lists - nested_list = li.find(["ul", "ol"]) - if nested_list: - result.extend(self.get_list_text(nested_list, level + 1)) - - return result - - def extract_table_cell_text(self, cell: Tag): - """Extract text from a table cell, including lists with indents.""" - contains_lists = cell.find(["ul", "ol"]) - if contains_lists is None: - return cell.text - else: - _log.debug( - "should extract the content correctly for table-cells with lists ..." 
- ) - return cell.text - - def handle_figure(self, element: Tag, idx: int, doc: DoclingDocument): - """Handles image tags (img).""" - - # Extract the image URI from the tag - # image_uri = root.xpath('//figure//img/@src')[0] - - contains_captions = element.find(["figcaption"]) - if contains_captions is None: - doc.add_picture(parent=self.parents[self.level], caption=None) - - else: - texts = [] - for item in contains_captions: - texts.append(item.text) - - fig_caption = doc.add_text( - label=DocItemLabel.CAPTION, text=("".join(texts)).strip() - ) - doc.add_picture( - parent=self.parents[self.level], - caption=fig_caption, - ) - - def handle_image(self, element: Tag, idx, doc: DoclingDocument): - """Handles image tags (img).""" - doc.add_picture(parent=self.parents[self.level], caption=None) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/json/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/json/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/json/docling_json_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/json/docling_json_backend.py deleted file mode 100644 index 73ac69720b61cf1eb3bc0566e676603cf0ede53c..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/json/docling_json_backend.py +++ /dev/null @@ -1,58 +0,0 @@ -from io import BytesIO -from pathlib import Path -from typing import Union - -from docling_core.types.doc import DoclingDocument -from typing_extensions import override - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - - -class DoclingJSONBackend(DeclarativeDocumentBackend): - @override - def __init__( - self, in_doc: InputDocument, path_or_stream: Union[BytesIO, 
Path] - ) -> None: - super().__init__(in_doc, path_or_stream) - - # given we need to store any actual conversion exception for raising it from - # convert(), this captures the successful result or the actual error in a - # mutually exclusive way: - self._doc_or_err = self._get_doc_or_err() - - @override - def is_valid(self) -> bool: - return isinstance(self._doc_or_err, DoclingDocument) - - @classmethod - @override - def supports_pagination(cls) -> bool: - return False - - @classmethod - @override - def supported_formats(cls) -> set[InputFormat]: - return {InputFormat.JSON_DOCLING} - - def _get_doc_or_err(self) -> Union[DoclingDocument, Exception]: - try: - json_data: Union[str, bytes] - if isinstance(self.path_or_stream, Path): - with open(self.path_or_stream, encoding="utf-8") as f: - json_data = f.read() - elif isinstance(self.path_or_stream, BytesIO): - json_data = self.path_or_stream.getvalue() - else: - raise RuntimeError(f"Unexpected: {type(self.path_or_stream)=}") - return DoclingDocument.model_validate_json(json_data=json_data) - except Exception as e: - return e - - @override - def convert(self) -> DoclingDocument: - if isinstance(self._doc_or_err, DoclingDocument): - return self._doc_or_err - else: - raise self._doc_or_err diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/md_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/md_backend.py deleted file mode 100644 index 19a21c19d7fbbafaeea9ca95a89e13fec8387b1d..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/md_backend.py +++ /dev/null @@ -1,428 +0,0 @@ -import logging -import re -import warnings -from io import BytesIO -from pathlib import Path -from typing import List, Optional, Set, Union - -import marko -import marko.element -import marko.ext -import marko.ext.gfm -import marko.inline -from docling_core.types.doc import ( - DocItem, - DocItemLabel, - DoclingDocument, - DocumentOrigin, - GroupLabel, - NodeItem, - 
TableCell, - TableData, - TextItem, -) -from marko import Markdown - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.backend.html_backend import HTMLDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - -_MARKER_BODY = "DOCLING_DOC_MD_HTML_EXPORT" -_START_MARKER = f"#_#_{_MARKER_BODY}_START_#_#" -_STOP_MARKER = f"#_#_{_MARKER_BODY}_STOP_#_#" - - -class MarkdownDocumentBackend(DeclarativeDocumentBackend): - def _shorten_underscore_sequences(self, markdown_text: str, max_length: int = 10): - # This regex will match any sequence of underscores - pattern = r"_+" - - def replace_match(match): - underscore_sequence = match.group( - 0 - ) # Get the full match (sequence of underscores) - - # Shorten the sequence if it exceeds max_length - if len(underscore_sequence) > max_length: - return "_" * max_length - else: - return underscore_sequence # Leave it unchanged if it is shorter or equal to max_length - - # Use re.sub to replace long underscore sequences - shortened_text = re.sub(pattern, replace_match, markdown_text) - - if len(shortened_text) != len(markdown_text): - warnings.warn("Detected potentially incorrect Markdown, correcting...") - - return shortened_text - - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - - _log.debug("MD INIT!!!") - - # Markdown file: - self.path_or_stream = path_or_stream - self.valid = True - self.markdown = "" # To store original Markdown string - - self.in_table = False - self.md_table_buffer: list[str] = [] - self.inline_texts: list[str] = [] - self._html_blocks: int = 0 - - try: - if isinstance(self.path_or_stream, BytesIO): - text_stream = self.path_or_stream.getvalue().decode("utf-8") - # remove invalid sequences - # very long sequences of underscores will lead to unnecessary long processing times. 
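The underscore guard in _shorten_underscore_sequences above can be exercised in isolation; a minimal sketch of the same capping regex (the function name is illustrative):

```python
import re

def shorten_underscores(text, max_length=10):
    # Cap any run of underscores at max_length characters so that
    # pathological inputs do not stall the Markdown parser, as in
    # _shorten_underscore_sequences above.
    return re.sub(r"_+", lambda m: m.group(0)[:max_length], text)
```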
- # In any proper Markdown files, underscores have to be escaped, - # otherwise they represent emphasis (bold or italic) - self.markdown = self._shorten_underscore_sequences(text_stream) - if isinstance(self.path_or_stream, Path): - with open(self.path_or_stream, "r", encoding="utf-8") as f: - md_content = f.read() - # remove invalid sequences - # very long sequences of underscores will lead to unnecessary long processing times. - # In any proper Markdown files, underscores have to be escaped, - # otherwise they represent emphasis (bold or italic) - self.markdown = self._shorten_underscore_sequences(md_content) - self.valid = True - - _log.debug(self.markdown) - except Exception as e: - raise RuntimeError( - f"Could not initialize MD backend for file with hash {self.document_hash}." - ) from e - return - - def _close_table(self, doc: DoclingDocument): - if self.in_table: - _log.debug("=== TABLE START ===") - for md_table_row in self.md_table_buffer: - _log.debug(md_table_row) - _log.debug("=== TABLE END ===") - tcells: List[TableCell] = [] - result_table = [] - for n, md_table_row in enumerate(self.md_table_buffer): - data = [] - if n == 0: - header = [t.strip() for t in md_table_row.split("|")[1:-1]] - for value in header: - data.append(value) - result_table.append(data) - if n > 1: - values = [t.strip() for t in md_table_row.split("|")[1:-1]] - for value in values: - data.append(value) - result_table.append(data) - - for trow_ind, trow in enumerate(result_table): - for tcol_ind, cellval in enumerate(trow): - row_span = ( - 1 # currently supporting just simple tables (without spans) - ) - col_span = ( - 1 # currently supporting just simple tables (without spans) - ) - icell = TableCell( - text=cellval.strip(), - row_span=row_span, - col_span=col_span, - start_row_offset_idx=trow_ind, - end_row_offset_idx=trow_ind + row_span, - start_col_offset_idx=tcol_ind, - end_col_offset_idx=tcol_ind + col_span, - col_header=False, - row_header=False, - ) - tcells.append(icell) 
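The row handling in _close_table above can be summarized as a standalone sketch (assuming well-formed pipe tables; the helper name is illustrative):

```python
def md_table_rows(buffer):
    # The first buffered line is the header row, the second is the
    # |---|---| separator (skipped), and the rest are data rows:
    # the same indexing as _close_table above.
    rows = []
    for n, line in enumerate(buffer):
        if n == 1:
            continue  # separator row
        rows.append([cell.strip() for cell in line.split("|")[1:-1]])
    return rows
```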
- - num_rows = len(result_table) - num_cols = len(result_table[0]) - self.in_table = False - self.md_table_buffer = [] # clean table markdown buffer - # Initialize Docling TableData - table_data = TableData( - num_rows=num_rows, num_cols=num_cols, table_cells=tcells - ) - # Populate - for tcell in tcells: - table_data.table_cells.append(tcell) - if len(tcells) > 0: - doc.add_table(data=table_data) - return - - def _process_inline_text( - self, parent_item: Optional[NodeItem], doc: DoclingDocument - ): - txt = " ".join(self.inline_texts) - if len(txt) > 0: - doc.add_text( - label=DocItemLabel.PARAGRAPH, - parent=parent_item, - text=txt, - ) - self.inline_texts = [] - - def _iterate_elements( - self, - element: marko.element.Element, - depth: int, - doc: DoclingDocument, - visited: Set[marko.element.Element], - parent_item: Optional[NodeItem] = None, - ): - - if element in visited: - return - - # Iterates over all elements in the AST - # Check for different element types and process relevant details - if isinstance(element, marko.block.Heading) and len(element.children) > 0: - self._close_table(doc) - self._process_inline_text(parent_item, doc) - _log.debug( - f" - Heading level {element.level}, content: {element.children[0].children}" # type: ignore - ) - if element.level == 1: - doc_label = DocItemLabel.TITLE - else: - doc_label = DocItemLabel.SECTION_HEADER - - # Header could have arbitrary inclusion of bold, italic or emphasis, - # hence we need to traverse the tree to get full text of a header - strings: List[str] = [] - - # Define a recursive function to traverse the tree - def traverse(node: marko.block.BlockElement): - # Check if the node has a "children" attribute - if hasattr(node, "children"): - # If "children" is a list, continue traversal - if isinstance(node.children, list): - for child in node.children: - traverse(child) - # If "children" is text, add it to header text - elif isinstance(node.children, str): - strings.append(node.children) - - 
traverse(element) - snippet_text = "".join(strings) - if len(snippet_text) > 0: - parent_item = doc.add_text( - label=doc_label, parent=parent_item, text=snippet_text - ) - - elif isinstance(element, marko.block.List): - has_non_empty_list_items = False - for child in element.children: - if isinstance(child, marko.block.ListItem) and len(child.children) > 0: - has_non_empty_list_items = True - break - - self._close_table(doc) - self._process_inline_text(parent_item, doc) - _log.debug(f" - List {'ordered' if element.ordered else 'unordered'}") - if has_non_empty_list_items: - label = GroupLabel.ORDERED_LIST if element.ordered else GroupLabel.LIST - parent_item = doc.add_group( - label=label, name=f"list", parent=parent_item - ) - - elif isinstance(element, marko.block.ListItem) and len(element.children) > 0: - self._close_table(doc) - self._process_inline_text(parent_item, doc) - _log.debug(" - List item") - - first_child = element.children[0] - snippet_text = str(first_child.children[0].children) # type: ignore - is_numbered = False - if ( - parent_item is not None - and isinstance(parent_item, DocItem) - and parent_item.label == GroupLabel.ORDERED_LIST - ): - is_numbered = True - doc.add_list_item( - enumerated=is_numbered, parent=parent_item, text=snippet_text - ) - visited.add(first_child) - - elif isinstance(element, marko.inline.Image): - self._close_table(doc) - self._process_inline_text(parent_item, doc) - _log.debug(f" - Image with alt: {element.title}, url: {element.dest}") - - fig_caption: Optional[TextItem] = None - if element.title is not None and element.title != "": - fig_caption = doc.add_text( - label=DocItemLabel.CAPTION, text=element.title - ) - - doc.add_picture(parent=parent_item, caption=fig_caption) - - elif isinstance(element, marko.block.Paragraph) and len(element.children) > 0: - self._process_inline_text(parent_item, doc) - - elif isinstance(element, marko.inline.RawText): - _log.debug(f" - Paragraph (raw text): {element.children}") - 
snippet_text = element.children.strip() - # Detect start of the table: - if "|" in snippet_text: - # most likely part of the markdown table - self.in_table = True - if len(self.md_table_buffer) > 0: - self.md_table_buffer[len(self.md_table_buffer) - 1] += snippet_text - else: - self.md_table_buffer.append(snippet_text) - else: - self._close_table(doc) - # most likely just inline text - self.inline_texts.append(str(element.children)) - - elif isinstance(element, marko.inline.CodeSpan): - self._close_table(doc) - self._process_inline_text(parent_item, doc) - _log.debug(f" - Code Span: {element.children}") - snippet_text = str(element.children).strip() - doc.add_code(parent=parent_item, text=snippet_text) - - elif ( - isinstance(element, (marko.block.CodeBlock, marko.block.FencedCode)) - and len(element.children) > 0 - and isinstance((first_child := element.children[0]), marko.inline.RawText) - and len(snippet_text := (first_child.children.strip())) > 0 - ): - self._close_table(doc) - self._process_inline_text(parent_item, doc) - _log.debug(f" - Code Block: {element.children}") - doc.add_code(parent=parent_item, text=snippet_text) - - elif isinstance(element, marko.inline.LineBreak): - if self.in_table: - _log.debug("Line break in a table") - self.md_table_buffer.append("") - - elif isinstance(element, marko.block.HTMLBlock): - self._html_blocks += 1 - self._process_inline_text(parent_item, doc) - self._close_table(doc) - _log.debug("HTML Block: {}".format(element)) - if ( - len(element.body) > 0 - ): # If Marko doesn't return any content for HTML block, skip it - html_block = element.body.strip() - - # wrap in markers to enable post-processing in convert() - text_to_add = f"{_START_MARKER}{html_block}{_STOP_MARKER}" - doc.add_code(parent=parent_item, text=text_to_add) - else: - if not isinstance(element, str): - self._close_table(doc) - _log.debug("Some other element: {}".format(element)) - - processed_block_types = ( - marko.block.Heading, - marko.block.CodeBlock, - 
marko.block.FencedCode,
- marko.inline.RawText,
- )
-
- # Iterate through the element's children (if any)
- if hasattr(element, "children") and not isinstance(
- element, processed_block_types
- ):
- for child in element.children:
- self._iterate_elements(
- element=child,
- depth=depth + 1,
- doc=doc,
- visited=visited,
- parent_item=parent_item,
- )
-
- def is_valid(self) -> bool:
- return self.valid
-
- def unload(self):
- if isinstance(self.path_or_stream, BytesIO):
- self.path_or_stream.close()
- self.path_or_stream = None
-
- @classmethod
- def supports_pagination(cls) -> bool:
- return False
-
- @classmethod
- def supported_formats(cls) -> Set[InputFormat]:
- return {InputFormat.MD}
-
- def convert(self) -> DoclingDocument:
- _log.debug("converting Markdown...")
-
- origin = DocumentOrigin(
- filename=self.file.name or "file",
- mimetype="text/markdown",
- binary_hash=self.document_hash,
- )
-
- doc = DoclingDocument(name=self.file.stem or "file", origin=origin)
-
- if self.is_valid():
- # Parse the markdown into an abstract syntax tree (AST)
- marko_parser = Markdown()
- parsed_ast = marko_parser.parse(self.markdown)
- # Start iterating from the root of the AST
- self._iterate_elements(
- element=parsed_ast,
- depth=0,
- doc=doc,
- parent_item=None,
- visited=set(),
- )
- self._process_inline_text(None, doc) # handle last hanging inline text
- self._close_table(doc=doc) # handle any last hanging table
-
- # if HTML blocks were detected, export to HTML and delegate to HTML backend
- if self._html_blocks > 0:
-
- # export to HTML
- html_backend_cls = HTMLDocumentBackend
- html_str = doc.export_to_html()
-
- def _restore_original_html(txt, regex):
- _txt, count = re.subn(regex, "", txt)
- if count != self._html_blocks:
- raise RuntimeError(
- "An internal error has occurred during Markdown conversion."
- )
- return _txt
-
- # restore original HTML by removing previously added markers
- for regex in [
- rf"
<pre>\s*<code>\s*{_START_MARKER}",
-                    rf"{_STOP_MARKER}\s*</code>\s*</pre>
        ", - ]: - html_str = _restore_original_html(txt=html_str, regex=regex) - self._html_blocks = 0 - - # delegate to HTML backend - stream = BytesIO(bytes(html_str, encoding="utf-8")) - in_doc = InputDocument( - path_or_stream=stream, - format=InputFormat.HTML, - backend=html_backend_cls, - filename=self.file.name, - ) - html_backend_obj = html_backend_cls( - in_doc=in_doc, path_or_stream=stream - ) - doc = html_backend_obj.convert() - else: - raise RuntimeError( - f"Cannot convert md with {self.document_hash} because the backend failed to init." - ) - return doc diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/msexcel_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/msexcel_backend.py deleted file mode 100644 index 19c25341375a6525598b2077b5e933a301c8b571..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/msexcel_backend.py +++ /dev/null @@ -1,386 +0,0 @@ -import logging -from io import BytesIO -from pathlib import Path -from typing import Dict, Set, Tuple, Union - -from docling_core.types.doc import ( - DoclingDocument, - DocumentOrigin, - GroupLabel, - ImageRef, - TableCell, - TableData, -) - -# from lxml import etree -from openpyxl import Workbook, load_workbook -from openpyxl.cell.cell import Cell -from openpyxl.drawing.image import Image -from openpyxl.worksheet.worksheet import Worksheet - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - -from typing import Any, List - -from PIL import Image as PILImage -from pydantic import BaseModel - - -class ExcelCell(BaseModel): - row: int - col: int - text: str - row_span: int - col_span: int - - -class ExcelTable(BaseModel): - num_rows: int - num_cols: int - data: List[ExcelCell] - - -class MsExcelDocumentBackend(DeclarativeDocumentBackend): - def 
__init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]):
- super().__init__(in_doc, path_or_stream)
-
- # Initialise the parents for the hierarchy
- self.max_levels = 10
-
- self.parents: Dict[int, Any] = {}
- for i in range(-1, self.max_levels):
- self.parents[i] = None
-
- self.workbook = None
- try:
- if isinstance(self.path_or_stream, BytesIO):
- self.workbook = load_workbook(filename=self.path_or_stream)
-
- elif isinstance(self.path_or_stream, Path):
- self.workbook = load_workbook(filename=str(self.path_or_stream))
-
- self.valid = True
- except Exception as e:
- self.valid = False
-
- raise RuntimeError(
- f"MsExcelDocumentBackend could not load document with hash {self.document_hash}"
- ) from e
-
- def is_valid(self) -> bool:
- _log.info(f"valid: {self.valid}")
- return self.valid
-
- @classmethod
- def supports_pagination(cls) -> bool:
- return True
-
- def unload(self):
- if isinstance(self.path_or_stream, BytesIO):
- self.path_or_stream.close()
-
- self.path_or_stream = None
-
- @classmethod
- def supported_formats(cls) -> Set[InputFormat]:
- return {InputFormat.XLSX}
-
- def convert(self) -> DoclingDocument:
- # Parses the XLSX into a structured document model.
-
- origin = DocumentOrigin(
- filename=self.file.name or "file.xlsx",
- mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
- binary_hash=self.document_hash,
- )
-
- doc = DoclingDocument(name=self.file.stem or "file.xlsx", origin=origin)
-
- if self.is_valid():
- doc = self._convert_workbook(doc)
- else:
- raise RuntimeError(
- f"Cannot convert doc with {self.document_hash} because the backend failed to init."
- ) - - return doc - - def _convert_workbook(self, doc: DoclingDocument) -> DoclingDocument: - - if self.workbook is not None: - - # Iterate over all sheets - for sheet_name in self.workbook.sheetnames: - _log.info(f"Processing sheet: {sheet_name}") - - # Access the sheet by name - sheet = self.workbook[sheet_name] - - self.parents[0] = doc.add_group( - parent=None, - label=GroupLabel.SECTION, - name=f"sheet: {sheet_name}", - ) - - doc = self._convert_sheet(doc, sheet) - else: - _log.error("Workbook is not initialized.") - - return doc - - def _convert_sheet(self, doc: DoclingDocument, sheet: Worksheet): - - doc = self._find_tables_in_sheet(doc, sheet) - - doc = self._find_images_in_sheet(doc, sheet) - - return doc - - def _find_tables_in_sheet(self, doc: DoclingDocument, sheet: Worksheet): - - tables = self._find_data_tables(sheet) - - for excel_table in tables: - num_rows = excel_table.num_rows - num_cols = excel_table.num_cols - - table_data = TableData( - num_rows=num_rows, - num_cols=num_cols, - table_cells=[], - ) - - for excel_cell in excel_table.data: - - cell = TableCell( - text=excel_cell.text, - row_span=excel_cell.row_span, - col_span=excel_cell.col_span, - start_row_offset_idx=excel_cell.row, - end_row_offset_idx=excel_cell.row + excel_cell.row_span, - start_col_offset_idx=excel_cell.col, - end_col_offset_idx=excel_cell.col + excel_cell.col_span, - col_header=False, - row_header=False, - ) - table_data.table_cells.append(cell) - - doc.add_table(data=table_data, parent=self.parents[0]) - - return doc - - def _find_data_tables(self, sheet: Worksheet): - """ - Find all compact rectangular data tables in a sheet. 
- """ - # _log.info("find_data_tables") - - tables = [] # List to store found tables - visited: set[Tuple[int, int]] = set() # Track already visited cells - - # Iterate over all cells in the sheet - for ri, row in enumerate(sheet.iter_rows(values_only=False)): - for rj, cell in enumerate(row): - - # Skip empty or already visited cells - if cell.value is None or (ri, rj) in visited: - continue - - # If the cell starts a new table, find its bounds - table_bounds, visited_cells = self._find_table_bounds( - sheet, ri, rj, visited - ) - - visited.update(visited_cells) # Mark these cells as visited - tables.append(table_bounds) - - return tables - - def _find_table_bounds( - self, - sheet: Worksheet, - start_row: int, - start_col: int, - visited: set[Tuple[int, int]], - ): - """ - Determine the bounds of a compact rectangular table. - Returns: - - A dictionary with the bounds and data. - - A set of visited cell coordinates. - """ - _log.info("find_table_bounds") - - max_row = self._find_table_bottom(sheet, start_row, start_col) - max_col = self._find_table_right(sheet, start_row, start_col) - - # Collect the data within the bounds - data = [] - visited_cells = set() - for ri in range(start_row, max_row + 1): - for rj in range(start_col, max_col + 1): - - cell = sheet.cell(row=ri + 1, column=rj + 1) # 1-based indexing - - # Check if the cell belongs to a merged range - row_span = 1 - col_span = 1 - - # _log.info(sheet.merged_cells.ranges) - for merged_range in sheet.merged_cells.ranges: - - if ( - merged_range.min_row <= ri + 1 - and ri + 1 <= merged_range.max_row - and merged_range.min_col <= rj + 1 - and rj + 1 <= merged_range.max_col - ): - - row_span = merged_range.max_row - merged_range.min_row + 1 - col_span = merged_range.max_col - merged_range.min_col + 1 - break - - if (ri, rj) not in visited_cells: - data.append( - ExcelCell( - row=ri - start_row, - col=rj - start_col, - text=str(cell.value), - row_span=row_span, - col_span=col_span, - ) - ) - # 
_log.info(f"cell: {ri}, {rj} -> {ri - start_row}, {rj - start_col}, {row_span}, {col_span}: {str(cell.value)}") - - # Mark all cells in the span as visited - for span_row in range(ri, ri + row_span): - for span_col in range(rj, rj + col_span): - visited_cells.add((span_row, span_col)) - - return ( - ExcelTable( - num_rows=max_row + 1 - start_row, - num_cols=max_col + 1 - start_col, - data=data, - ), - visited_cells, - ) - - def _find_table_bottom(self, sheet: Worksheet, start_row: int, start_col: int): - """Function to find the bottom boundary of the table""" - - max_row = start_row - - while max_row < sheet.max_row - 1: - # Get the cell value or check if it is part of a merged cell - cell = sheet.cell(row=max_row + 2, column=start_col + 1) - - # Check if the cell is part of a merged range - merged_range = next( - (mr for mr in sheet.merged_cells.ranges if cell.coordinate in mr), - None, - ) - - if cell.value is None and not merged_range: - break # Stop if the cell is empty and not merged - - # Expand max_row to include the merged range if applicable - if merged_range: - max_row = max(max_row, merged_range.max_row - 1) - else: - max_row += 1 - - return max_row - - def _find_table_right(self, sheet: Worksheet, start_row: int, start_col: int): - """Function to find the right boundary of the table""" - - max_col = start_col - - while max_col < sheet.max_column - 1: - # Get the cell value or check if it is part of a merged cell - cell = sheet.cell(row=start_row + 1, column=max_col + 2) - - # Check if the cell is part of a merged range - merged_range = next( - (mr for mr in sheet.merged_cells.ranges if cell.coordinate in mr), - None, - ) - - if cell.value is None and not merged_range: - break # Stop if the cell is empty and not merged - - # Expand max_col to include the merged range if applicable - if merged_range: - max_col = max(max_col, merged_range.max_col - 1) - else: - max_col += 1 - - return max_col - - def _find_images_in_sheet( - self, doc: DoclingDocument, 
sheet: Worksheet - ) -> DoclingDocument: - - # Iterate over byte images in the sheet - for idx, image in enumerate(sheet._images): # type: ignore - - try: - pil_image = PILImage.open(image.ref) - - doc.add_picture( - parent=self.parents[0], - image=ImageRef.from_pil(image=pil_image, dpi=72), - caption=None, - ) - except: - _log.error("could not extract the image from excel sheets") - - """ - for idx, chart in enumerate(sheet._charts): # type: ignore - try: - chart_path = f"chart_{idx + 1}.png" - _log.info( - f"Chart found, but dynamic rendering is required for: {chart_path}" - ) - - _log.info(f"Chart {idx + 1}:") - - # Chart type - # _log.info(f"Type: {type(chart).__name__}") - print(f"Type: {type(chart).__name__}") - - # Extract series data - for series_idx, series in enumerate(chart.series): - #_log.info(f"Series {series_idx + 1}:") - print(f"Series {series_idx + 1} type: {type(series).__name__}") - #print(f"x-values: {series.xVal}") - #print(f"y-values: {series.yVal}") - - print(f"xval type: {type(series.xVal).__name__}") - - xvals = [] - for _ in series.xVal.numLit.pt: - print(f"xval type: {type(_).__name__}") - if hasattr(_, 'v'): - xvals.append(_.v) - - print(f"x-values: {xvals}") - - yvals = [] - for _ in series.yVal: - if hasattr(_, 'v'): - yvals.append(_.v) - - print(f"y-values: {yvals}") - - except Exception as exc: - print(exc) - continue - """ - - return doc diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/mspowerpoint_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/mspowerpoint_backend.py deleted file mode 100644 index 8b86008bdbd1c72cf1392af091a9ae5c174a2de5..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/mspowerpoint_backend.py +++ /dev/null @@ -1,424 +0,0 @@ -import logging -from io import BytesIO -from pathlib import Path -from typing import Set, Union - -from docling_core.types.doc import ( - BoundingBox, - CoordOrigin, - DocItemLabel, - DoclingDocument, 
- DocumentOrigin, - GroupLabel, - ImageRef, - ProvenanceItem, - Size, - TableCell, - TableData, -) -from PIL import Image, UnidentifiedImageError -from pptx import Presentation -from pptx.enum.shapes import MSO_SHAPE_TYPE, PP_PLACEHOLDER - -from docling.backend.abstract_backend import ( - DeclarativeDocumentBackend, - PaginatedDocumentBackend, -) -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class MsPowerpointDocumentBackend(DeclarativeDocumentBackend, PaginatedDocumentBackend): - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - self.namespaces = { - "a": "http://schemas.openxmlformats.org/drawingml/2006/main", - "c": "http://schemas.openxmlformats.org/drawingml/2006/chart", - "p": "http://schemas.openxmlformats.org/presentationml/2006/main", - } - # Powerpoint file: - self.path_or_stream = path_or_stream - - self.pptx_obj = None - self.valid = False - try: - if isinstance(self.path_or_stream, BytesIO): - self.pptx_obj = Presentation(self.path_or_stream) - elif isinstance(self.path_or_stream, Path): - self.pptx_obj = Presentation(str(self.path_or_stream)) - - self.valid = True - except Exception as e: - raise RuntimeError( - f"MsPowerpointDocumentBackend could not load document with hash {self.document_hash}" - ) from e - - return - - def page_count(self) -> int: - if self.is_valid(): - assert self.pptx_obj is not None - return len(self.pptx_obj.slides) - else: - return 0 - - def is_valid(self) -> bool: - return self.valid - - @classmethod - def supports_pagination(cls) -> bool: - return True # True? if so, how to handle pages... 
- - def unload(self): - if isinstance(self.path_or_stream, BytesIO): - self.path_or_stream.close() - - self.path_or_stream = None - - @classmethod - def supported_formats(cls) -> Set[InputFormat]: - return {InputFormat.PPTX} - - def convert(self) -> DoclingDocument: - # Parses the PPTX into a structured document model. - # origin = DocumentOrigin(filename=self.path_or_stream.name, mimetype=next(iter(FormatToMimeType.get(InputFormat.PPTX))), binary_hash=self.document_hash) - - origin = DocumentOrigin( - filename=self.file.name or "file", - mimetype="application/vnd.ms-powerpoint", - binary_hash=self.document_hash, - ) - - doc = DoclingDocument( - name=self.file.stem or "file", origin=origin - ) # must add origin information - doc = self.walk_linear(self.pptx_obj, doc) - - return doc - - def generate_prov( - self, shape, slide_ind, text="", slide_size=Size(width=1, height=1) - ): - if shape.left: - left = shape.left - top = shape.top - width = shape.width - height = shape.height - else: - left = 0 - top = 0 - width = slide_size.width - height = slide_size.height - shape_bbox = [left, top, left + width, top + height] - shape_bbox = BoundingBox.from_tuple(shape_bbox, origin=CoordOrigin.BOTTOMLEFT) - prov = ProvenanceItem( - page_no=slide_ind + 1, charspan=[0, len(text)], bbox=shape_bbox - ) - - return prov - - def handle_text_elements(self, shape, parent_slide, slide_ind, doc, slide_size): - is_a_list = False - is_list_group_created = False - enum_list_item_value = 0 - new_list = None - bullet_type = "None" - list_text = "" - list_label = GroupLabel.LIST - doc_label = DocItemLabel.LIST_ITEM - prov = self.generate_prov(shape, slide_ind, shape.text.strip(), slide_size) - - # Identify if shape contains lists - for paragraph in shape.text_frame.paragraphs: - # Check if paragraph is a bullet point using the `element` XML - p = paragraph._element - if ( - p.find(".//a:buChar", namespaces={"a": self.namespaces["a"]}) - is not None - ): - bullet_type = "Bullet" - is_a_list = 
True - elif ( - p.find(".//a:buAutoNum", namespaces={"a": self.namespaces["a"]}) - is not None - ): - bullet_type = "Numbered" - is_a_list = True - else: - is_a_list = False - - if paragraph.level > 0: - # Most likely a sub-list - is_a_list = True - - if is_a_list: - # Determine if this is an unordered list or an ordered list. - # Set GroupLabel.ORDERED_LIST when it fits. - if bullet_type == "Numbered": - list_label = GroupLabel.ORDERED_LIST - - if is_a_list: - _log.debug("LIST DETECTED!") - else: - _log.debug("No List") - - # If there is a list inside of the shape, create a new docling list to assign list items to - # if is_a_list: - # new_list = doc.add_group( - # label=list_label, name=f"list", parent=parent_slide - # ) - - # Iterate through paragraphs to build up text - for paragraph in shape.text_frame.paragraphs: - # p_text = paragraph.text.strip() - p = paragraph._element - enum_list_item_value += 1 - inline_paragraph_text = "" - inline_list_item_text = "" - - for e in p.iterfind(".//a:r", namespaces={"a": self.namespaces["a"]}): - if len(e.text.strip()) > 0: - e_is_a_list_item = False - is_numbered = False - if ( - p.find(".//a:buChar", namespaces={"a": self.namespaces["a"]}) - is not None - ): - bullet_type = "Bullet" - e_is_a_list_item = True - elif ( - p.find(".//a:buAutoNum", namespaces={"a": self.namespaces["a"]}) - is not None - ): - bullet_type = "Numbered" - is_numbered = True - e_is_a_list_item = True - else: - e_is_a_list_item = False - - if e_is_a_list_item: - if len(inline_paragraph_text) > 0: - # output accumulated inline text: - doc.add_text( - label=doc_label, - parent=parent_slide, - text=inline_paragraph_text, - prov=prov, - ) - # Set marker and enumerated arguments if this is an enumeration element. 
- inline_list_item_text += e.text
- # print(e.text)
- else:
- # Assign proper label to the text, depending if it's a Title or Section Header
- # For other types of text, assign - PARAGRAPH
- doc_label = DocItemLabel.PARAGRAPH
- if shape.is_placeholder:
- placeholder_type = shape.placeholder_format.type
- if placeholder_type in [
- PP_PLACEHOLDER.CENTER_TITLE,
- PP_PLACEHOLDER.TITLE,
- ]:
- # It's a title
- doc_label = DocItemLabel.TITLE
- elif placeholder_type == PP_PLACEHOLDER.SUBTITLE:
- doc_label = DocItemLabel.SECTION_HEADER
- enum_list_item_value = 0
- inline_paragraph_text += e.text
-
- if len(inline_paragraph_text) > 0:
- # output accumulated inline text:
- doc.add_text(
- label=doc_label,
- parent=parent_slide,
- text=inline_paragraph_text,
- prov=prov,
- )
-
- if len(inline_list_item_text) > 0:
- enum_marker = ""
- if is_numbered:
- enum_marker = str(enum_list_item_value) + "."
- if not is_list_group_created:
- new_list = doc.add_group(
- label=list_label, name="list", parent=parent_slide
- )
- is_list_group_created = True
- doc.add_list_item(
- marker=enum_marker,
- enumerated=is_numbered,
- parent=new_list,
- text=inline_list_item_text,
- prov=prov,
- )
- return
-
- def handle_title(self, shape, parent_slide, slide_ind, doc):
- placeholder_type = shape.placeholder_format.type
- txt = shape.text.strip()
- prov = self.generate_prov(shape, slide_ind, txt)
-
- if len(txt.strip()) > 0:
- # title = slide.shapes.title.text if slide.shapes.title else "No title"
- if placeholder_type in [PP_PLACEHOLDER.CENTER_TITLE, PP_PLACEHOLDER.TITLE]:
- _log.info(f"Title found: {shape.text}")
- doc.add_text(
- label=DocItemLabel.TITLE, parent=parent_slide, text=txt, prov=prov
- )
- elif placeholder_type == PP_PLACEHOLDER.SUBTITLE:
- _log.info(f"Subtitle found: {shape.text}")
- # Using DocItemLabel.SECTION_HEADER, since a dedicated SUBTITLE label is not available.
- doc.add_text( - label=DocItemLabel.SECTION_HEADER, - parent=parent_slide, - text=txt, - prov=prov, - ) - return - - def handle_pictures(self, shape, parent_slide, slide_ind, doc, slide_size): - # Open it with PIL - try: - # Get the image bytes - image = shape.image - image_bytes = image.blob - im_dpi, _ = image.dpi - pil_image = Image.open(BytesIO(image_bytes)) - - # shape has picture - prov = self.generate_prov(shape, slide_ind, "", slide_size) - doc.add_picture( - parent=parent_slide, - image=ImageRef.from_pil(image=pil_image, dpi=im_dpi), - caption=None, - prov=prov, - ) - except (UnidentifiedImageError, OSError) as e: - _log.warning(f"Warning: image cannot be loaded by Pillow: {e}") - return - - def handle_tables(self, shape, parent_slide, slide_ind, doc, slide_size): - # Handling tables, images, charts - if shape.has_table: - table = shape.table - table_xml = shape._element - - prov = self.generate_prov(shape, slide_ind, "", slide_size) - - num_cols = 0 - num_rows = len(table.rows) - tcells = [] - # Access the XML element for the shape that contains the table - table_xml = shape._element - - for row_idx, row in enumerate(table.rows): - if len(row.cells) > num_cols: - num_cols = len(row.cells) - for col_idx, cell in enumerate(row.cells): - # Access the XML of the cell (this is the 'tc' element in table XML) - cell_xml = table_xml.xpath( - f".//a:tbl/a:tr[{row_idx + 1}]/a:tc[{col_idx + 1}]" - ) - - if not cell_xml: - continue # If no cell XML is found, skip - - cell_xml = cell_xml[0] # Get the first matching XML node - row_span = cell_xml.get("rowSpan") # Vertical span - col_span = cell_xml.get("gridSpan") # Horizontal span - - if row_span is None: - row_span = 1 - else: - row_span = int(row_span) - - if col_span is None: - col_span = 1 - else: - col_span = int(col_span) - - icell = TableCell( - text=cell.text.strip(), - row_span=row_span, - col_span=col_span, - start_row_offset_idx=row_idx, - end_row_offset_idx=row_idx + row_span, - 
start_col_offset_idx=col_idx, - end_col_offset_idx=col_idx + col_span, - col_header=False, - row_header=False, - ) - if len(cell.text.strip()) > 0: - tcells.append(icell) - # Initialize Docling TableData - data = TableData(num_rows=num_rows, num_cols=num_cols, table_cells=[]) - # Populate - for tcell in tcells: - data.table_cells.append(tcell) - if len(tcells) > 0: - # If table is not fully empty... - # Create Docling table - doc.add_table(parent=parent_slide, data=data, prov=prov) - return - - def walk_linear(self, pptx_obj, doc) -> DoclingDocument: - # Units of size in PPTX by default are EMU units (English Metric Units) - slide_width = pptx_obj.slide_width - slide_height = pptx_obj.slide_height - - text_content = [] # type: ignore - - max_levels = 10 - parents = {} # type: ignore - for i in range(0, max_levels): - parents[i] = None - - # Loop through each slide - for slide_num, slide in enumerate(pptx_obj.slides): - slide_ind = pptx_obj.slides.index(slide) - parent_slide = doc.add_group( - name=f"slide-{slide_ind}", label=GroupLabel.CHAPTER, parent=parents[0] - ) - - slide_size = Size(width=slide_width, height=slide_height) - parent_page = doc.add_page(page_no=slide_ind + 1, size=slide_size) - - def handle_shapes(shape, parent_slide, slide_ind, doc, slide_size): - handle_groups(shape, parent_slide, slide_ind, doc, slide_size) - if shape.has_table: - # Handle Tables - self.handle_tables(shape, parent_slide, slide_ind, doc, slide_size) - if shape.shape_type == MSO_SHAPE_TYPE.PICTURE: - # Handle Pictures - self.handle_pictures( - shape, parent_slide, slide_ind, doc, slide_size - ) - # If shape doesn't have any text, move on to the next shape - if not hasattr(shape, "text"): - return - if shape.text is None: - return - if len(shape.text.strip()) == 0: - return - if not shape.has_text_frame: - _log.warning("Warning: shape has text but not text_frame") - return - # Handle other text elements, including lists (bullet lists, numbered lists) - self.handle_text_elements( 
- shape, parent_slide, slide_ind, doc, slide_size - ) - return - - def handle_groups(shape, parent_slide, slide_ind, doc, slide_size): - if shape.shape_type == MSO_SHAPE_TYPE.GROUP: - for groupedshape in shape.shapes: - handle_shapes( - groupedshape, parent_slide, slide_ind, doc, slide_size - ) - - # Loop through each shape in the slide - for shape in slide.shapes: - handle_shapes(shape, parent_slide, slide_ind, doc, slide_size) - - return doc diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/msword_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/msword_backend.py deleted file mode 100644 index 1a504bcb7dac70117871b379df74e1ff1dc56779..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/msword_backend.py +++ /dev/null @@ -1,582 +0,0 @@ -import logging -import re -from io import BytesIO -from pathlib import Path -from typing import Any, Optional, Union - -from docling_core.types.doc import ( - DocItemLabel, - DoclingDocument, - DocumentOrigin, - GroupLabel, - ImageRef, - NodeItem, - TableCell, - TableData, -) -from docx import Document -from docx.document import Document as DocxDocument -from docx.oxml.table import CT_Tc -from docx.oxml.xmlchemy import BaseOxmlElement -from docx.table import Table, _Cell -from docx.text.paragraph import Paragraph -from lxml import etree -from lxml.etree import XPath -from PIL import Image, UnidentifiedImageError -from typing_extensions import override - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class MsWordDocumentBackend(DeclarativeDocumentBackend): - @override - def __init__( - self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path] - ) -> None: - super().__init__(in_doc, path_or_stream) - self.XML_KEY = ( - 
"{http://schemas.openxmlformats.org/wordprocessingml/2006/main}val" - ) - self.xml_namespaces = { - "w": "http://schemas.microsoft.com/office/word/2003/wordml" - } - # self.initialise(path_or_stream) - # Word file: - self.path_or_stream: Union[BytesIO, Path] = path_or_stream - self.valid: bool = False - # Initialise the parents for the hierarchy - self.max_levels: int = 10 - self.level_at_new_list: Optional[int] = None - self.parents: dict[int, Optional[NodeItem]] = {} - for i in range(-1, self.max_levels): - self.parents[i] = None - - self.level = 0 - self.listIter = 0 - - self.history: dict[str, Any] = { - "names": [None], - "levels": [None], - "numids": [None], - "indents": [None], - } - - self.docx_obj = None - try: - if isinstance(self.path_or_stream, BytesIO): - self.docx_obj = Document(self.path_or_stream) - elif isinstance(self.path_or_stream, Path): - self.docx_obj = Document(str(self.path_or_stream)) - - self.valid = True - except Exception as e: - raise RuntimeError( - f"MsWordDocumentBackend could not load document with hash {self.document_hash}" - ) from e - - @override - def is_valid(self) -> bool: - return self.valid - - @classmethod - @override - def supports_pagination(cls) -> bool: - return False - - @override - def unload(self): - if isinstance(self.path_or_stream, BytesIO): - self.path_or_stream.close() - - self.path_or_stream = None - - @classmethod - @override - def supported_formats(cls) -> set[InputFormat]: - return {InputFormat.DOCX} - - @override - def convert(self) -> DoclingDocument: - """Parses the DOCX into a structured document model. - - Returns: - The parsed document.
- """ - - origin = DocumentOrigin( - filename=self.file.name or "file", - mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document", - binary_hash=self.document_hash, - ) - - doc = DoclingDocument(name=self.file.stem or "file", origin=origin) - if self.is_valid(): - assert self.docx_obj is not None - doc = self.walk_linear(self.docx_obj.element.body, self.docx_obj, doc) - return doc - else: - raise RuntimeError( - f"Cannot convert doc with {self.document_hash} because the backend failed to init." - ) - - def update_history( - self, - name: str, - level: Optional[int], - numid: Optional[int], - ilevel: Optional[int], - ): - self.history["names"].append(name) - self.history["levels"].append(level) - - self.history["numids"].append(numid) - self.history["indents"].append(ilevel) - - def prev_name(self) -> Optional[str]: - return self.history["names"][-1] - - def prev_level(self) -> Optional[int]: - return self.history["levels"][-1] - - def prev_numid(self) -> Optional[int]: - return self.history["numids"][-1] - - def prev_indent(self) -> Optional[int]: - return self.history["indents"][-1] - - def get_level(self) -> int: - """Return the first None index.""" - for k, v in self.parents.items(): - if k >= 0 and v == None: - return k - return 0 - - def walk_linear( - self, - body: BaseOxmlElement, - docx_obj: DocxDocument, - doc: DoclingDocument, - ) -> DoclingDocument: - for element in body: - tag_name = etree.QName(element).localname - # Check for Inline Images (blip elements) - namespaces = { - "a": "http://schemas.openxmlformats.org/drawingml/2006/main", - "r": "http://schemas.openxmlformats.org/officeDocument/2006/relationships", - "w": "http://schemas.openxmlformats.org/wordprocessingml/2006/main", - } - xpath_expr = XPath(".//a:blip", namespaces=namespaces) - drawing_blip = xpath_expr(element) - - # Check for Tables - if element.tag.endswith("tbl"): - try: - self.handle_tables(element, docx_obj, doc) - except Exception: - _log.debug("could 
not parse a table, broken docx table") - - elif drawing_blip: - self.handle_pictures(docx_obj, drawing_blip, doc) - # Check for the sdt containers, like table of contents - elif tag_name in ["sdt"]: - sdt_content = element.find(".//w:sdtContent", namespaces=namespaces) - if sdt_content is not None: - # Iterate paragraphs, runs, or text inside . - paragraphs = sdt_content.findall(".//w:p", namespaces=namespaces) - for p in paragraphs: - self.handle_text_elements(p, docx_obj, doc) - # Check for Text - elif tag_name in ["p"]: - # "tcPr", "sectPr" - self.handle_text_elements(element, docx_obj, doc) - else: - _log.debug(f"Ignoring element in DOCX with tag: {tag_name}") - return doc - - def str_to_int(self, s: Optional[str], default: Optional[int] = 0) -> Optional[int]: - if s is None: - return None - try: - return int(s) - except ValueError: - return default - - def split_text_and_number(self, input_string: str) -> list[str]: - match = re.match(r"(\D+)(\d+)$|^(\d+)(\D+)", input_string) - if match: - parts = list(filter(None, match.groups())) - return parts - else: - return [input_string] - - def get_numId_and_ilvl( - self, paragraph: Paragraph - ) -> tuple[Optional[int], Optional[int]]: - # Access the XML element of the paragraph - numPr = paragraph._element.find( - ".//w:numPr", namespaces=paragraph._element.nsmap - ) - - if numPr is not None: - # Get the numId element and extract the value - numId_elem = numPr.find("w:numId", namespaces=paragraph._element.nsmap) - ilvl_elem = numPr.find("w:ilvl", namespaces=paragraph._element.nsmap) - numId = numId_elem.get(self.XML_KEY) if numId_elem is not None else None - ilvl = ilvl_elem.get(self.XML_KEY) if ilvl_elem is not None else None - - return self.str_to_int(numId, None), self.str_to_int(ilvl, None) - - return None, None # If the paragraph is not part of a list - - def get_label_and_level(self, paragraph: Paragraph) -> tuple[str, Optional[int]]: - if paragraph.style is None: - return "Normal", None - label = 
paragraph.style.style_id - if label is None: - return "Normal", None - if ":" in label: - parts = label.split(":") - - if len(parts) == 2: - return parts[0], self.str_to_int(parts[1], None) - - parts = self.split_text_and_number(label) - - if "Heading" in label and len(parts) == 2: - parts.sort() - label_str: str = "" - label_level: Optional[int] = 0 - if parts[0] == "Heading": - label_str = parts[0] - label_level = self.str_to_int(parts[1], None) - if parts[1] == "Heading": - label_str = parts[1] - label_level = self.str_to_int(parts[0], None) - return label_str, label_level - else: - return label, None - - def handle_text_elements( - self, - element: BaseOxmlElement, - docx_obj: DocxDocument, - doc: DoclingDocument, - ) -> None: - paragraph = Paragraph(element, docx_obj) - - if paragraph.text is None: - return - text = paragraph.text.strip() - - # Common styles for bullet and numbered lists. - # "List Bullet", "List Number", "List Paragraph" - # Identify wether list is a numbered list or not - # is_numbered = "List Bullet" not in paragraph.style.name - is_numbered = False - p_style_id, p_level = self.get_label_and_level(paragraph) - numid, ilevel = self.get_numId_and_ilvl(paragraph) - - if numid == 0: - numid = None - - # Handle lists - if ( - numid is not None - and ilevel is not None - and p_style_id not in ["Title", "Heading"] - ): - self.add_listitem( - doc, - numid, - ilevel, - text, - is_numbered, - ) - self.update_history(p_style_id, p_level, numid, ilevel) - return - elif ( - numid is None - and self.prev_numid() is not None - and p_style_id not in ["Title", "Heading"] - ): # Close list - if self.level_at_new_list: - for key in range(len(self.parents)): - if key >= self.level_at_new_list: - self.parents[key] = None - self.level = self.level_at_new_list - 1 - self.level_at_new_list = None - else: - for key in range(len(self.parents)): - self.parents[key] = None - self.level = 0 - - if p_style_id in ["Title"]: - for key in range(len(self.parents)): - 
self.parents[key] = None - self.parents[0] = doc.add_text( - parent=None, label=DocItemLabel.TITLE, text=text - ) - elif "Heading" in p_style_id: - self.add_header(doc, p_level, text) - - elif p_style_id in [ - "Paragraph", - "Normal", - "Subtitle", - "Author", - "DefaultText", - "ListParagraph", - "ListBullet", - "Quote", - ]: - level = self.get_level() - doc.add_text( - label=DocItemLabel.PARAGRAPH, parent=self.parents[level - 1], text=text - ) - - else: - # Text style names can, and will have, not only default values but user values too - # hence we treat all other labels as pure text - level = self.get_level() - doc.add_text( - label=DocItemLabel.PARAGRAPH, parent=self.parents[level - 1], text=text - ) - - self.update_history(p_style_id, p_level, numid, ilevel) - return - - def add_header( - self, doc: DoclingDocument, curr_level: Optional[int], text: str - ) -> None: - level = self.get_level() - if isinstance(curr_level, int): - if curr_level > level: - # add invisible group - for i in range(level, curr_level): - self.parents[i] = doc.add_group( - parent=self.parents[i - 1], - label=GroupLabel.SECTION, - name=f"header-{i}", - ) - elif curr_level < level: - # remove the tail - for key in range(len(self.parents)): - if key >= curr_level: - self.parents[key] = None - - self.parents[curr_level] = doc.add_heading( - parent=self.parents[curr_level - 1], - text=text, - level=curr_level, - ) - else: - self.parents[self.level] = doc.add_heading( - parent=self.parents[self.level - 1], - text=text, - level=1, - ) - return - - def add_listitem( - self, - doc: DoclingDocument, - numid: int, - ilevel: int, - text: str, - is_numbered: bool = False, - ) -> None: - enum_marker = "" - - level = self.get_level() - prev_indent = self.prev_indent() - if self.prev_numid() is None: # Open new list - self.level_at_new_list = level - - self.parents[level] = doc.add_group( - label=GroupLabel.LIST, name="list", parent=self.parents[level - 1] - ) - - # Set marker and enumerated arguments 
if this is an enumeration element. - self.listIter += 1 - if is_numbered: - enum_marker = str(self.listIter) + "." - is_numbered = True - doc.add_list_item( - marker=enum_marker, - enumerated=is_numbered, - parent=self.parents[level], - text=text, - ) - - elif ( - self.prev_numid() == numid - and self.level_at_new_list is not None - and prev_indent is not None - and prev_indent < ilevel - ): # Open indented list - for i in range( - self.level_at_new_list + prev_indent + 1, - self.level_at_new_list + ilevel + 1, - ): - # Determine if this is an unordered list or an ordered list. - # Set GroupLabel.ORDERED_LIST when it fits. - self.listIter = 0 - if is_numbered: - self.parents[i] = doc.add_group( - label=GroupLabel.ORDERED_LIST, - name="list", - parent=self.parents[i - 1], - ) - else: - self.parents[i] = doc.add_group( - label=GroupLabel.LIST, name="list", parent=self.parents[i - 1] - ) - - # TODO: Set marker and enumerated arguments if this is an enumeration element. - self.listIter += 1 - if is_numbered: - enum_marker = str(self.listIter) + "." - is_numbered = True - doc.add_list_item( - marker=enum_marker, - enumerated=is_numbered, - parent=self.parents[self.level_at_new_list + ilevel], - text=text, - ) - - elif ( - self.prev_numid() == numid - and self.level_at_new_list is not None - and prev_indent is not None - and ilevel < prev_indent - ): # Close list - for k, v in self.parents.items(): - if k > self.level_at_new_list + ilevel: - self.parents[k] = None - - # TODO: Set marker and enumerated arguments if this is an enumeration element. - self.listIter += 1 - if is_numbered: - enum_marker = str(self.listIter) + "." - is_numbered = True - doc.add_list_item( - marker=enum_marker, - enumerated=is_numbered, - parent=self.parents[self.level_at_new_list + ilevel], - text=text, - ) - self.listIter = 0 - - elif self.prev_numid() == numid or prev_indent == ilevel: - # TODO: Set marker and enumerated arguments if this is an enumeration element. 
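The `add_listitem` branches above open, indent, dedent, or close list groups by comparing the current paragraph's `numId`/`ilvl` pair against the previous one. A minimal, self-contained sketch of that bookkeeping (the function `list_transition` and its labels are illustrative, not part of docling's API):

```python
# Sketch of the numId/ilvl comparison behind add_listitem: given the
# (numid, ilvl) of the previous and current DOCX paragraphs, decide
# which structural action the walker would take.
from typing import Optional, Tuple

Pair = Tuple[Optional[int], Optional[int]]

def list_transition(prev: Pair, curr: Pair) -> str:
    prev_numid, prev_ilvl = prev
    curr_numid, curr_ilvl = curr
    if curr_numid is None:
        # paragraph is not a list item; any open list gets closed
        return "close" if prev_numid is not None else "none"
    if prev_numid is None or prev_numid != curr_numid:
        return "open"      # first item of a (new) list
    if curr_ilvl > prev_ilvl:
        return "indent"    # open a nested (possibly ordered) sub-list
    if curr_ilvl < prev_ilvl:
        return "dedent"    # close the nested sub-list(s)
    return "same"          # sibling item at the same indent level

print(list_transition((None, None), (1, 0)))  # open
print(list_transition((1, 0), (1, 1)))        # indent
print(list_transition((1, 1), (None, None)))  # close
```

The real backend additionally remembers `level_at_new_list` so that a "dedent" or "close" can reset exactly the parent slots that the nested lists occupied.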
- self.listIter += 1 - if is_numbered: - enum_marker = str(self.listIter) + "." - is_numbered = True - doc.add_list_item( - marker=enum_marker, - enumerated=is_numbered, - parent=self.parents[level - 1], - text=text, - ) - return - - def handle_tables( - self, - element: BaseOxmlElement, - docx_obj: DocxDocument, - doc: DoclingDocument, - ) -> None: - table: Table = Table(element, docx_obj) - num_rows = len(table.rows) - num_cols = len(table.columns) - _log.debug(f"Table grid with {num_rows} rows and {num_cols} columns") - - if num_rows == 1 and num_cols == 1: - cell_element = table.rows[0].cells[0] - # In case we have a table of only 1 cell, we consider it furniture - # And proceed processing the content of the cell as though it's in the document body - self.walk_linear(cell_element._element, docx_obj, doc) - return - - data = TableData(num_rows=num_rows, num_cols=num_cols) - cell_set: set[CT_Tc] = set() - for row_idx, row in enumerate(table.rows): - _log.debug(f"Row index {row_idx} with {len(row.cells)} populated cells") - col_idx = 0 - while col_idx < num_cols: - cell: _Cell = row.cells[col_idx] - _log.debug( - f" col {col_idx} grid_span {cell.grid_span} grid_cols_before {row.grid_cols_before}" - ) - if cell is None or cell._tc in cell_set: - _log.debug(f" skipped since repeated content") - col_idx += cell.grid_span - continue - else: - cell_set.add(cell._tc) - - spanned_idx = row_idx - spanned_tc: Optional[CT_Tc] = cell._tc - while spanned_tc == cell._tc: - spanned_idx += 1 - spanned_tc = ( - table.rows[spanned_idx].cells[col_idx]._tc - if spanned_idx < num_rows - else None - ) - _log.debug(f" spanned before row {spanned_idx}") - - table_cell = TableCell( - text=cell.text, - row_span=spanned_idx - row_idx, - col_span=cell.grid_span, - start_row_offset_idx=row.grid_cols_before + row_idx, - end_row_offset_idx=row.grid_cols_before + spanned_idx, - start_col_offset_idx=col_idx, - end_col_offset_idx=col_idx + cell.grid_span, - col_header=False, - row_header=False, - 
) - data.table_cells.append(table_cell) - col_idx += cell.grid_span - - level = self.get_level() - doc.add_table(data=data, parent=self.parents[level - 1]) - return - - def handle_pictures( - self, docx_obj: DocxDocument, drawing_blip: Any, doc: DoclingDocument - ) -> None: - def get_docx_image(drawing_blip): - rId = drawing_blip[0].get( - "{http://schemas.openxmlformats.org/officeDocument/2006/relationships}embed" - ) - if rId in docx_obj.part.rels: - # Access the image part using the relationship ID - image_part = docx_obj.part.rels[rId].target_part - image_data = image_part.blob # Get the binary image data - return image_data - - level = self.get_level() - # Open the BytesIO object with PIL to create an Image - try: - image_data = get_docx_image(drawing_blip) - image_bytes = BytesIO(image_data) - pil_image = Image.open(image_bytes) - doc.add_picture( - parent=self.parents[level - 1], - image=ImageRef.from_pil(image=pil_image, dpi=72), - caption=None, - ) - except (UnidentifiedImageError, OSError) as e: - _log.warning("Warning: image cannot be loaded by Pillow") - doc.add_picture( - parent=self.parents[level - 1], - caption=None, - ) - return diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/pdf_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/pdf_backend.py deleted file mode 100644 index 35c83b8c549a7be6da564b3b36262cd0c852b6d3..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/pdf_backend.py +++ /dev/null @@ -1,76 +0,0 @@ -from abc import ABC, abstractmethod -from io import BytesIO -from pathlib import Path -from typing import Iterable, Optional, Set, Union - -from docling_core.types.doc import BoundingBox, Size -from PIL import Image - -from docling.backend.abstract_backend import PaginatedDocumentBackend -from docling.datamodel.base_models import Cell, InputFormat -from docling.datamodel.document import InputDocument - - -class PdfPageBackend(ABC): - @abstractmethod - def 
get_text_in_rect(self, bbox: BoundingBox) -> str: - pass - - @abstractmethod - def get_text_cells(self) -> Iterable[Cell]: - pass - - @abstractmethod - def get_bitmap_rects(self, scale: float = 1) -> Iterable[BoundingBox]: - pass - - @abstractmethod - def get_page_image( - self, scale: float = 1, cropbox: Optional[BoundingBox] = None - ) -> Image.Image: - pass - - @abstractmethod - def get_size(self) -> Size: - pass - - @abstractmethod - def is_valid(self) -> bool: - pass - - @abstractmethod - def unload(self): - pass - - -class PdfDocumentBackend(PaginatedDocumentBackend): - def __init__(self, in_doc: InputDocument, path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - - if self.input_format is not InputFormat.PDF: - if self.input_format is InputFormat.IMAGE: - buf = BytesIO() - img = Image.open(self.path_or_stream) - img.save(buf, "PDF") - buf.seek(0) - self.path_or_stream = buf - else: - raise RuntimeError( - f"Incompatible file format {self.input_format} was passed to a PdfDocumentBackend."
- ) - - @abstractmethod - def load_page(self, page_no: int) -> PdfPageBackend: - pass - - @abstractmethod - def page_count(self) -> int: - pass - - @classmethod - def supported_formats(cls) -> Set[InputFormat]: - return {InputFormat.PDF} - - @classmethod - def supports_pagination(cls) -> bool: - return True diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/pypdfium2_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/pypdfium2_backend.py deleted file mode 100644 index 5b627da70aca63f1e53280f569decd6cda1186c5..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/pypdfium2_backend.py +++ /dev/null @@ -1,260 +0,0 @@ -import logging -import random -from io import BytesIO -from pathlib import Path -from typing import TYPE_CHECKING, Iterable, List, Optional, Union - -import pypdfium2 as pdfium -import pypdfium2.raw as pdfium_c -from docling_core.types.doc import BoundingBox, CoordOrigin, Size -from PIL import Image, ImageDraw -from pypdfium2 import PdfTextPage -from pypdfium2._helpers.misc import PdfiumError - -from docling.backend.pdf_backend import PdfDocumentBackend, PdfPageBackend -from docling.datamodel.base_models import Cell - -if TYPE_CHECKING: - from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class PyPdfiumPageBackend(PdfPageBackend): - def __init__( - self, pdfium_doc: pdfium.PdfDocument, document_hash: str, page_no: int - ): - self.valid = True # No better way to tell from pypdfium. 
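The contract established above (a document backend exposes `page_count`/`load_page`/`is_valid`, and `PyPdfiumPageBackend` flags a failed page load instead of raising) can be sketched with a toy backend; all class names here are illustrative stand-ins, not docling classes:

```python
# Toy illustration of the backend/page contract: the document is "valid"
# when it has at least one page, and a page marks itself invalid rather
# than propagating an exception when loading fails.
from typing import List, Optional

class FakePage:
    def __init__(self, pages: List[str], page_no: int) -> None:
        self.valid = True
        self._page: Optional[str] = None
        try:
            self._page = pages[page_no]
        except IndexError:
            # Mirror PyPdfiumPageBackend: log-and-flag, do not raise.
            self.valid = False

    def is_valid(self) -> bool:
        return self.valid

class FakeDocumentBackend:
    def __init__(self, pages: List[str]) -> None:
        self._pages = pages

    def page_count(self) -> int:
        return len(self._pages)

    def load_page(self, page_no: int) -> FakePage:
        return FakePage(self._pages, page_no)

    def is_valid(self) -> bool:
        return self.page_count() > 0

backend = FakeDocumentBackend(["page-1", "page-2"])
print(backend.is_valid())               # True
print(backend.load_page(5).is_valid())  # False: out-of-range page, no exception
```

This log-and-flag design lets callers iterate over a partially corrupt PDF and skip bad pages instead of aborting the whole conversion.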
- try: - self._ppage: pdfium.PdfPage = pdfium_doc[page_no] - except PdfiumError as e: - _log.info( - f"An exception occurred when loading page {page_no} of document {document_hash}.", - exc_info=True, - ) - self.valid = False - self.text_page: Optional[PdfTextPage] = None - - def is_valid(self) -> bool: - return self.valid - - def get_bitmap_rects(self, scale: float = 1) -> Iterable[BoundingBox]: - AREA_THRESHOLD = 0 # 32 * 32 - for obj in self._ppage.get_objects(filter=[pdfium_c.FPDF_PAGEOBJ_IMAGE]): - pos = obj.get_pos() - cropbox = BoundingBox.from_tuple( - pos, origin=CoordOrigin.BOTTOMLEFT - ).to_top_left_origin(page_height=self.get_size().height) - - if cropbox.area() > AREA_THRESHOLD: - cropbox = cropbox.scaled(scale=scale) - - yield cropbox - - def get_text_in_rect(self, bbox: BoundingBox) -> str: - if not self.text_page: - self.text_page = self._ppage.get_textpage() - - if bbox.coord_origin != CoordOrigin.BOTTOMLEFT: - bbox = bbox.to_bottom_left_origin(self.get_size().height) - - text_piece = self.text_page.get_text_bounded(*bbox.as_tuple()) - - return text_piece - - def get_text_cells(self) -> Iterable[Cell]: - if not self.text_page: - self.text_page = self._ppage.get_textpage() - - cells = [] - cell_counter = 0 - - page_size = self.get_size() - - for i in range(self.text_page.count_rects()): - rect = self.text_page.get_rect(i) - text_piece = self.text_page.get_text_bounded(*rect) - x0, y0, x1, y1 = rect - cells.append( - Cell( - id=cell_counter, - text=text_piece, - bbox=BoundingBox( - l=x0, b=y0, r=x1, t=y1, coord_origin=CoordOrigin.BOTTOMLEFT - ).to_top_left_origin(page_size.height), - ) - ) - cell_counter += 1 - - # PyPdfium2 produces very fragmented cells, with sub-word level boundaries, in many PDFs. - # The cell merging code below is to clean this up. 
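Both `get_bitmap_rects` and `get_text_cells` above call `to_top_left_origin`, because pypdfium2 reports rectangles with a bottom-left origin while the layout pipeline works top-left. A minimal sketch of that flip, under the assumption that a box stores `l`/`b`/`r`/`t` edges like docling's `BoundingBox` (the `BBox` dataclass here is illustrative):

```python
# Sketch of the coordinate flip: in a bottom-left origin y grows upwards,
# so each y-coordinate becomes page_height - y when re-expressed from the
# top edge of the page.
from dataclasses import dataclass

@dataclass
class BBox:
    l: float  # left edge
    b: float  # bottom edge (bottom-left origin)
    r: float  # right edge
    t: float  # top edge

def to_top_left_origin(bbox: BBox, page_height: float) -> BBox:
    # After the flip, t < b because y now grows downwards.
    return BBox(l=bbox.l, b=page_height - bbox.b, r=bbox.r, t=page_height - bbox.t)

# A box spanning y in [20, 30] near the bottom of a 100-unit-tall page:
box = BBox(l=5, b=20, r=50, t=30)
flipped = to_top_left_origin(box, page_height=100)
print(flipped)  # -> BBox(l=5, b=80, r=50, t=70), measured from the top edge
```

Note the flip is its own inverse, which is why the code above can convert back with `to_bottom_left_origin` using the same page height.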
- def merge_horizontal_cells( - cells: List[Cell], - horizontal_threshold_factor: float = 1.0, - vertical_threshold_factor: float = 0.5, - ) -> List[Cell]: - if not cells: - return [] - - def group_rows(cells: List[Cell]) -> List[List[Cell]]: - rows = [] - current_row = [cells[0]] - row_top = cells[0].bbox.t - row_bottom = cells[0].bbox.b - row_height = cells[0].bbox.height - - for cell in cells[1:]: - vertical_threshold = row_height * vertical_threshold_factor - if ( - abs(cell.bbox.t - row_top) <= vertical_threshold - and abs(cell.bbox.b - row_bottom) <= vertical_threshold - ): - current_row.append(cell) - row_top = min(row_top, cell.bbox.t) - row_bottom = max(row_bottom, cell.bbox.b) - row_height = row_bottom - row_top - else: - rows.append(current_row) - current_row = [cell] - row_top = cell.bbox.t - row_bottom = cell.bbox.b - row_height = cell.bbox.height - - if current_row: - rows.append(current_row) - - return rows - - def merge_row(row: List[Cell]) -> List[Cell]: - merged = [] - current_group = [row[0]] - - for cell in row[1:]: - prev_cell = current_group[-1] - avg_height = (prev_cell.bbox.height + cell.bbox.height) / 2 - if ( - cell.bbox.l - prev_cell.bbox.r - <= avg_height * horizontal_threshold_factor - ): - current_group.append(cell) - else: - merged.append(merge_group(current_group)) - current_group = [cell] - - if current_group: - merged.append(merge_group(current_group)) - - return merged - - def merge_group(group: List[Cell]) -> Cell: - if len(group) == 1: - return group[0] - - merged_text = "".join(cell.text for cell in group) - merged_bbox = BoundingBox( - l=min(cell.bbox.l for cell in group), - t=min(cell.bbox.t for cell in group), - r=max(cell.bbox.r for cell in group), - b=max(cell.bbox.b for cell in group), - ) - return Cell(id=group[0].id, text=merged_text, bbox=merged_bbox) - - rows = group_rows(cells) - merged_cells = [cell for row in rows for cell in merge_row(row)] - - for i, cell in enumerate(merged_cells, 1): - cell.id = i - - return 
merged_cells - - def draw_clusters_and_cells(): - image = ( - self.get_page_image() - ) # make new image to avoid drawing on the saved ones - draw = ImageDraw.Draw(image) - for c in cells: - x0, y0, x1, y1 = c.bbox.as_tuple() - cell_color = ( - random.randint(30, 140), - random.randint(30, 140), - random.randint(30, 140), - ) - draw.rectangle([(x0, y0), (x1, y1)], outline=cell_color) - image.show() - - # before merge: - # draw_clusters_and_cells() - - cells = merge_horizontal_cells(cells) - - # after merge: - # draw_clusters_and_cells() - - return cells - - def get_page_image( - self, scale: float = 1, cropbox: Optional[BoundingBox] = None - ) -> Image.Image: - - page_size = self.get_size() - - if not cropbox: - cropbox = BoundingBox( - l=0, - r=page_size.width, - t=0, - b=page_size.height, - coord_origin=CoordOrigin.TOPLEFT, - ) - padbox = BoundingBox( - l=0, r=0, t=0, b=0, coord_origin=CoordOrigin.BOTTOMLEFT - ) - else: - padbox = cropbox.to_bottom_left_origin(page_size.height).model_copy() - padbox.r = page_size.width - padbox.r - padbox.t = page_size.height - padbox.t - - image = ( - self._ppage.render( - scale=scale * 1.5, - rotation=0, # no additional rotation - crop=padbox.as_tuple(), - ) - .to_pil() - .resize(size=(round(cropbox.width * scale), round(cropbox.height * scale))) - ) # We resize the image from 1.5x the given scale to make it sharper. 
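The `get_page_image` arithmetic above packs three decisions into one expression: the crop is handed to pdfium as padding from each page edge, rendering happens at 1.5x the requested scale, and the result is resized down to `cropbox * scale` for sharpness. A pure-Python sketch of that geometry (plain tuples instead of docling's `BoundingBox`; names are illustrative):

```python
# Sketch of the crop/size arithmetic in get_page_image: pdfium's render()
# takes a (left, bottom, right, top) padding tuple measured inward from the
# page edges, so right/top pads are page extent minus the cropbox edge.
# Cropbox coordinates here are bottom-left origin, so crop_t > crop_b.
def render_geometry(page_w, page_h, crop_l, crop_b, crop_r, crop_t, scale=1.0):
    pad = (crop_l, crop_b, page_w - crop_r, page_h - crop_t)
    render_scale = scale * 1.5  # oversample, then downscale for a sharper image
    final_size = (
        round((crop_r - crop_l) * scale),
        round((crop_t - crop_b) * scale),
    )
    return pad, render_scale, final_size

# Full US-letter page, no crop, scale 1: zero padding, 1.5x oversampling.
print(render_geometry(612, 792, 0, 0, 612, 792))
# A 300x400 crop rendered at 2x ends up as a 600x800 image.
print(render_geometry(612, 792, 100, 100, 400, 500, scale=2))
```

Keeping the oversampling factor separate from the caller's `scale` means callers reason only about output pixels while the backend quietly trades render time for quality.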
- - return image - - def get_size(self) -> Size: - return Size(width=self._ppage.get_width(), height=self._ppage.get_height()) - - def unload(self): - self._ppage = None - self.text_page = None - - -class PyPdfiumDocumentBackend(PdfDocumentBackend): - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - - try: - self._pdoc = pdfium.PdfDocument(self.path_or_stream) - except PdfiumError as e: - raise RuntimeError( - f"pypdfium could not load document with hash {self.document_hash}" - ) from e - - def page_count(self) -> int: - return len(self._pdoc) - - def load_page(self, page_no: int) -> PyPdfiumPageBackend: - return PyPdfiumPageBackend(self._pdoc, self.document_hash, page_no) - - def is_valid(self) -> bool: - return self.page_count() > 0 - - def unload(self): - super().unload() - self._pdoc.close() - self._pdoc = None diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/pubmed_backend.py b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/pubmed_backend.py deleted file mode 100644 index acbcd4e1f8705297317e2fdc105f4fb5b9c0859b..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/pubmed_backend.py +++ /dev/null @@ -1,592 +0,0 @@ -import logging -from io import BytesIO -from pathlib import Path -from typing import Any, Set, Union - -import lxml -from bs4 import BeautifulSoup -from docling_core.types.doc import ( - DocItemLabel, - DoclingDocument, - DocumentOrigin, - GroupLabel, - TableCell, - TableData, -) -from lxml import etree -from typing_extensions import TypedDict, override - -from docling.backend.abstract_backend import 
DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - - -class Paragraph(TypedDict): - text: str - headers: list[str] - - -class Author(TypedDict): - name: str - affiliation_names: list[str] - - -class Table(TypedDict): - label: str - caption: str - content: str - - -class FigureCaption(TypedDict): - label: str - caption: str - - -class Reference(TypedDict): - author_names: str - title: str - journal: str - year: str - - -class XMLComponents(TypedDict): - title: str - authors: list[Author] - abstract: str - paragraphs: list[Paragraph] - tables: list[Table] - figure_captions: list[FigureCaption] - references: list[Reference] - - -class PubMedDocumentBackend(DeclarativeDocumentBackend): - """ - The code from this document backend has been developed by modifying parts of the PubMed Parser library (version 0.5.0, released on 12.08.2024): - Achakulvisut et al., (2020). - Pubmed Parser: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset. - Journal of Open Source Software, 5(46), 1979, - https://doi.org/10.21105/joss.01979 - """ - - @override - def __init__(self, in_doc: "InputDocument", path_or_stream: Union[BytesIO, Path]): - super().__init__(in_doc, path_or_stream) - self.path_or_stream = path_or_stream - - # Initialize parents for the document hierarchy - self.parents: dict = {} - - self.valid = False - try: - if isinstance(self.path_or_stream, BytesIO): - self.path_or_stream.seek(0) - self.tree: lxml.etree._ElementTree = etree.parse(self.path_or_stream) - if "/NLM//DTD JATS" in self.tree.docinfo.public_id: - self.valid = True - except Exception as exc: - raise RuntimeError( - f"Could not initialize PubMed backend for file with hash {self.document_hash}."
- ) from exc - - @override - def is_valid(self) -> bool: - return self.valid - - @classmethod - @override - def supports_pagination(cls) -> bool: - return False - - @override - def unload(self): - if isinstance(self.path_or_stream, BytesIO): - self.path_or_stream.close() - self.path_or_stream = None - - @classmethod - @override - def supported_formats(cls) -> Set[InputFormat]: - return {InputFormat.XML_PUBMED} - - @override - def convert(self) -> DoclingDocument: - # Create empty document - origin = DocumentOrigin( - filename=self.file.name or "file", - mimetype="application/xml", - binary_hash=self.document_hash, - ) - doc = DoclingDocument(name=self.file.stem or "file", origin=origin) - - _log.debug("Trying to convert PubMed XML document...") - - # Get parsed XML components - xml_components: XMLComponents = self._parse() - - # Add XML components to the document - doc = self._populate_document(doc, xml_components) - return doc - - def _parse_title(self) -> str: - title: str = " ".join( - [ - t.replace("\n", "") - for t in self.tree.xpath(".//title-group/article-title")[0].itertext() - ] - ) - return title - - def _parse_authors(self) -> list[Author]: - # Get mapping between affiliation ids and names - affiliation_names = [] - for affiliation_node in self.tree.xpath(".//aff[@id]"): - affiliation_names.append( - ": ".join([t for t in affiliation_node.itertext() if t != "\n"]) - ) - affiliation_ids_names = { - id: name - for id, name in zip(self.tree.xpath(".//aff[@id]/@id"), affiliation_names) - } - - # Get author names and affiliation names - authors: list[Author] = [] - for author_node in self.tree.xpath( - './/contrib-group/contrib[@contrib-type="author"]' - ): - author: Author = { - "name": "", - "affiliation_names": [], - } - - # Affiliation names - affiliation_ids = [ - a.attrib["rid"] for a in author_node.xpath('xref[@ref-type="aff"]') - ] - for id in affiliation_ids: - if id in affiliation_ids_names: - 
author["affiliation_names"].append(affiliation_ids_names[id]) - - # Name - author["name"] = ( - author_node.xpath("name/surname")[0].text - + " " - + author_node.xpath("name/given-names")[0].text - ) - - authors.append(author) - return authors - - def _parse_abstract(self) -> str: - texts = [] - for abstract_node in self.tree.xpath(".//abstract"): - for text in abstract_node.itertext(): - texts.append(text.replace("\n", "")) - abstract: str = "".join(texts) - return abstract - - def _parse_main_text(self) -> list[Paragraph]: - paragraphs: list[Paragraph] = [] - for paragraph_node in self.tree.xpath("//body//p"): - # Skip captions - if "/caption" in paragraph_node.getroottree().getpath(paragraph_node): - continue - - paragraph: Paragraph = {"text": "", "headers": []} - - # Text - paragraph["text"] = "".join( - [t.replace("\n", "") for t in paragraph_node.itertext()] - ) - - # Header - path = "../title" - while len(paragraph_node.xpath(path)) > 0: - paragraph["headers"].append( - "".join( - [ - t.replace("\n", "") - for t in paragraph_node.xpath(path)[0].itertext() - ] - ) - ) - path = "../" + path - - paragraphs.append(paragraph) - - return paragraphs - - def _parse_tables(self) -> list[Table]: - tables: list[Table] = [] - for table_node in self.tree.xpath(".//body//table-wrap"): - table: Table = {"label": "", "caption": "", "content": ""} - - # Content - if len(table_node.xpath("table")) > 0: - table_content_node = table_node.xpath("table")[0] - elif len(table_node.xpath("alternatives/table")) > 0: - table_content_node = table_node.xpath("alternatives/table")[0] - else: - table_content_node = None - if table_content_node != None: - table["content"] = etree.tostring(table_content_node).decode("utf-8") - - # Caption - if len(table_node.xpath("caption/p")) > 0: - caption_node = table_node.xpath("caption/p")[0] - elif len(table_node.xpath("caption/title")) > 0: - caption_node = table_node.xpath("caption/title")[0] - else: - caption_node = None - if caption_node != 
None: - table["caption"] = "".join( - [t.replace("\n", "") for t in caption_node.itertext()] - ) - - # Label - if len(table_node.xpath("label")) > 0: - table["label"] = table_node.xpath("label")[0].text - - tables.append(table) - return tables - - def _parse_figure_captions(self) -> list[FigureCaption]: - figure_captions: list[FigureCaption] = [] - - if not (self.tree.xpath(".//fig")): - return figure_captions - - for figure_node in self.tree.xpath(".//fig"): - figure_caption: FigureCaption = { - "caption": "", - "label": "", - } - - # Label - if figure_node.xpath("label"): - figure_caption["label"] = "".join( - [ - t.replace("\n", "") - for t in figure_node.xpath("label")[0].itertext() - ] - ) - - # Caption - if figure_node.xpath("caption"): - caption = "" - for caption_node in figure_node.xpath("caption")[0].getchildren(): - caption += ( - "".join([t.replace("\n", "") for t in caption_node.itertext()]) - + "\n" - ) - figure_caption["caption"] = caption - - figure_captions.append(figure_caption) - - return figure_captions - - def _parse_references(self) -> list[Reference]: - references: list[Reference] = [] - for reference_node_abs in self.tree.xpath(".//ref-list/ref"): - reference: Reference = { - "author_names": "", - "title": "", - "journal": "", - "year": "", - } - reference_node: Any = None - for tag in ["mixed-citation", "element-citation", "citation"]: - if len(reference_node_abs.xpath(tag)) > 0: - reference_node = reference_node_abs.xpath(tag)[0] - break - - if reference_node is None: - continue - - if all( - not (ref_type in ["citation-type", "publication-type"]) - for ref_type in reference_node.attrib.keys() - ): - continue - - # Author names - names = [] - if len(reference_node.xpath("name")) > 0: - for name_node in reference_node.xpath("name"): - name_str = " ".join( - [t.text for t in name_node.getchildren() if (t.text != None)] - ) - names.append(name_str) - elif len(reference_node.xpath("person-group")) > 0: - for name_node in 
reference_node.xpath("person-group")[0]: - name_str = ( - name_node.xpath("given-names")[0].text - + " " - + name_node.xpath("surname")[0].text - ) - names.append(name_str) - reference["author_names"] = "; ".join(names) - - # Title - if len(reference_node.xpath("article-title")) > 0: - reference["title"] = " ".join( - [ - t.replace("\n", " ") - for t in reference_node.xpath("article-title")[0].itertext() - ] - ) - - # Journal - if len(reference_node.xpath("source")) > 0: - reference["journal"] = reference_node.xpath("source")[0].text - - # Year - if len(reference_node.xpath("year")) > 0: - reference["year"] = reference_node.xpath("year")[0].text - - if ( - not (reference_node.xpath("article-title")) - and not (reference_node.xpath("journal")) - and not (reference_node.xpath("year")) - ): - reference["title"] = reference_node.text - - references.append(reference) - return references - - def _parse(self) -> XMLComponents: - """Parsing PubMed document.""" - xml_components: XMLComponents = { - "title": self._parse_title(), - "authors": self._parse_authors(), - "abstract": self._parse_abstract(), - "paragraphs": self._parse_main_text(), - "tables": self._parse_tables(), - "figure_captions": self._parse_figure_captions(), - "references": self._parse_references(), - } - return xml_components - - def _populate_document( - self, doc: DoclingDocument, xml_components: XMLComponents - ) -> DoclingDocument: - self._add_title(doc, xml_components) - self._add_authors(doc, xml_components) - self._add_abstract(doc, xml_components) - self._add_main_text(doc, xml_components) - - if xml_components["tables"]: - self._add_tables(doc, xml_components) - - if xml_components["figure_captions"]: - self._add_figure_captions(doc, xml_components) - - self._add_references(doc, xml_components) - return doc - - def _add_figure_captions( - self, doc: DoclingDocument, xml_components: XMLComponents - ) -> None: - self.parents["Figures"] = doc.add_heading( - parent=self.parents["Title"], 
text="Figures" - ) - for figure_caption_xml_component in xml_components["figure_captions"]: - figure_caption_text = ( - figure_caption_xml_component["label"] - + ": " - + figure_caption_xml_component["caption"].strip() - ) - fig_caption = doc.add_text( - label=DocItemLabel.CAPTION, text=figure_caption_text - ) - doc.add_picture( - parent=self.parents["Figures"], - caption=fig_caption, - ) - return - - def _add_title(self, doc: DoclingDocument, xml_components: XMLComponents) -> None: - self.parents["Title"] = doc.add_text( - parent=None, - text=xml_components["title"], - label=DocItemLabel.TITLE, - ) - return - - def _add_authors(self, doc: DoclingDocument, xml_components: XMLComponents) -> None: - authors_affiliations: list = [] - for author in xml_components["authors"]: - authors_affiliations.append(author["name"]) - authors_affiliations.append(", ".join(author["affiliation_names"])) - authors_affiliations_str = "; ".join(authors_affiliations) - - doc.add_text( - parent=self.parents["Title"], - text=authors_affiliations_str, - label=DocItemLabel.PARAGRAPH, - ) - return - - def _add_abstract( - self, doc: DoclingDocument, xml_components: XMLComponents - ) -> None: - abstract_text: str = xml_components["abstract"] - self.parents["Abstract"] = doc.add_heading( - parent=self.parents["Title"], text="Abstract" - ) - doc.add_text( - parent=self.parents["Abstract"], - text=abstract_text, - label=DocItemLabel.TEXT, - ) - return - - def _add_main_text( - self, doc: DoclingDocument, xml_components: XMLComponents - ) -> None: - added_headers: list = [] - for paragraph in xml_components["paragraphs"]: - if not (paragraph["headers"]): - continue - - # Header - for i, header in enumerate(reversed(paragraph["headers"])): - if header in added_headers: - continue - added_headers.append(header) - - if ((i - 1) >= 0) and list(reversed(paragraph["headers"]))[ - i - 1 - ] in self.parents: - parent = self.parents[list(reversed(paragraph["headers"]))[i - 1]] - else: - parent = 
self.parents["Title"] - - self.parents[header] = doc.add_heading(parent=parent, text=header) - - # Paragraph text - if paragraph["headers"][0] in self.parents: - parent = self.parents[paragraph["headers"][0]] - else: - parent = self.parents["Title"] - - doc.add_text(parent=parent, label=DocItemLabel.TEXT, text=paragraph["text"]) - return - - def _add_references( - self, doc: DoclingDocument, xml_components: XMLComponents - ) -> None: - self.parents["References"] = doc.add_heading( - parent=self.parents["Title"], text="References" - ) - current_list = doc.add_group( - parent=self.parents["References"], label=GroupLabel.LIST, name="list" - ) - for reference in xml_components["references"]: - reference_text: str = "" - if reference["author_names"]: - reference_text += reference["author_names"] + ". " - - if reference["title"]: - reference_text += reference["title"] - if reference["title"][-1] != ".": - reference_text += "." - reference_text += " " - - if reference["journal"]: - reference_text += reference["journal"] - - if reference["year"]: - reference_text += " (" + reference["year"] + ")" - - if not (reference_text): - _log.debug(f"Skipping reference for: {str(self.file)}") - continue - - doc.add_list_item( - text=reference_text, enumerated=False, parent=current_list - ) - return - - def _add_tables(self, doc: DoclingDocument, xml_components: XMLComponents) -> None: - self.parents["Tables"] = doc.add_heading( - parent=self.parents["Title"], text="Tables" - ) - for table_xml_component in xml_components["tables"]: - try: - self._add_table(doc, table_xml_component) - except Exception as e: - _log.debug(f"Skipping unsupported table for: {str(self.file)}") - pass - return - - def _add_table(self, doc: DoclingDocument, table_xml_component: Table) -> None: - soup = BeautifulSoup(table_xml_component["content"], "html.parser") - table_tag = soup.find("table") - - nested_tables = table_tag.find("table") - if nested_tables: - _log.debug(f"Skipping nested table for: 
{str(self.file)}") - return - - # Count the number of rows (number of elements) - num_rows = len(table_tag.find_all("tr")) - - # Find the number of columns (taking into account colspan) - num_cols = 0 - for row in table_tag.find_all("tr"): - col_count = 0 - for cell in row.find_all(["td", "th"]): - colspan = int(cell.get("colspan", 1)) - col_count += colspan - num_cols = max(num_cols, col_count) - - grid = [[None for _ in range(num_cols)] for _ in range(num_rows)] - - data = TableData(num_rows=num_rows, num_cols=num_cols, table_cells=[]) - - # Iterate over the rows in the table - for row_idx, row in enumerate(table_tag.find_all("tr")): - # For each row, find all the column cells (both and ) - cells = row.find_all(["td", "th"]) - - # Check if each cell in the row is a header -> means it is a column header - col_header = True - for j, html_cell in enumerate(cells): - if html_cell.name == "td": - col_header = False - - # Extract and print the text content of each cell - col_idx = 0 - for _, html_cell in enumerate(cells): - text = html_cell.text - - col_span = int(html_cell.get("colspan", 1)) - row_span = int(html_cell.get("rowspan", 1)) - - while grid[row_idx][col_idx] != None: - col_idx += 1 - for r in range(row_span): - for c in range(col_span): - grid[row_idx + r][col_idx + c] = text - - cell = TableCell( - text=text, - row_span=row_span, - col_span=col_span, - start_row_offset_idx=row_idx, - end_row_offset_idx=row_idx + row_span, - start_col_offset_idx=col_idx, - end_col_offset_idx=col_idx + col_span, - col_header=col_header, - row_header=((not col_header) and html_cell.name == "th"), - ) - data.table_cells.append(cell) - - table_caption = doc.add_text( - label=DocItemLabel.CAPTION, - text=table_xml_component["label"] + ": " + table_xml_component["caption"], - ) - doc.add_table(data=data, parent=self.parents["Tables"], caption=table_caption) - return diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/uspto_backend.py 
b/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/uspto_backend.py deleted file mode 100644 index 21001ab7497fdbee3e89e597c494474c2933295b..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/backend/xml/uspto_backend.py +++ /dev/null @@ -1,1888 +0,0 @@ -"""Backend to parse patents from the United States Patent Office (USPTO). - -The parsers included in this module can handle patent grants published since 1976 and -patent applications since 2001. -The original files can be found in https://bulkdata.uspto.gov. -""" - -import html -import logging -import re -import xml.sax -import xml.sax.xmlreader -from abc import ABC, abstractmethod -from enum import Enum, unique -from io import BytesIO -from pathlib import Path -from typing import Any, Final, Optional, Union - -from bs4 import BeautifulSoup, Tag -from docling_core.types.doc import ( - DocItem, - DocItemLabel, - DoclingDocument, - DocumentOrigin, - TableCell, - TableData, - TextItem, -) -from docling_core.types.doc.document import LevelNumber -from pydantic import NonNegativeInt -from typing_extensions import Self, TypedDict, override - -from docling.backend.abstract_backend import DeclarativeDocumentBackend -from docling.datamodel.base_models import InputFormat -from docling.datamodel.document import InputDocument - -_log = logging.getLogger(__name__) - -XML_DECLARATION: Final = '<?xml version="1.0" encoding="UTF-8"?>' - - -@unique -class PatentHeading(Enum): - """Text of docling headings for tagged sections in USPTO patent documents.""" - - ABSTRACT = "ABSTRACT", 2 - CLAIMS = "CLAIMS", 2 - - @override - def __new__(cls, value: str, _) -> Self: - obj = object.__new__(cls) - obj._value_ = value - return obj - - @override - def __init__(self, _, level: LevelNumber) -> None: - self.level: LevelNumber = level - - -class PatentUsptoDocumentBackend(DeclarativeDocumentBackend): - @override - def __init__( - self, in_doc: InputDocument, path_or_stream: Union[BytesIO, Path] - ) -> None: - 
super().__init__(in_doc, path_or_stream) - - self.patent_content: str = "" - self.parser: Optional[PatentUspto] = None - - try: - if isinstance(self.path_or_stream, BytesIO): - while line := self.path_or_stream.readline().decode("utf-8"): - if line.startswith(" None: - doctype_line = doctype.lower() - if doctype == "PATN\n": - self.parser = PatentUsptoGrantAps() - elif "us-patent-application-v4" in doctype_line: - self.parser = PatentUsptoIce() - elif "us-patent-grant-v4" in doctype_line: - self.parser = PatentUsptoIce() - elif "us-grant-025" in doctype_line: - self.parser = PatentUsptoGrantV2() - elif all( - item in doctype_line - for item in ("patent-application-publication", "pap-v1") - ): - self.parser = PatentUsptoAppV1() - else: - self.parser = None - - @override - def is_valid(self) -> bool: - return bool(self.patent_content) and bool(self.parser) - - @classmethod - @override - def supports_pagination(cls) -> bool: - return False - - @override - def unload(self) -> None: - return - - @classmethod - @override - def supported_formats(cls) -> set[InputFormat]: - return {InputFormat.XML_USPTO} - - @override - def convert(self) -> DoclingDocument: - - if self.parser is not None: - doc = self.parser.parse(self.patent_content) - if doc is None: - raise RuntimeError( - f"Failed to convert doc (hash={self.document_hash}, " - f"name={self.file.name})." - ) - doc.name = self.file.name or "file" - mime_type = ( - "text/plain" - if isinstance(self.parser, PatentUsptoGrantAps) - else "application/xml" - ) - doc.origin = DocumentOrigin( - mimetype=mime_type, - binary_hash=self.document_hash, - filename=self.file.name or "file", - ) - - return doc - else: - raise RuntimeError( - f"Cannot convert doc (hash={self.document_hash}, " - f"name={self.file.name}) because the backend failed to init." 
- ) - - -class PatentUspto(ABC): - """Parser of patent documents from the US Patent Office.""" - - @abstractmethod - def parse(self, patent_content: str) -> Optional[DoclingDocument]: - """Parse a USPTO patent. - - Parameters: - patent_content: The content of a single patent in a USPTO file. - - Returns: - The patent parsed as a docling document. - """ - pass - - -class PatentUsptoIce(PatentUspto): - """Parser of patent documents from the US Patent Office (ICE). - - The compatible formats are: - - Patent Grant Full Text Data/XML Version 4.x ICE (from January 2005) - - Patent Application Full Text Data/XML Version 4.x ICE (from January 2005) - """ - - def __init__(self) -> None: - """Build an instance of PatentUsptoIce class.""" - self.handler = PatentUsptoIce.PatentHandler() - self.pattern = re.compile(r"^()", re.MULTILINE | re.DOTALL) - - def parse(self, patent_content: str) -> Optional[DoclingDocument]: - try: - xml.sax.parseString(patent_content, self.handler) - except xml.sax._exceptions.SAXParseException as exc_sax: - _log.error(f"Error in parsing USPTO document: {exc_sax}") - - return None - - doc = self.handler.doc - if doc: - raw_tables = re.findall(self.pattern, patent_content) - parsed_tables: list[TableData] = [] - _log.debug(f"Found {len(raw_tables)} tables to be parsed with XmlTable.") - for table in raw_tables: - table_parser = XmlTable(XML_DECLARATION + "\n" + table) - try: - table_data = table_parser.parse() - if table_data: - parsed_tables.append(table_data) - except Exception as exc_table: - _log.error(f"Error in parsing USPTO tables: {exc_table}") - if len(parsed_tables) != len(doc.tables): - _log.error( - f"Number of referenced ({len(doc.tables)}) and parsed " - f"({len(parsed_tables)}) tables differ." 
- ) - else: - for idx, item in enumerate(parsed_tables): - doc.tables[idx].data = item - - return doc - - class PatentHandler(xml.sax.handler.ContentHandler): - """SAX ContentHandler for patent documents.""" - - APP_DOC_ELEMENT: Final = "us-patent-application" - GRANT_DOC_ELEMENT: Final = "us-patent-grant" - - @unique - class Element(Enum): - """Represents an element of interest in the patent application document.""" - - ABSTRACT = "abstract", True - TITLE = "invention-title", True - CLAIMS = "claims", False - CLAIM = "claim", False - CLAIM_TEXT = "claim-text", True - PARAGRAPH = "p", True - HEADING = "heading", True - DESCRIPTION = "description", False - TABLE = "table", False # to track its position, without text - DRAWINGS = "description-of-drawings", True - STYLE_SUPERSCRIPT = "sup", True - STYLE_SUBSCRIPT = "sub", True - MATHS = "maths", False # to avoid keeping formulas - - @override - def __new__(cls, value: str, _) -> Self: - obj = object.__new__(cls) - obj._value_ = value - return obj - - @override - def __init__(self, _, is_text: bool) -> None: - self.is_text: bool = is_text - - @override - def __init__(self) -> None: - """Build an instance of the patent handler.""" - # Current patent being parsed - self.doc: Optional[DoclingDocument] = None - # Keep track of docling hierarchy level - self.level: LevelNumber = 1 - # Keep track of docling parents by level - self.parents: dict[LevelNumber, Optional[DocItem]] = {1: None} - # Content to retain for the current patent - self.property: list[str] - self.claim: str - self.claims: list[str] - self.abstract: str - self.text: str - self._clean_data() - # To handle mathematical styling - self.style_html = HtmlEntity() - - @override - def startElement(self, tag, attributes): # noqa: N802 - """Signal the start of an element. - - Args: - tag: The element tag. - attributes: The element attributes. 
- """ - if tag in ( - self.APP_DOC_ELEMENT, - self.GRANT_DOC_ELEMENT, - ): - self.doc = DoclingDocument(name="file") - self.text = "" - self._start_registered_elements(tag, attributes) - - @override - def skippedEntity(self, name): # noqa: N802 - """Receive notification of a skipped entity. - - HTML entities will be skipped by the parser. This method will unescape them - and add them to the text. - - Args: - name: Entity name. - """ - if self.property: - elm_val = self.property[-1] - element = self.Element(elm_val) - if element.is_text: - escaped = self.style_html.get_greek_from_iso8879(f"&{name};") - unescaped = html.unescape(escaped) - if unescaped == escaped: - _log.debug(f"Unrecognized HTML entity: {name}") - return - - if element in ( - self.Element.STYLE_SUPERSCRIPT, - self.Element.STYLE_SUBSCRIPT, - ): - # superscripts and subscripts need to be under text elements - if len(self.property) < 2: - return - parent_val = self.property[-2] - parent = self.Element(parent_val) - if parent.is_text: - self.text += self._apply_style(unescaped, elm_val) - else: - self.text += unescaped - - @override - def endElement(self, tag): # noqa: N802 - """Signal the end of an element. - - Args: - tag: The element tag. - """ - if tag in ( - self.APP_DOC_ELEMENT, - self.GRANT_DOC_ELEMENT, - ): - self._clean_data() - self._end_registered_element(tag) - - @override - def characters(self, content): - """Receive notification of character data. - - Args: - content: Data reported by the handler. 
- """ - if self.property: - elm_val = self.property[-1] - element = self.Element(elm_val) - if element.is_text: - if element in ( - self.Element.STYLE_SUPERSCRIPT, - self.Element.STYLE_SUBSCRIPT, - ): - # superscripts and subscripts need to be under text elements - if len(self.property) < 2: - return - parent_val = self.property[-2] - parent = self.Element(parent_val) - if parent.is_text: - self.text += self._apply_style(content, elm_val) - else: - self.text += content - - def _start_registered_elements( - self, tag: str, attributes: xml.sax.xmlreader.AttributesImpl - ) -> None: - if tag in [member.value for member in self.Element]: - # special case for claims: claim lines may start before the - # previous one is closed - if ( - tag == self.Element.CLAIM_TEXT.value - and self.property - and self.property[-1] == tag - and self.text.strip() - ): - self.claim += " " + self.text.strip() - self.text = "" - elif tag == self.Element.HEADING.value: - level_attr: str = attributes.get("level", "") - new_level: int = int(level_attr) if level_attr.isnumeric() else 1 - max_level = min(self.parents.keys()) - # increase heading level with 1 for title, if any - self.level = ( - new_level + 1 if (new_level + 1) in self.parents else max_level - ) - self.property.append(tag) - - def _end_registered_element(self, tag: str) -> None: - if tag in [item.value for item in self.Element] and self.property: - current_tag = self.property.pop() - self._add_property(current_tag, self.text.strip()) - - def _add_property(self, name: str, text: str) -> None: - if not name or not self.doc: - return - - if name == self.Element.TITLE.value: - if text: - self.parents[self.level + 1] = self.doc.add_title( - parent=self.parents[self.level], - text=text, - ) - self.level += 1 - self.text = "" - - elif name == self.Element.ABSTRACT.value: - if self.abstract: - heading_text = PatentHeading.ABSTRACT.value - heading_level = ( - PatentHeading.ABSTRACT.level - if PatentHeading.ABSTRACT.level in self.parents - 
else 1 - ) - abstract_item = self.doc.add_heading( - heading_text, - level=heading_level, - parent=self.parents[heading_level], - ) - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text=self.abstract, - parent=abstract_item, - ) - - elif name == self.Element.CLAIM_TEXT.value: - text = re.sub("\\s+", " ", text).strip() - if text: - self.claim += " " + text - self.text = "" - - elif name == self.Element.CLAIM.value and self.claim: - self.claims.append(self.claim.strip()) - self.claim = "" - - elif name == self.Element.CLAIMS.value and self.claims: - heading_text = PatentHeading.CLAIMS.value - heading_level = ( - PatentHeading.CLAIMS.level - if PatentHeading.CLAIMS.level in self.parents - else 1 - ) - claims_item = self.doc.add_heading( - heading_text, - level=heading_level, - parent=self.parents[heading_level], - ) - for text in self.claims: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, text=text, parent=claims_item - ) - - elif name == self.Element.PARAGRAPH.value and text: - # remove blank spaces added in paragraphs - text = re.sub("\\s+", " ", text) - if self.Element.ABSTRACT.value in self.property: - self.abstract = ( - (self.abstract + " " + text) if self.abstract else text - ) - else: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text=text, - parent=self.parents[self.level], - ) - self.text = "" - - elif name == self.Element.HEADING.value and text: - self.parents[self.level + 1] = self.doc.add_heading( - text=text, - level=self.level, - parent=self.parents[self.level], - ) - self.level += 1 - self.text = "" - - elif name == self.Element.TABLE.value: - # set an empty table as placeholder - empty_table = TableData(num_rows=0, num_cols=0, table_cells=[]) - self.doc.add_table( - data=empty_table, - parent=self.parents[self.level], - ) - - def _apply_style(self, text: str, style_tag: str) -> str: - """Apply an HTML style to text. - - Args: - text: A string containing plain text. - style_tag: An HTML tag name for styling text. 
If the tag name is not - recognized as one of the supported styles, the method will return - the original `text`. - - Returns: - A string after applying the style. - """ - formatted = text - - if style_tag == self.Element.STYLE_SUPERSCRIPT.value: - formatted = html.unescape(self.style_html.get_superscript(text)) - elif style_tag == self.Element.STYLE_SUBSCRIPT.value: - formatted = html.unescape(self.style_html.get_subscript(text)) - - return formatted - - def _clean_data(self) -> None: - """Reset the variables from stream data.""" - self.property = [] - self.claim = "" - self.claims = [] - self.abstract = "" - - -class PatentUsptoGrantV2(PatentUspto): - """Parser of patent documents from the US Patent Office (grants v2.5). - - The compatible format is: - - Patent Grant Full Text Data/XML Version 2.5 (from January 2002 till December 2004) - """ - - @override - def __init__(self) -> None: - """Build an instance of PatentUsptoGrantV2 class.""" - self.handler = PatentUsptoGrantV2.PatentHandler() - self.pattern = re.compile(r"^(
        )", re.MULTILINE | re.DOTALL) - - @override - def parse(self, patent_content: str) -> Optional[DoclingDocument]: - try: - xml.sax.parseString(patent_content, self.handler) - except xml.sax._exceptions.SAXParseException as exc_sax: - _log.error(f"Error in parsing USPTO document: {exc_sax}") - - return None - - doc = self.handler.doc - if doc: - raw_tables = re.findall(self.pattern, patent_content) - parsed_tables: list[TableData] = [] - _log.debug(f"Found {len(raw_tables)} tables to be parsed with XmlTable.") - for table in raw_tables: - table_parser = XmlTable(XML_DECLARATION + "\n" + table) - try: - table_data = table_parser.parse() - if table_data: - parsed_tables.append(table_data) - except Exception as exc_table: - _log.error(f"Error in parsing USPTO tables: {exc_table}") - if len(parsed_tables) != len(doc.tables): - _log.error( - f"Number of referenced ({len(doc.tables)}) and parsed " - f"({len(parsed_tables)}) tables differ." - ) - else: - for idx, item in enumerate(parsed_tables): - doc.tables[idx].data = item - - return doc - - class PatentHandler(xml.sax.handler.ContentHandler): - """SAX ContentHandler for patent documents.""" - - GRANT_DOC_ELEMENT: Final = "PATDOC" - CLAIM_STATEMENT: Final = "What is claimed is:" - - @unique - class Element(Enum): - """Represents an element of interest in the patent application document.""" - - PDAT = "PDAT", True # any type of data - ABSTRACT = ("SDOAB", False) - SDOCL = ("SDOCL", False) - TITLE = ("B540", False) - CLAIMS = ("CL", False) - CLAIM = ("CLM", False) - PARAGRAPH = ("PARA", True) - HEADING = ("H", True) - DRAWINGS = ("DRWDESC", False) - STYLE_SUPERSCRIPT = ("SP", False) - STYLE_SUBSCRIPT = ("SB", False) - STYLE_ITALIC = ("ITALIC", False) - CWU = ("CWU", False) # avoid tables, chemicals, formulas - TABLE = ("table", False) # to keep track of table positions - - @override - def __new__(cls, value: str, _) -> Self: - obj = object.__new__(cls) - obj._value_ = value - return obj - - @override - def 
__init__(self, _, is_text: bool) -> None: - self.is_text: bool = is_text - - @override - def __init__(self) -> None: - """Build an instance of the patent handler.""" - # Current patent being parsed - self.doc: Optional[DoclingDocument] = None - # Keep track of docling hierarchy level - self.level: LevelNumber = 1 - # Keep track of docling parents by level - self.parents: dict[LevelNumber, Optional[DocItem]] = {1: None} - # Content to retain for the current patent - self.property: list[str] - self.claim: str - self.claims: list[str] - self.paragraph: str - self.abstract: str - self._clean_data() - # To handle mathematical styling - self.style_html = HtmlEntity() - - @override - def startElement(self, tag, attributes): # noqa: N802 - """Signal the start of an element. - - Args: - tag: The element tag. - attributes: The element attributes. - """ - if tag == self.GRANT_DOC_ELEMENT: - self.doc = DoclingDocument(name="file") - self.text = "" - self._start_registered_elements(tag, attributes) - - @override - def skippedEntity(self, name): # noqa: N802 - """Receive notification of a skipped entity. - - HTML entities will be skipped by the parser. This method will unescape them - and add them to the text. - - Args: - name: Entity name. 
- """ - if self.property: - elm_val = self.property[-1] - element = self.Element(elm_val) - if element.is_text: - escaped = self.style_html.get_greek_from_iso8879(f"&{name};") - unescaped = html.unescape(escaped) - if unescaped == escaped: - logging.debug("Unrecognized HTML entity: " + name) - return - - if element in ( - self.Element.STYLE_SUPERSCRIPT, - self.Element.STYLE_SUBSCRIPT, - ): - # superscripts and subscripts need to be under text elements - if len(self.property) < 2: - return - parent_val = self.property[-2] - parent = self.Element(parent_val) - if parent.is_text: - self.text += self._apply_style(unescaped, elm_val) - else: - self.text += unescaped - - @override - def endElement(self, tag): # noqa: N802 - """Signal the end of an element. - - Args: - tag: The element tag. - """ - if tag == self.GRANT_DOC_ELEMENT: - self._clean_data() - self._end_registered_element(tag) - - @override - def characters(self, content): - """Receive notification of character data. - - Args: - content: Data reported by the handler. 
- """ - if self.property: - elm_val = self.property[-1] - element = self.Element(elm_val) - if element.is_text: - if element in ( - self.Element.STYLE_SUPERSCRIPT, - self.Element.STYLE_SUBSCRIPT, - ): - # superscripts and subscripts need to be under text elements - if len(self.property) < 2: - return - parent_val = self.property[-2] - parent = self.Element(parent_val) - if parent.is_text: - self.text += self._apply_style(content, elm_val) - else: - self.text += content - - def _start_registered_elements( - self, tag: str, attributes: xml.sax.xmlreader.AttributesImpl - ) -> None: - if tag in [member.value for member in self.Element]: - if ( - tag == self.Element.HEADING.value - and not self.Element.SDOCL.value in self.property - ): - level_attr: str = attributes.get("LVL", "") - new_level: int = int(level_attr) if level_attr.isnumeric() else 1 - max_level = min(self.parents.keys()) - # increase heading level with 1 for title, if any - self.level = ( - new_level + 1 if (new_level + 1) in self.parents else max_level - ) - self.property.append(tag) - - def _end_registered_element(self, tag: str) -> None: - if tag in [elm.value for elm in self.Element] and self.property: - current_tag = self.property.pop() - self._add_property(current_tag, self.text) - - def _add_property(self, name: str, text: str) -> None: - if not name or not self.doc: - return - if name == self.Element.PDAT.value and text: - if not self.property: - self.text = "" - return - - wrapper = self.property[-1] - text = self._apply_style(text, wrapper) - - if self.Element.TITLE.value in self.property and text.strip(): - title = text.strip() - self.parents[self.level + 1] = self.doc.add_title( - parent=self.parents[self.level], - text=title, - ) - self.level += 1 - - elif self.Element.ABSTRACT.value in self.property: - self.abstract += text - - elif self.Element.CLAIM.value in self.property: - self.claim += text - - # Paragraph text not in claims or abstract - elif ( - self.Element.PARAGRAPH.value in 
self.property - and self.Element.CLAIM.value not in self.property - and self.Element.ABSTRACT.value not in self.property - ): - self.paragraph += text - - # headers except claims statement - elif ( - self.Element.HEADING.value in self.property - and not self.Element.SDOCL.value in self.property - and text.strip() - ): - self.parents[self.level + 1] = self.doc.add_heading( - text=text.strip(), - level=self.level, - parent=self.parents[self.level], - ) - self.level += 1 - - self.text = "" - - elif name == self.Element.CLAIM.value and self.claim.strip(): - self.claims.append(self.claim.strip()) - self.claim = "" - - elif name == self.Element.CLAIMS.value and self.claims: - heading_text = PatentHeading.CLAIMS.value - heading_level = ( - PatentHeading.CLAIMS.level - if PatentHeading.CLAIMS.level in self.parents - else 1 - ) - claims_item = self.doc.add_heading( - heading_text, - level=heading_level, - parent=self.parents[heading_level], - ) - for text in self.claims: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, text=text, parent=claims_item - ) - - elif name == self.Element.ABSTRACT.value and self.abstract.strip(): - abstract = self.abstract.strip() - heading_text = PatentHeading.ABSTRACT.value - heading_level = ( - PatentHeading.ABSTRACT.level - if PatentHeading.ABSTRACT.level in self.parents - else 1 - ) - abstract_item = self.doc.add_heading( - heading_text, - level=heading_level, - parent=self.parents[heading_level], - ) - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, text=abstract, parent=abstract_item - ) - - elif name == self.Element.PARAGRAPH.value: - paragraph = self.paragraph.strip() - if paragraph and self.Element.CLAIM.value not in self.property: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text=paragraph, - parent=self.parents[self.level], - ) - elif self.Element.CLAIM.value in self.property: - # we may need a space after a paragraph in claim text - self.claim += " " - self.paragraph = "" - - elif name == self.Element.TABLE.value: - # 
set an empty table as placeholder - empty_table = TableData(num_rows=0, num_cols=0, table_cells=[]) - self.doc.add_table( - data=empty_table, - parent=self.parents[self.level], - ) - - def _apply_style(self, text: str, style_tag: str) -> str: - """Apply an HTML style to text. - - Args: - text: A string containing plain text. - style_tag: An HTML tag name for styling text. If the tag name is not - recognized as one of the supported styles, the method will return - the original `text`. - - Returns: - A string after applying the style. - """ - formatted = text - - if style_tag == self.Element.STYLE_SUPERSCRIPT.value: - formatted = html.unescape(self.style_html.get_superscript(text)) - elif style_tag == self.Element.STYLE_SUBSCRIPT.value: - formatted = html.unescape(self.style_html.get_subscript(text)) - elif style_tag == self.Element.STYLE_ITALIC.value: - formatted = html.unescape(self.style_html.get_math_italic(text)) - - return formatted - - def _clean_data(self) -> None: - """Reset the variables from stream data.""" - self.text = "" - self.property = [] - self.claim = "" - self.claims = [] - self.paragraph = "" - self.abstract = "" - - -class PatentUsptoGrantAps(PatentUspto): - """Parser of patent documents from the US Patent Office (grants APS). 
- - The compatible format is: - - Patent Grant Full Text Data/APS (from January 1976 till December 2001) - """ - - @unique - class Section(Enum): - """Represent a section in a patent APS document.""" - - ABSTRACT = "ABST" - SUMMARY = "BSUM" - DETAILS = "DETD" - CLAIMS = "CLMS" - DRAWINGS = "DRWD" - - @unique - class Field(Enum): - """Represent a field in a patent APS document.""" - - DOC_NUMBER = "WKU" - TITLE = "TTL" - PARAGRAPH = "PAR" - PARAGRAPH_1 = "PA1" - PARAGRAPH_2 = "PA2" - PARAGRAPH_3 = "PA3" - TEXT = "PAL" - CAPTION = "PAC" - NUMBER = "NUM" - NAME = "NAM" - IPC = "ICL" - ISSUED = "ISD" - FILED = "APD" - PATENT_NUMBER = "PNO" - APPLICATION_NUMBER = "APN" - APPLICATION_TYPE = "APT" - COUNTRY = "CNT" - - @override - def __init__(self) -> None: - """Build an instance of PatentUsptoGrantAps class.""" - self.doc: Optional[DoclingDocument] = None - # Keep track of docling hierarchy level - self.level: LevelNumber = 1 - # Keep track of docling parents by level - self.parents: dict[LevelNumber, Optional[DocItem]] = {1: None} - - def get_last_text_item(self) -> Optional[TextItem]: - """Get the last text item at the current document level. - - Returns: - The text item or None, if the current level parent has no children.""" - if self.doc: - parent = self.parents[self.level] - children = parent.children if parent is not None else [] - else: - return None - text_list: list[TextItem] = [ - item - for item in self.doc.texts - if isinstance(item, TextItem) and item.get_ref() in children - ] - - if text_list: - return text_list[-1] - else: - return None - - def store_section(self, section: str) -> None: - """Store the section heading in the docling document. - - Only the predefined sections from PatentHeading will be handled. - The other sections are created by the Field.CAPTION field. 
- - Args: - section: A patent section name.""" - heading: PatentHeading - if self.doc is None: - return - elif section == self.Section.ABSTRACT.value: - heading = PatentHeading.ABSTRACT - elif section == self.Section.CLAIMS.value: - heading = PatentHeading.CLAIMS - else: - return None - - self.level = heading.level if heading.level in self.parents else 1 - self.parents[self.level + 1] = self.doc.add_heading( - heading.value, - level=self.level, - parent=self.parents[self.level], - ) - self.level += 1 - - def store_content(self, section: str, field: str, value: str) -> None: - """Store the key value within a document section in the docling document. - - Args: - section: A patent section name. - field: A field name. - value: A field value name. - """ - if ( - not self.doc - or not field - or field not in [item.value for item in PatentUsptoGrantAps.Field] - ): - return - - if field == self.Field.TITLE.value: - self.parents[self.level + 1] = self.doc.add_title( - parent=self.parents[self.level], text=value - ) - self.level += 1 - - elif field == self.Field.TEXT.value and section == self.Section.ABSTRACT.value: - abst_item = self.get_last_text_item() - if abst_item: - abst_item.text += " " + value - else: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text=value, - parent=self.parents[self.level], - ) - - elif field == self.Field.NUMBER.value and section == self.Section.CLAIMS.value: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text="", - parent=self.parents[self.level], - ) - - elif ( - field - in ( - self.Field.PARAGRAPH.value, - self.Field.PARAGRAPH_1.value, - self.Field.PARAGRAPH_2.value, - self.Field.PARAGRAPH_3.value, - ) - and section == self.Section.CLAIMS.value - ): - last_claim = self.get_last_text_item() - if last_claim is None: - last_claim = self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text="", - parent=self.parents[self.level], - ) - - last_claim.text += f" {value}" if last_claim.text else value - - elif field == 
self.Field.CAPTION.value and section in (
-            self.Section.SUMMARY.value,
-            self.Section.DETAILS.value,
-            self.Section.DRAWINGS.value,
-        ):
-            # captions are siblings of abstract since no level info is provided
-            head_item = PatentHeading.ABSTRACT
-            self.level = head_item.level if head_item.level in self.parents else 1
-            self.parents[self.level + 1] = self.doc.add_heading(
-                value,
-                level=self.level,
-                parent=self.parents[self.level],
-            )
-            self.level += 1
-
-        elif field in (
-            self.Field.PARAGRAPH.value,
-            self.Field.PARAGRAPH_1.value,
-            self.Field.PARAGRAPH_2.value,
-            self.Field.PARAGRAPH_3.value,
-        ) and section in (
-            self.Section.SUMMARY.value,
-            self.Section.DETAILS.value,
-            self.Section.DRAWINGS.value,
-        ):
-            self.doc.add_text(
-                label=DocItemLabel.PARAGRAPH,
-                text=value,
-                parent=self.parents[self.level],
-            )
-
-    def parse(self, patent_content: str) -> Optional[DoclingDocument]:
-        self.doc = DoclingDocument(name="file")
-        section: str = ""
-        key: str = ""
-        value: str = ""
-        line_num = 0
-        for line in patent_content.splitlines():
-            cols = re.split("\\s{2,}", line, maxsplit=1)
-            if key and value and (len(cols) == 1 or (len(cols) == 2 and cols[0])):
-                self.store_content(section, key, value)
-                key = ""
-                value = ""
-            if len(cols) == 1:  # section title
-                section = cols[0]
-                self.store_section(section)
-                _log.debug(f"Parsing section {section}")
-            elif len(cols) == 2:  # key value
-                if cols[0]:  # key present
-                    key = cols[0]
-                    value = cols[1]
-                elif not re.match(r"^##STR\d+##$", cols[1]):  # line continues
-                    value += " " + cols[1]
-            line_num += 1
-        if key and value:
-            self.store_content(section, key, value)
-
-        # TODO: parse tables
-        return self.doc
-
-
-class PatentUsptoAppV1(PatentUspto):
-    """Parser of patent documents from the US Patent Office (applications v1.x)
-
-    The compatible format is:
-    - Patent Application Full Text Data/XML Version 1.x (from March 2001 till December
-      2004)
-    """
-
-    @override
-    def __init__(self) -> None:
-        """Build an 
instance of PatentUsptoAppV1 class.""" - self.handler = PatentUsptoAppV1.PatentHandler() - self.pattern = re.compile(r"^(
        )", re.MULTILINE | re.DOTALL) - - @override - def parse(self, patent_content: str) -> Optional[DoclingDocument]: - try: - xml.sax.parseString(patent_content, self.handler) - except xml.sax._exceptions.SAXParseException as exc_sax: - _log.error(f"Error in parsing USPTO document: {exc_sax}") - - return None - - doc = self.handler.doc - if doc: - raw_tables = re.findall(self.pattern, patent_content) - parsed_tables: list[TableData] = [] - _log.debug(f"Found {len(raw_tables)} tables to be parsed with XmlTable.") - for table in raw_tables: - table_parser = XmlTable(XML_DECLARATION + "\n" + table) - try: - table_data = table_parser.parse() - if table_data: - parsed_tables.append(table_data) - except Exception as exc_table: - _log.error(f"Error in parsing USPTO tables: {exc_table}") - if len(parsed_tables) != len(doc.tables): - _log.error( - f"Number of referenced ({len(doc.tables)}) and parsed " - f"({len(parsed_tables)}) tables differ." - ) - else: - for idx, item in enumerate(parsed_tables): - doc.tables[idx].data = item - - return doc - - class PatentHandler(xml.sax.handler.ContentHandler): - """SAX ContentHandler for patent documents.""" - - APP_DOC_ELEMENT: Final = "patent-application-publication" - - @unique - class Element(Enum): - """Represents an element of interest in the patent application document.""" - - DRAWINGS = "brief-description-of-drawings", False - ABSTRACT = "subdoc-abstract", False - TITLE = "title-of-invention", True - CLAIMS = "subdoc-claims", False - CLAIM = "claim", False - CLAIM_TEXT = "claim-text", True - NUMBER = ("number", False) - PARAGRAPH = "paragraph", True - HEADING = "heading", True - STYLE_SUPERSCRIPT = "superscript", True - STYLE_SUBSCRIPT = "subscript", True - # do not store text of a table, since it can be within paragraph - TABLE = "table", False - # do not store text of a formula, since it can be within paragraph - MATH = "math-cwu", False - - @override - def __new__(cls, value: str, _) -> Self: - obj = 
object.__new__(cls) - obj._value_ = value - return obj - - @override - def __init__(self, _, is_text: bool) -> None: - self.is_text: bool = is_text - - @override - def __init__(self) -> None: - """Build an instance of the patent handler.""" - # Current patent being parsed - self.doc: Optional[DoclingDocument] = None - # Keep track of docling hierarchy level - self.level: LevelNumber = 1 - # Keep track of docling parents by level - self.parents: dict[LevelNumber, Optional[DocItem]] = {1: None} - # Content to retain for the current patent - self.property: list[str] - self.claim: str - self.claims: list[str] - self.abstract: str - self.text: str - self._clean_data() - # To handle mathematical styling - self.style_html = HtmlEntity() - - @override - def startElement(self, tag, attributes): # noqa: N802 - """Signal the start of an element. - - Args: - tag: The element tag. - attributes: The element attributes. - """ - if tag == self.APP_DOC_ELEMENT: - self.doc = DoclingDocument(name="file") - self.text = "" - self._start_registered_elements(tag, attributes) - - @override - def skippedEntity(self, name): # noqa: N802 - """Receive notification of a skipped entity. - - HTML entities will be skipped by the parser. This method will unescape them - and add them to the text. - - Args: - name: Entity name. 
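The `Element` enum above uses the tuple-valued member pattern: `__new__` consumes the first tuple item as the enum value, and `__init__` stores the remaining item as an extra attribute on each member. A standalone sketch of the same pattern (the `Tag` class is illustrative, not part of this module):

```python
from enum import Enum, unique

# Sketch of the (value, is_text) enum pattern used by Element above:
# __new__ sets the enum value from the first tuple item, and __init__
# stores the second item as an extra attribute on each member.
@unique
class Tag(Enum):
    PARAGRAPH = "paragraph", True
    TABLE = "table", False

    def __new__(cls, value, _):
        obj = object.__new__(cls)
        obj._value_ = value
        return obj

    def __init__(self, _, is_text):
        self.is_text = is_text

# Lookup by value still works, and the extra attribute is available.
print(Tag("paragraph").is_text)  # True
print(Tag("table").is_text)      # False
```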
- """ - if self.property: - elm_val = self.property[-1] - element = self.Element(elm_val) - if element.is_text: - escaped = self.style_html.get_greek_from_iso8879(f"&{name};") - unescaped = html.unescape(escaped) - if unescaped == escaped: - logging.debug("Unrecognized HTML entity: " + name) - return - - if element in ( - self.Element.STYLE_SUPERSCRIPT, - self.Element.STYLE_SUBSCRIPT, - ): - # superscripts and subscripts need to be under text elements - if len(self.property) < 2: - return - parent_val = self.property[-2] - parent = self.Element(parent_val) - if parent.is_text: - self.text += self._apply_style(unescaped, elm_val) - else: - self.text += unescaped - - @override - def endElement(self, tag): # noqa: N802 - """Signal the end of an element. - - Args: - tag: The element tag. - """ - if tag == self.APP_DOC_ELEMENT: - self._clean_data() - self._end_registered_element(tag) - - @override - def characters(self, content): - """Receive notification of character data. - - Args: - content: Data reported by the handler. 
- """ - if self.property: - elm_val = self.property[-1] - element = self.Element(elm_val) - if element.is_text: - if element in ( - self.Element.STYLE_SUPERSCRIPT, - self.Element.STYLE_SUBSCRIPT, - ): - # superscripts and subscripts need to be under text elements - if len(self.property) < 2: - return - parent_val = self.property[-2] - parent = self.Element(parent_val) - if parent.is_text: - self.text += self._apply_style(content, elm_val) - else: - self.text += content - - def _start_registered_elements( - self, tag: str, attributes: xml.sax.xmlreader.AttributesImpl - ) -> None: - if tag in [member.value for member in self.Element]: - # special case for claims: claim lines may start before the - # previous one is closed - if ( - tag == self.Element.CLAIM_TEXT.value - and self.property - and self.property[-1] == tag - and self.text.strip() - ): - self.claim += " " + self.text.strip("\n") - self.text = "" - elif tag == self.Element.HEADING.value: - level_attr: str = attributes.get("lvl", "") - new_level: int = int(level_attr) if level_attr.isnumeric() else 1 - max_level = min(self.parents.keys()) - # increase heading level with 1 for title, if any - self.level = ( - new_level + 1 if (new_level + 1) in self.parents else max_level - ) - self.property.append(tag) - - def _end_registered_element(self, tag: str) -> None: - if tag in [elm.value for elm in self.Element] and self.property: - current_tag = self.property.pop() - self._add_property(current_tag, self.text) - - def _add_property(self, name: str, text: str) -> None: - if not name or not self.doc: - return - - if name == self.Element.TITLE.value: - title = text.strip() - if title: - self.parents[self.level + 1] = self.doc.add_text( - parent=self.parents[self.level], - label=DocItemLabel.TITLE, - text=title, - ) - self.level += 1 - self.text = "" - elif name == self.Element.ABSTRACT.value: - abstract = self.abstract.strip() - if abstract: - heading_text = PatentHeading.ABSTRACT.value - heading_level = ( - 
PatentHeading.ABSTRACT.level - if PatentHeading.ABSTRACT.level in self.parents - else 1 - ) - abstract_item = self.doc.add_heading( - heading_text, - level=heading_level, - parent=self.parents[heading_level], - ) - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text=self.abstract, - parent=abstract_item, - ) - self.abstract = "" - self.text = "" - elif name == self.Element.CLAIM_TEXT.value: - if text: - self.claim += self.text.strip("\n") - self.text = "" - - elif name == self.Element.CLAIM.value: - claim = self.claim.strip() - if claim: - self.claims.append(claim) - self.claim = "" - - elif name == self.Element.CLAIMS.value and self.claims: - heading_text = PatentHeading.CLAIMS.value - heading_level = ( - PatentHeading.CLAIMS.level - if PatentHeading.CLAIMS.level in self.parents - else 1 - ) - claims_item = self.doc.add_heading( - heading_text, - level=heading_level, - parent=self.parents[heading_level], - ) - for text in self.claims: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, text=text, parent=claims_item - ) - - elif name in ( - self.Element.PARAGRAPH.value, - self.Element.HEADING.value, - ): - if text and self.Element.ABSTRACT.value in self.property: - self.abstract = (self.abstract + text) if self.abstract else text - elif text.strip(): - text = re.sub("\\s+", " ", text).strip() - if name == self.Element.HEADING.value: - self.parents[self.level + 1] = self.doc.add_heading( - text=text, - level=self.level, - parent=self.parents[self.level], - ) - self.level += 1 - else: - self.doc.add_text( - label=DocItemLabel.PARAGRAPH, - text=text, - parent=self.parents[self.level], - ) - self.text = "" - - elif name == self.Element.TABLE.value: - # set an empty table as placeholder - empty_table = TableData(num_rows=0, num_cols=0, table_cells=[]) - self.doc.add_table( - data=empty_table, - parent=self.parents[self.level], - ) - - def _apply_style(self, text: str, style_tag: str) -> str: - """Apply an HTML style to text. 
- - Args: - text: A string containing plain text. - style_tag: An HTML tag name for styling text. If the tag name is not - recognized as one of the supported styles, the method will return - the original `text`. - - Returns: - A string after applying the style. - """ - formatted = html.unescape(text) - - if style_tag == self.Element.STYLE_SUPERSCRIPT.value: - formatted = html.unescape(self.style_html.get_superscript(formatted)) - elif style_tag == self.Element.STYLE_SUBSCRIPT.value: - formatted = html.unescape(self.style_html.get_subscript(formatted)) - - return formatted - - def _clean_data(self): - """Reset the variables from stream data.""" - self.property = [] - self.abstract = "" - self.claim = "" - self.claims = [] - self.text = "" - - -class XmlTable: - """Provide a table parser for xml tables in USPTO patent documents. - - The OASIS Open XML Exchange Table Model can be downloaded from: - http://oasis-open.org/specs/soextblx.dtd - """ - - class MinColInfoType(TypedDict): - offset: list[int] - colwidth: list[int] - - class ColInfoType(MinColInfoType): - cell_range: list[int] - cell_offst: list[int] - - def __init__(self, input: str) -> None: - """Initialize the table parser with the xml content. - - Args: - input: The xml content. - """ - self.max_nbr_messages = 2 - self.nbr_messages = 0 - self.empty_text = "" - self._soup = BeautifulSoup(input, features="xml") - - def _create_tg_range(self, tgs: list[dict[str, Any]]) -> dict[int, ColInfoType]: - """Create a unified range along the table groups. - - Args: - tgs: Table group column specifications. - - Returns: - Unified group column specifications. 
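The offset bookkeeping in `_create_tg_range` can be summarized as: each table group's cumulative offsets are the running sums of its column widths, and the unified grid is the sorted union of those offsets across all groups. An illustrative sketch (`cumulative_offsets` is hypothetical, not part of this module):

```python
# Hypothetical sketch of the offset arithmetic behind _create_tg_range:
# running sums of column widths give each group's offsets, and the unified
# grid is the sorted union of the offsets of all table groups.
def cumulative_offsets(colwidths):
    offsets = [0]
    for width in colwidths:
        offsets.append(offsets[-1] + width)
    return offsets

g1 = cumulative_offsets([30, 70])      # [0, 30, 100]
g2 = cumulative_offsets([30, 40, 30])  # [0, 30, 70, 100]
unified = sorted(set(g1) | set(g2))    # [0, 30, 70, 100]
# column widths of the unified grid are the consecutive differences
widths = [b - a for a, b in zip(unified, unified[1:])]  # [30, 40, 30]
print(unified, widths)
```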
- """ - colinfo: dict[int, XmlTable.ColInfoType] = {} - - if len(tgs) == 0: - return colinfo - - for itg, tg in enumerate(tgs): - colinfo[itg] = { - "offset": [], - "colwidth": [], - "cell_range": [], - "cell_offst": [0], - } - offst = 0 - for info in tg["colinfo"]: - cw = info["colwidth"] - cw = re.sub("pt", "", cw, flags=re.I) - cw = re.sub("mm", "", cw, flags=re.I) - try: - cw = int(cw) - except BaseException: - cw = float(cw) - colinfo[itg]["colwidth"].append(cw) - colinfo[itg]["offset"].append(offst) - offst += cw - colinfo[itg]["offset"].append(offst) - - min_colinfo: XmlTable.MinColInfoType = {"offset": [], "colwidth": []} - - min_colinfo["offset"] = colinfo[0]["offset"] - offset_w0 = [] - for itg, col in colinfo.items(): - # keep track of col with 0 width - for ic, cw in enumerate(col["colwidth"]): - if cw == 0: - offset_w0.append(col["offset"][ic]) - - min_colinfo["offset"] = sorted( - list(set(col["offset"] + min_colinfo["offset"])) - ) - - # add back the 0 width cols to offset list - offset_w0 = list(set(offset_w0)) - min_colinfo["offset"] = sorted(min_colinfo["offset"] + offset_w0) - - for i in range(len(min_colinfo["offset"]) - 1): - min_colinfo["colwidth"].append( - min_colinfo["offset"][i + 1] - min_colinfo["offset"][i] - ) - - for itg, col in colinfo.items(): - i = 1 - range_ = 1 - for min_i in range(1, len(min_colinfo["offset"])): - min_offst = min_colinfo["offset"][min_i] - offst = col["offset"][i] - if min_offst == offst: - if ( - len(col["offset"]) == i + 1 - and len(min_colinfo["offset"]) > min_i + 1 - ): - range_ += 1 - else: - col["cell_range"].append(range_) - col["cell_offst"].append(col["cell_offst"][-1] + range_) - range_ = 1 - i += 1 - elif min_offst < offst: - range_ += 1 - else: - _log.debug("A USPTO XML table has wrong offsets.") - return {} - - return colinfo - - def _get_max_ncols(self, tgs_info: dict[int, ColInfoType]) -> NonNegativeInt: - """Get the maximum number of columns across table groups. 
- - Args: - tgs_info: Unified group column specifications. - - Return: - The maximum number of columns. - """ - ncols_max = 0 - for rowinfo in tgs_info.values(): - ncols_max = max(ncols_max, len(rowinfo["colwidth"])) - - return ncols_max - - def _parse_table(self, table: Tag) -> TableData: - """Parse the content of a table tag. - - Args: - The table element. - - Returns: - A docling table object. - """ - tgs_align = [] - tg_secs = table.find_all("tgroup") - if tg_secs: - for tg_sec in tg_secs: - ncols = tg_sec.get("cols", None) - if ncols: - ncols = int(ncols) - tg_align = {"ncols": ncols, "colinfo": []} - cs_secs = tg_sec.find_all("colspec") - if cs_secs: - for cs_sec in cs_secs: - colname = cs_sec.get("colname", None) - colwidth = cs_sec.get("colwidth", None) - tg_align["colinfo"].append( - {"colname": colname, "colwidth": colwidth} - ) - - tgs_align.append(tg_align) - - # create unified range along the table groups - tgs_range = self._create_tg_range(tgs_align) - - # if the structure is broken, return an empty table - if not tgs_range: - dl_table = TableData(num_rows=0, num_cols=0, table_cells=[]) - return dl_table - - ncols_max = self._get_max_ncols(tgs_range) - - # extract table data - table_data: list[TableCell] = [] - i_row_global = 0 - is_row_empty: bool = True - tg_secs = table.find_all("tgroup") - if tg_secs: - for itg, tg_sec in enumerate(tg_secs): - tg_range = tgs_range[itg] - row_secs = tg_sec.find_all(["row", "tr"]) - - if row_secs: - for row_sec in row_secs: - entry_secs = row_sec.find_all(["entry", "td"]) - is_header: bool = row_sec.parent.name in ["thead"] - - ncols = 0 - local_row: list[TableCell] = [] - is_row_empty = True - if entry_secs: - wrong_nbr_cols = False - for ientry, entry_sec in enumerate(entry_secs): - text = entry_sec.get_text().strip() - - # start-end - namest = entry_sec.attrs.get("namest", None) - nameend = entry_sec.attrs.get("nameend", None) - if isinstance(namest, str) and namest.isnumeric(): - namest = int(namest) - else: - 
namest = ientry + 1 - if isinstance(nameend, str) and nameend.isnumeric(): - nameend = int(nameend) - shift = 0 - else: - nameend = ientry + 2 - shift = 1 - - if nameend > len(tg_range["cell_offst"]): - wrong_nbr_cols = True - self.nbr_messages += 1 - if self.nbr_messages <= self.max_nbr_messages: - _log.debug( - "USPTO table has # entries != # columns" - ) - break - - range_ = [ - tg_range["cell_offst"][namest - 1], - tg_range["cell_offst"][nameend - 1] - shift, - ] - - # add row and replicate cell if needed - cell_text = text if text else self.empty_text - if cell_text != self.empty_text: - is_row_empty = False - for irep in range(range_[0], range_[1] + 1): - ncols += 1 - local_row.append( - TableCell( - column_header=is_header, - text=cell_text, - start_row_offset_idx=i_row_global, - end_row_offset_idx=i_row_global + 1, - row_span=1, - start_col_offset_idx=range_[0], - end_col_offset_idx=range_[1] + 1, - col_span=range_[1] - range_[0] + 1, - ) - ) - - if wrong_nbr_cols: - # keep empty text, not to introduce noise - local_row = [] - ncols = 0 - - # add empty cell up to ncols_max - for irep in range(ncols, ncols_max): - local_row.append( - TableCell( - column_header=is_header, - text=self.empty_text, - start_row_offset_idx=i_row_global, - end_row_offset_idx=i_row_global + 1, - row_span=1, - start_col_offset_idx=irep, - end_col_offset_idx=irep + 1, - col_span=1, - ) - ) - # do not add empty rows - if not is_row_empty: - table_data.extend(local_row) - i_row_global += 1 - - dl_table = TableData( - num_rows=i_row_global, num_cols=ncols_max, table_cells=table_data - ) - - return dl_table - - def parse(self) -> Optional[TableData]: - """Parse the first table from an xml content. - - Returns: - A docling table data. 
- """ - section = self._soup.find("table") - if section is not None: - table = self._parse_table(section) - if table.num_rows == 0 or table.num_cols == 0: - _log.warning("The parsed USPTO table is empty") - return table - else: - return None - - -class HtmlEntity: - """Provide utility functions to get the HTML entities of styled characters. - - This class has been developped from: - https://unicode-table.com/en/html-entities/ - https://www.w3.org/TR/WD-math-970515/table03.html - """ - - def __init__(self): - """Initialize this class by loading the HTML entity dictionaries.""" - self.superscript = str.maketrans( - { - "1": "¹", - "2": "²", - "3": "³", - "4": "⁴", - "5": "⁵", - "6": "⁶", - "7": "⁷", - "8": "⁸", - "9": "⁹", - "0": "⁰", - "+": "⁺", - "-": "⁻", - "−": "⁻", - "=": "⁼", - "(": "⁽", - ")": "⁾", - "a": "ª", - "o": "º", - "i": "ⁱ", - "n": "ⁿ", - } - ) - self.subscript = str.maketrans( - { - "1": "₁", - "2": "₂", - "3": "₃", - "4": "₄", - "5": "₅", - "6": "₆", - "7": "₇", - "8": "₈", - "9": "₉", - "0": "₀", - "+": "₊", - "-": "₋", - "−": "₋", - "=": "₌", - "(": "₍", - ")": "₎", - "a": "ₐ", - "e": "ₑ", - "o": "ₒ", - "x": "ₓ", - } - ) - self.mathematical_italic = str.maketrans( - { - "A": "𝐴", - "B": "𝐵", - "C": "𝐶", - "D": "𝐷", - "E": "𝐸", - "F": "𝐹", - "G": "𝐺", - "H": "𝐻", - "I": "𝐼", - "J": "𝐽", - "K": "𝐾", - "L": "𝐿", - "M": "𝑀", - "N": "𝑁", - "O": "𝑂", - "P": "𝑃", - "Q": "𝑄", - "R": "𝑅", - "S": "𝑆", - "T": "𝑇", - "U": "𝑈", - "V": "𝑉", - "W": "𝑊", - "Y": "𝑌", - "Z": "𝑍", - "a": "𝑎", - "b": "𝑏", - "c": "𝑐", - "d": "𝑑", - "e": "𝑒", - "f": "𝑓", - "g": "𝑔", - "h": "𝑕", - "i": "𝑖", - "j": "𝑗", - "k": "𝑘", - "l": "𝑙", - "m": "𝑚", - "n": "𝑛", - "o": "𝑜", - "p": "𝑝", - "q": "𝑞", - "r": "𝑟", - "s": "𝑠", - "t": "𝑡", - "u": "𝑢", - "v": "𝑣", - "w": "𝑤", - "x": "𝑥", - "y": "𝑦", - "z": "𝑧", - } - ) - - self.lookup_iso8879 = { - "&Agr;": "Α", - "&Bgr;": "Β", - "&Ggr;": "Γ", - "&Dgr;": "Δ", - "&Egr;": "Ε", - "&Zgr;": "Ζ", - "&EEgr;": "Η", - "&THgr;": "Θ", - "&Igr;": "Ι", 
- "&Kgr;": "Κ", - "&Lgr;": "Λ", - "&Mgr;": "Μ", - "&Ngr;": "Ν", - "&Xgr;": "Ξ", - "&Ogr;": "Ο", - "&Pgr;": "Π", - "&Rgr;": "Ρ", - "&Sgr;": "Σ", - "&Tgr;": "Τ", - "&Ugr;": "Υ", - "&PHgr;": "Φ", - "&KHgr;": "Χ", - "&PSgr;": "Ψ", - "&OHgr;": "Ω", - "&agr;": "α", - "&bgr;": "β", - "&ggr;": "γ", - "&dgr;": "δ", - "&egr;": "ε", - "&zgr;": "ζ", - "&eegr;": "η", - "&thgr;": "θ", - "&igr;": "ι", - "&kgr;": "κ", - "&lgr;": "λ", - "&mgr;": "μ", - "&ngr;": "ν", - "&xgr;": "ξ", - "&ogr;": "ο", - "&pgr;": "π", - "&rgr;": "ρ", - "&sgr;": "ς", - "&tgr;": "τ", - "&ugr;": "υ", - "&phgr;": "φ", - "&khgr;": "χ", - "&psgr;": "ψ", - "&ohgr;": "ω", - } - - def get_superscript(self, text: str) -> str: - """Get a text in superscript as HTML entities. - - Args: - text: The text to transform. - - Returns: - The text in superscript as HTML entities. - """ - return text.translate(self.superscript) - - def get_subscript(self, text: str) -> str: - """Get a text in subscript as HTML entities. - - Args: - The text to transform. - - Returns: - The text in subscript as HTML entities. - """ - return text.translate(self.subscript) - - def get_math_italic(self, text: str) -> str: - """Get a text in italic as HTML entities. - - Args: - The text to transform. - - Returns: - The text in italics as HTML entities. - """ - return text.translate(self.mathematical_italic) - - def get_greek_from_iso8879(self, text: str) -> str: - """Get an HTML entity of a greek letter in ISO 8879. - - Args: - The text to transform, as an ISO 8879 entitiy. - - Returns: - The HTML entity representing a greek letter. If the input text is not - supported, the original text is returned. 
- """ - return self.lookup_iso8879.get(text, text) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/chunking/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/chunking/__init__.py deleted file mode 100644 index e72deb971264cef854d1f6900656c163bdaa083d..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/chunking/__init__.py +++ /dev/null @@ -1,12 +0,0 @@ -# -# Copyright IBM Corp. 2024 - 2024 -# SPDX-License-Identifier: MIT -# - -from docling_core.transforms.chunker.base import BaseChunk, BaseChunker, BaseMeta -from docling_core.transforms.chunker.hierarchical_chunker import ( - DocChunk, - DocMeta, - HierarchicalChunker, -) -from docling_core.transforms.chunker.hybrid_chunker import HybridChunker diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/cli/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/main.py b/Paper2Video/src/evaluation/PresentQuiz/docling/cli/main.py deleted file mode 100644 index e2bc0dd67a35c028fe7b37f66f51c03d9bebfe17..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/main.py +++ /dev/null @@ -1,456 +0,0 @@ -import importlib -import logging -import platform -import re -import sys -import tempfile -import time -import warnings -from pathlib import Path -from typing import Annotated, Dict, Iterable, List, Optional, Type - -import typer -from docling_core.types.doc import ImageRefMode -from docling_core.utils.file import resolve_source_to_path -from pydantic import TypeAdapter - -from docling.backend.docling_parse_backend import DoclingParseDocumentBackend -from docling.backend.docling_parse_v2_backend import DoclingParseV2DocumentBackend -from docling.backend.pdf_backend import PdfDocumentBackend -from 
docling.backend.pypdfium2_backend import PyPdfiumDocumentBackend -from docling.datamodel.base_models import ( - ConversionStatus, - FormatToExtensions, - InputFormat, - OutputFormat, -) -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import ( - AcceleratorDevice, - AcceleratorOptions, - EasyOcrOptions, - OcrEngine, - OcrMacOptions, - OcrOptions, - PdfBackend, - PdfPipelineOptions, - RapidOcrOptions, - TableFormerMode, - TesseractCliOcrOptions, - TesseractOcrOptions, -) -from docling.datamodel.settings import settings -from docling.document_converter import DocumentConverter, FormatOption, PdfFormatOption - -warnings.filterwarnings(action="ignore", category=UserWarning, module="pydantic|torch") -warnings.filterwarnings(action="ignore", category=FutureWarning, module="easyocr") - -_log = logging.getLogger(__name__) -from rich.console import Console - -err_console = Console(stderr=True) - - -app = typer.Typer( - name="Docling", - no_args_is_help=True, - add_completion=False, - pretty_exceptions_enable=False, -) - - -def version_callback(value: bool): - if value: - docling_version = importlib.metadata.version("docling") - docling_core_version = importlib.metadata.version("docling-core") - docling_ibm_models_version = importlib.metadata.version("docling-ibm-models") - docling_parse_version = importlib.metadata.version("docling-parse") - platform_str = platform.platform() - py_impl_version = sys.implementation.cache_tag - py_lang_version = platform.python_version() - print(f"Docling version: {docling_version}") - print(f"Docling Core version: {docling_core_version}") - print(f"Docling IBM Models version: {docling_ibm_models_version}") - print(f"Docling Parse version: {docling_parse_version}") - print(f"Python: {py_impl_version} ({py_lang_version})") - print(f"Platform: {platform_str}") - raise typer.Exit() - - -def export_documents( - conv_results: Iterable[ConversionResult], - output_dir: Path, - export_json: bool, - 
export_html: bool, - export_md: bool, - export_txt: bool, - export_doctags: bool, - image_export_mode: ImageRefMode, -): - - success_count = 0 - failure_count = 0 - - for conv_res in conv_results: - if conv_res.status == ConversionStatus.SUCCESS: - success_count += 1 - doc_filename = conv_res.input.file.stem - - # Export JSON format: - if export_json: - fname = output_dir / f"{doc_filename}.json" - _log.info(f"writing JSON output to {fname}") - conv_res.document.save_as_json( - filename=fname, image_mode=image_export_mode - ) - - # Export HTML format: - if export_html: - fname = output_dir / f"{doc_filename}.html" - _log.info(f"writing HTML output to {fname}") - conv_res.document.save_as_html( - filename=fname, image_mode=image_export_mode - ) - - # Export Text format: - if export_txt: - fname = output_dir / f"{doc_filename}.txt" - _log.info(f"writing TXT output to {fname}") - conv_res.document.save_as_markdown( - filename=fname, - strict_text=True, - image_mode=ImageRefMode.PLACEHOLDER, - ) - - # Export Markdown format: - if export_md: - fname = output_dir / f"{doc_filename}.md" - _log.info(f"writing Markdown output to {fname}") - conv_res.document.save_as_markdown( - filename=fname, image_mode=image_export_mode - ) - - # Export Document Tags format: - if export_doctags: - fname = output_dir / f"{doc_filename}.doctags" - _log.info(f"writing Doc Tags output to {fname}") - conv_res.document.save_as_document_tokens(filename=fname) - - else: - _log.warning(f"Document {conv_res.input.file} failed to convert.") - failure_count += 1 - - _log.info( - f"Processed {success_count + failure_count} docs, of which {failure_count} failed" - ) - - -def _split_list(raw: Optional[str]) -> Optional[List[str]]: - if raw is None: - return None - return re.split(r"[;,]", raw) - - -@app.command(no_args_is_help=True) -def convert( - input_sources: Annotated[ - List[str], - typer.Argument( - ..., - metavar="source", - help="PDF files to convert. 
Can be local file / directory paths or URL.", - ), - ], - from_formats: List[InputFormat] = typer.Option( - None, - "--from", - help="Specify input formats to convert from. Defaults to all formats.", - ), - to_formats: List[OutputFormat] = typer.Option( - None, "--to", help="Specify output formats. Defaults to Markdown." - ), - headers: str = typer.Option( - None, - "--headers", - help="Specify http request headers used when fetching url input sources in the form of a JSON string", - ), - image_export_mode: Annotated[ - ImageRefMode, - typer.Option( - ..., - help="Image export mode for the document (only in case of JSON, Markdown or HTML). With `placeholder`, only the position of the image is marked in the output. In `embedded` mode, the image is embedded as base64 encoded string. In `referenced` mode, the image is exported in PNG format and referenced from the main exported document.", - ), - ] = ImageRefMode.EMBEDDED, - ocr: Annotated[ - bool, - typer.Option( - ..., help="If enabled, the bitmap content will be processed using OCR." - ), - ] = True, - force_ocr: Annotated[ - bool, - typer.Option( - ..., - help="Replace any existing text with OCR generated text over the full content.", - ), - ] = False, - ocr_engine: Annotated[ - OcrEngine, typer.Option(..., help="The OCR engine to use.") - ] = OcrEngine.EASYOCR, - ocr_lang: Annotated[ - Optional[str], - typer.Option( - ..., - help="Provide a comma-separated list of languages used by the OCR engine. 
Note that each OCR engine has different values for the language names.",
-        ),
-    ] = None,
-    pdf_backend: Annotated[
-        PdfBackend, typer.Option(..., help="The PDF backend to use.")
-    ] = PdfBackend.DLPARSE_V2,
-    table_mode: Annotated[
-        TableFormerMode,
-        typer.Option(..., help="The mode to use in the table structure model."),
-    ] = TableFormerMode.FAST,
-    enrich_code: Annotated[
-        bool,
-        typer.Option(..., help="Enable the code enrichment model in the pipeline."),
-    ] = False,
-    enrich_formula: Annotated[
-        bool,
-        typer.Option(..., help="Enable the formula enrichment model in the pipeline."),
-    ] = False,
-    enrich_picture_classes: Annotated[
-        bool,
-        typer.Option(
-            ...,
-            help="Enable the picture classification enrichment model in the pipeline.",
-        ),
-    ] = False,
-    enrich_picture_description: Annotated[
-        bool,
-        typer.Option(..., help="Enable the picture description model in the pipeline."),
-    ] = False,
-    artifacts_path: Annotated[
-        Optional[Path],
-        typer.Option(..., help="If provided, the location of the model artifacts."),
-    ] = None,
-    abort_on_error: Annotated[
-        bool,
-        typer.Option(
-            ...,
-            "--abort-on-error/--no-abort-on-error",
-            help="If enabled, processing is aborted when the first error is encountered.",
-        ),
-    ] = False,
-    output: Annotated[
-        Path, typer.Option(..., help="Output directory where results are saved.")
-    ] = Path("."),
-    verbose: Annotated[
-        int,
-        typer.Option(
-            "--verbose",
-            "-v",
-            count=True,
-            help="Set the verbosity level. 
-v for info logging, -vv for debug logging.", - ), - ] = 0, - debug_visualize_cells: Annotated[ - bool, - typer.Option(..., help="Enable debug output which visualizes the PDF cells"), - ] = False, - debug_visualize_ocr: Annotated[ - bool, - typer.Option(..., help="Enable debug output which visualizes the OCR cells"), - ] = False, - debug_visualize_layout: Annotated[ - bool, - typer.Option( - ..., help="Enable debug output which visualizes the layout clusters" - ), - ] = False, - debug_visualize_tables: Annotated[ - bool, - typer.Option(..., help="Enable debug output which visualizes the table cells"), - ] = False, - version: Annotated[ - Optional[bool], - typer.Option( - "--version", - callback=version_callback, - is_eager=True, - help="Show version information.", - ), - ] = None, - document_timeout: Annotated[ - Optional[float], - typer.Option( - ..., - help="The timeout for processing each document, in seconds.", - ), - ] = None, - num_threads: Annotated[int, typer.Option(..., help="Number of threads")] = 4, - device: Annotated[ - AcceleratorDevice, typer.Option(..., help="Accelerator device") - ] = AcceleratorDevice.AUTO, -): - if verbose == 0: - logging.basicConfig(level=logging.WARNING) - elif verbose == 1: - logging.basicConfig(level=logging.INFO) - elif verbose == 2: - logging.basicConfig(level=logging.DEBUG) - - settings.debug.visualize_cells = debug_visualize_cells - settings.debug.visualize_layout = debug_visualize_layout - settings.debug.visualize_tables = debug_visualize_tables - settings.debug.visualize_ocr = debug_visualize_ocr - - if from_formats is None: - from_formats = [e for e in InputFormat] - - parsed_headers: Optional[Dict[str, str]] = None - if headers is not None: - headers_t = TypeAdapter(Dict[str, str]) - parsed_headers = headers_t.validate_json(headers) - - with tempfile.TemporaryDirectory() as tempdir: - input_doc_paths: List[Path] = [] - for src in input_sources: - try: - # check if we can fetch some remote url - source = 
resolve_source_to_path( - source=src, headers=parsed_headers, workdir=Path(tempdir) - ) - input_doc_paths.append(source) - except FileNotFoundError: - err_console.print( - f"[red]Error: The input file {src} does not exist.[/red]" - ) - raise typer.Abort() - except IsADirectoryError: - # if the input matches to a file or a folder - try: - local_path = TypeAdapter(Path).validate_python(src) - if local_path.exists() and local_path.is_dir(): - for fmt in from_formats: - for ext in FormatToExtensions[fmt]: - input_doc_paths.extend( - list(local_path.glob(f"**/*.{ext}")) - ) - input_doc_paths.extend( - list(local_path.glob(f"**/*.{ext.upper()}")) - ) - elif local_path.exists(): - input_doc_paths.append(local_path) - else: - err_console.print( - f"[red]Error: The input file {src} does not exist.[/red]" - ) - raise typer.Abort() - except Exception as err: - err_console.print(f"[red]Error: Cannot read the input {src}.[/red]") - _log.info(err) # will print more details if verbose is activated - raise typer.Abort() - - if to_formats is None: - to_formats = [OutputFormat.MARKDOWN] - - export_json = OutputFormat.JSON in to_formats - export_html = OutputFormat.HTML in to_formats - export_md = OutputFormat.MARKDOWN in to_formats - export_txt = OutputFormat.TEXT in to_formats - export_doctags = OutputFormat.DOCTAGS in to_formats - - if ocr_engine == OcrEngine.EASYOCR: - ocr_options: OcrOptions = EasyOcrOptions(force_full_page_ocr=force_ocr) - elif ocr_engine == OcrEngine.TESSERACT_CLI: - ocr_options = TesseractCliOcrOptions(force_full_page_ocr=force_ocr) - elif ocr_engine == OcrEngine.TESSERACT: - ocr_options = TesseractOcrOptions(force_full_page_ocr=force_ocr) - elif ocr_engine == OcrEngine.OCRMAC: - ocr_options = OcrMacOptions(force_full_page_ocr=force_ocr) - elif ocr_engine == OcrEngine.RAPIDOCR: - ocr_options = RapidOcrOptions(force_full_page_ocr=force_ocr) - else: - raise RuntimeError(f"Unexpected OCR engine type {ocr_engine}") - - ocr_lang_list = _split_list(ocr_lang) - if 
ocr_lang_list is not None: - ocr_options.lang = ocr_lang_list - - accelerator_options = AcceleratorOptions(num_threads=num_threads, device=device) - pipeline_options = PdfPipelineOptions( - accelerator_options=accelerator_options, - do_ocr=ocr, - ocr_options=ocr_options, - do_table_structure=True, - do_code_enrichment=enrich_code, - do_formula_enrichment=enrich_formula, - do_picture_description=enrich_picture_description, - do_picture_classification=enrich_picture_classes, - document_timeout=document_timeout, - ) - pipeline_options.table_structure_options.do_cell_matching = ( - True # do_cell_matching - ) - pipeline_options.table_structure_options.mode = table_mode - - if image_export_mode != ImageRefMode.PLACEHOLDER: - pipeline_options.generate_page_images = True - pipeline_options.generate_picture_images = ( - True # FIXME: to be deprecated in version 3 - ) - pipeline_options.images_scale = 2 - - if artifacts_path is not None: - pipeline_options.artifacts_path = artifacts_path - - if pdf_backend == PdfBackend.DLPARSE_V1: - backend: Type[PdfDocumentBackend] = DoclingParseDocumentBackend - elif pdf_backend == PdfBackend.DLPARSE_V2: - backend = DoclingParseV2DocumentBackend - elif pdf_backend == PdfBackend.PYPDFIUM2: - backend = PyPdfiumDocumentBackend - else: - raise RuntimeError(f"Unexpected PDF backend type {pdf_backend}") - - pdf_format_option = PdfFormatOption( - pipeline_options=pipeline_options, - backend=backend, # pdf_backend - ) - format_options: Dict[InputFormat, FormatOption] = { - InputFormat.PDF: pdf_format_option, - InputFormat.IMAGE: pdf_format_option, - } - doc_converter = DocumentConverter( - allowed_formats=from_formats, - format_options=format_options, - ) - - start_time = time.time() - - conv_results = doc_converter.convert_all( - input_doc_paths, headers=parsed_headers, raises_on_error=abort_on_error - ) - - output.mkdir(parents=True, exist_ok=True) - export_documents( - conv_results, - output_dir=output, - export_json=export_json, - 
export_html=export_html, - export_md=export_md, - export_txt=export_txt, - export_doctags=export_doctags, - image_export_mode=image_export_mode, - ) - - end_time = time.time() - start_time - - _log.info(f"All documents were converted in {end_time:.2f} seconds.") - - -click_app = typer.main.get_command(app) - -if __name__ == "__main__": - app() diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/models.py b/Paper2Video/src/evaluation/PresentQuiz/docling/cli/models.py deleted file mode 100644 index 3b62ad6b6761e603101920ad5867d5143b40f1e4..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/models.py +++ /dev/null @@ -1,107 +0,0 @@ -import logging -import warnings -from enum import Enum -from pathlib import Path -from typing import Annotated, Optional - -import typer -from rich.console import Console -from rich.logging import RichHandler - -from docling.datamodel.settings import settings -from docling.utils.model_downloader import download_models - -warnings.filterwarnings(action="ignore", category=UserWarning, module="pydantic|torch") -warnings.filterwarnings(action="ignore", category=FutureWarning, module="easyocr") - -console = Console() -err_console = Console(stderr=True) - - -app = typer.Typer( - name="Docling models helper", - no_args_is_help=True, - add_completion=False, - pretty_exceptions_enable=False, -) - - -class _AvailableModels(str, Enum): - LAYOUT = "layout" - TABLEFORMER = "tableformer" - CODE_FORMULA = "code_formula" - PICTURE_CLASSIFIER = "picture_classifier" - SMOLVLM = "smolvlm" - EASYOCR = "easyocr" - - -@app.command("download") -def download( - output_dir: Annotated[ - Path, - typer.Option( - ..., - "-o", - "--output-dir", - help="The directory where all the models are downloaded.", - ), - ] = (settings.cache_dir / "models"), - force: Annotated[ - bool, typer.Option(..., help="If true, the download will be forced") - ] = False, - models: Annotated[ - Optional[list[_AvailableModels]], - 
typer.Argument( - help=f"Models to download (default behavior: all will be downloaded)", - ), - ] = None, - quiet: Annotated[ - bool, - typer.Option( - ..., - "-q", - "--quiet", - help="No extra output is generated, the CLI prints only the directory with the cached models.", - ), - ] = False, -): - if not quiet: - FORMAT = "%(message)s" - logging.basicConfig( - level=logging.INFO, - format="[blue]%(message)s[/blue]", - datefmt="[%X]", - handlers=[RichHandler(show_level=False, show_time=False, markup=True)], - ) - to_download = models or [m for m in _AvailableModels] - output_dir = download_models( - output_dir=output_dir, - force=force, - progress=(not quiet), - with_layout=_AvailableModels.LAYOUT in to_download, - with_tableformer=_AvailableModels.TABLEFORMER in to_download, - with_code_formula=_AvailableModels.CODE_FORMULA in to_download, - with_picture_classifier=_AvailableModels.PICTURE_CLASSIFIER in to_download, - with_smolvlm=_AvailableModels.SMOLVLM in to_download, - with_easyocr=_AvailableModels.EASYOCR in to_download, - ) - - if quiet: - typer.echo(output_dir) - else: - typer.secho(f"\nModels downloaded into: {output_dir}.", fg="green") - - console.print( - "\n", - "Docling can now be configured for running offline using the local artifacts.\n\n", - "Using the CLI:", - f"`docling --artifacts-path={output_dir} FILE`", - "\n", - "Using Python: see the documentation at .", - ) - - -click_app = typer.main.get_command(app) - -if __name__ == "__main__": - app() diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/tools.py b/Paper2Video/src/evaluation/PresentQuiz/docling/cli/tools.py deleted file mode 100644 index 8711013c93044466477419885550a460f13d1444..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/cli/tools.py +++ /dev/null @@ -1,17 +0,0 @@ -import typer - -from docling.cli.models import app as models_app - -app = typer.Typer( - name="Docling helpers", - no_args_is_help=True, - add_completion=False, - 
pretty_exceptions_enable=False, -) - -app.add_typer(models_app, name="models") - -click_app = typer.main.get_command(app) - -if __name__ == "__main__": - app() diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/base_models.py b/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/base_models.py deleted file mode 100644 index d1e7ce3aedc28bb6da94831282ac1c76fa7b7a27..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/base_models.py +++ /dev/null @@ -1,258 +0,0 @@ -from enum import Enum -from typing import TYPE_CHECKING, Dict, List, Optional, Union - -from docling_core.types.doc import ( - BoundingBox, - DocItemLabel, - NodeItem, - PictureDataType, - Size, - TableCell, -) -from docling_core.types.io import ( # DO ΝΟΤ REMOVE; explicitly exposed from this location - DocumentStream, -) -from PIL.Image import Image -from pydantic import BaseModel, ConfigDict - -if TYPE_CHECKING: - from docling.backend.pdf_backend import PdfPageBackend - - -class ConversionStatus(str, Enum): - PENDING = "pending" - STARTED = "started" - FAILURE = "failure" - SUCCESS = "success" - PARTIAL_SUCCESS = "partial_success" - SKIPPED = "skipped" - - -class InputFormat(str, Enum): - """A document format supported by document backend parsers.""" - - DOCX = "docx" - PPTX = "pptx" - HTML = "html" - XML_PUBMED = "xml_pubmed" - IMAGE = "image" - PDF = "pdf" - ASCIIDOC = "asciidoc" - MD = "md" - XLSX = "xlsx" - XML_USPTO = "xml_uspto" - JSON_DOCLING = "json_docling" - - -class OutputFormat(str, Enum): - MARKDOWN = "md" - JSON = "json" - HTML = "html" - TEXT = "text" - DOCTAGS = "doctags" - - -FormatToExtensions: Dict[InputFormat, List[str]] = { - InputFormat.DOCX: 
["docx", "dotx", "docm", "dotm"], - InputFormat.PPTX: ["pptx", "potx", "ppsx", "pptm", "potm", "ppsm"], - InputFormat.PDF: ["pdf"], - InputFormat.MD: ["md"], - InputFormat.HTML: ["html", "htm", "xhtml"], - InputFormat.XML_PUBMED: ["xml", "nxml"], - InputFormat.IMAGE: ["jpg", "jpeg", "png", "tif", "tiff", "bmp"], - InputFormat.ASCIIDOC: ["adoc", "asciidoc", "asc"], - InputFormat.XLSX: ["xlsx"], - InputFormat.XML_USPTO: ["xml", "txt"], - InputFormat.JSON_DOCLING: ["json"], -} - -FormatToMimeType: Dict[InputFormat, List[str]] = { - InputFormat.DOCX: [ - "application/vnd.openxmlformats-officedocument.wordprocessingml.document", - "application/vnd.openxmlformats-officedocument.wordprocessingml.template", - ], - InputFormat.PPTX: [ - "application/vnd.openxmlformats-officedocument.presentationml.template", - "application/vnd.openxmlformats-officedocument.presentationml.slideshow", - "application/vnd.openxmlformats-officedocument.presentationml.presentation", - ], - InputFormat.HTML: ["text/html", "application/xhtml+xml"], - InputFormat.XML_PUBMED: ["application/xml"], - InputFormat.IMAGE: [ - "image/png", - "image/jpeg", - "image/tiff", - "image/gif", - "image/bmp", - ], - InputFormat.PDF: ["application/pdf"], - InputFormat.ASCIIDOC: ["text/asciidoc"], - InputFormat.MD: ["text/markdown", "text/x-markdown"], - InputFormat.XLSX: [ - "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" - ], - InputFormat.XML_USPTO: ["application/xml", "text/plain"], - InputFormat.JSON_DOCLING: ["application/json"], -} - -MimeTypeToFormat: dict[str, list[InputFormat]] = { - mime: [fmt for fmt in FormatToMimeType if mime in FormatToMimeType[fmt]] - for value in FormatToMimeType.values() - for mime in value -} - - -class DocInputType(str, Enum): - PATH = "path" - STREAM = "stream" - - -class DoclingComponentType(str, Enum): - DOCUMENT_BACKEND = "document_backend" - MODEL = "model" - DOC_ASSEMBLER = "doc_assembler" - USER_INPUT = "user_input" - - -class ErrorItem(BaseModel): - 
component_type: DoclingComponentType - module_name: str - error_message: str - - -class Cell(BaseModel): - id: int - text: str - bbox: BoundingBox - - -class OcrCell(Cell): - confidence: float - - -class Cluster(BaseModel): - id: int - label: DocItemLabel - bbox: BoundingBox - confidence: float = 1.0 - cells: List[Cell] = [] - children: List["Cluster"] = [] # Add child cluster support - - -class BasePageElement(BaseModel): - label: DocItemLabel - id: int - page_no: int - cluster: Cluster - text: Optional[str] = None - - -class LayoutPrediction(BaseModel): - clusters: List[Cluster] = [] - - -class ContainerElement( - BasePageElement -): # Used for Form and Key-Value-Regions, only for typing. - pass - - -class Table(BasePageElement): - otsl_seq: List[str] - num_rows: int = 0 - num_cols: int = 0 - table_cells: List[TableCell] - - -class TableStructurePrediction(BaseModel): - table_map: Dict[int, Table] = {} - - -class TextElement(BasePageElement): - text: str - - -class FigureElement(BasePageElement): - annotations: List[PictureDataType] = [] - provenance: Optional[str] = None - predicted_class: Optional[str] = None - confidence: Optional[float] = None - - -class FigureClassificationPrediction(BaseModel): - figure_count: int = 0 - figure_map: Dict[int, FigureElement] = {} - - -class EquationPrediction(BaseModel): - equation_count: int = 0 - equation_map: Dict[int, TextElement] = {} - - -class PagePredictions(BaseModel): - layout: Optional[LayoutPrediction] = None - tablestructure: Optional[TableStructurePrediction] = None - figures_classification: Optional[FigureClassificationPrediction] = None - equations_prediction: Optional[EquationPrediction] = None - - -PageElement = Union[TextElement, Table, FigureElement, ContainerElement] - - -class AssembledUnit(BaseModel): - elements: List[PageElement] = [] - body: List[PageElement] = [] - headers: List[PageElement] = [] - - -class ItemAndImageEnrichmentElement(BaseModel): - model_config = 
ConfigDict(arbitrary_types_allowed=True) - - item: NodeItem - image: Image - - -class Page(BaseModel): - model_config = ConfigDict(arbitrary_types_allowed=True) - - page_no: int - # page_hash: Optional[str] = None - size: Optional[Size] = None - cells: List[Cell] = [] - predictions: PagePredictions = PagePredictions() - assembled: Optional[AssembledUnit] = None - - _backend: Optional["PdfPageBackend"] = ( - None # Internal PDF backend. By default it is cleared during assembling. - ) - _default_image_scale: float = 1.0 # Default image scale for external usage. - _image_cache: Dict[float, Image] = ( - {} - ) # Cache of images in different scales. By default it is cleared during assembling. - - def get_image( - self, scale: float = 1.0, cropbox: Optional[BoundingBox] = None - ) -> Optional[Image]: - if self._backend is None: - return self._image_cache.get(scale, None) - - if not scale in self._image_cache: - if cropbox is None: - self._image_cache[scale] = self._backend.get_page_image(scale=scale) - else: - return self._backend.get_page_image(scale=scale, cropbox=cropbox) - - if cropbox is None: - return self._image_cache[scale] - else: - page_im = self._image_cache[scale] - assert self.size is not None - return page_im.crop( - cropbox.to_top_left_origin(page_height=self.size.height) - .scaled(scale=scale) - .as_tuple() - ) - - @property - def image(self) -> Optional[Image]: - return self.get_image(scale=self._default_image_scale) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/document.py b/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/document.py deleted file mode 100644 index d887fed942930f0344ec7beb63572d5383129161..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/document.py +++ /dev/null @@ -1,394 +0,0 @@ -import logging -import re -from enum import Enum -from io import BytesIO -from pathlib import Path, PurePath -from typing import ( - TYPE_CHECKING, - Dict, - Iterable, - 
List, - Literal, - Optional, - Set, - Type, - Union, -) - -import filetype -from docling_core.types.doc import ( - DocItem, - DocItemLabel, - DoclingDocument, - PictureItem, - SectionHeaderItem, - TableItem, - TextItem, -) -from docling_core.types.doc.document import ListItem -from docling_core.types.legacy_doc.base import ( - BaseText, - Figure, - GlmTableCell, - PageDimensions, - PageReference, - Prov, - Ref, -) -from docling_core.types.legacy_doc.base import Table as DsSchemaTable -from docling_core.types.legacy_doc.base import TableCell -from docling_core.types.legacy_doc.document import ( - CCSDocumentDescription as DsDocumentDescription, -) -from docling_core.types.legacy_doc.document import CCSFileInfoObject as DsFileInfoObject -from docling_core.types.legacy_doc.document import ExportedCCSDocument as DsDocument -from docling_core.utils.file import resolve_source_to_stream -from docling_core.utils.legacy import docling_document_to_legacy -from pydantic import BaseModel -from typing_extensions import deprecated - -from docling.backend.abstract_backend import ( - AbstractDocumentBackend, - PaginatedDocumentBackend, -) -from docling.datamodel.base_models import ( - AssembledUnit, - ConversionStatus, - DocumentStream, - ErrorItem, - FormatToExtensions, - FormatToMimeType, - InputFormat, - MimeTypeToFormat, - Page, -) -from docling.datamodel.settings import DocumentLimits -from docling.utils.profiling import ProfilingItem -from docling.utils.utils import create_file_hash, create_hash - -if TYPE_CHECKING: - from docling.document_converter import FormatOption - -_log = logging.getLogger(__name__) - -layout_label_to_ds_type = { - DocItemLabel.TITLE: "title", - DocItemLabel.DOCUMENT_INDEX: "table", - DocItemLabel.SECTION_HEADER: "subtitle-level-1", - DocItemLabel.CHECKBOX_SELECTED: "checkbox-selected", - DocItemLabel.CHECKBOX_UNSELECTED: "checkbox-unselected", - DocItemLabel.CAPTION: "caption", - DocItemLabel.PAGE_HEADER: "page-header", - DocItemLabel.PAGE_FOOTER: 
"page-footer", - DocItemLabel.FOOTNOTE: "footnote", - DocItemLabel.TABLE: "table", - DocItemLabel.FORMULA: "equation", - DocItemLabel.LIST_ITEM: "paragraph", - DocItemLabel.CODE: "paragraph", - DocItemLabel.PICTURE: "figure", - DocItemLabel.TEXT: "paragraph", - DocItemLabel.PARAGRAPH: "paragraph", - DocItemLabel.FORM: DocItemLabel.FORM.value, - DocItemLabel.KEY_VALUE_REGION: DocItemLabel.KEY_VALUE_REGION.value, -} - -_EMPTY_DOCLING_DOC = DoclingDocument(name="dummy") - - -class InputDocument(BaseModel): - file: PurePath - document_hash: str # = None - valid: bool = True - limits: DocumentLimits = DocumentLimits() - format: InputFormat # = None - - filesize: Optional[int] = None - page_count: int = 0 - - _backend: AbstractDocumentBackend # Internal PDF backend used - - def __init__( - self, - path_or_stream: Union[BytesIO, Path], - format: InputFormat, - backend: Type[AbstractDocumentBackend], - filename: Optional[str] = None, - limits: Optional[DocumentLimits] = None, - ): - super().__init__( - file="", document_hash="", format=InputFormat.PDF - ) # initialize with dummy values - - self.limits = limits or DocumentLimits() - self.format = format - - try: - if isinstance(path_or_stream, Path): - self.file = path_or_stream - self.filesize = path_or_stream.stat().st_size - if self.filesize > self.limits.max_file_size: - self.valid = False - else: - self.document_hash = create_file_hash(path_or_stream) - self._init_doc(backend, path_or_stream) - - elif isinstance(path_or_stream, BytesIO): - assert ( - filename is not None - ), "Can't construct InputDocument from stream without providing filename arg." 
- self.file = PurePath(filename) - self.filesize = path_or_stream.getbuffer().nbytes - - if self.filesize > self.limits.max_file_size: - self.valid = False - else: - self.document_hash = create_file_hash(path_or_stream) - self._init_doc(backend, path_or_stream) - else: - raise RuntimeError( - f"Unexpected type path_or_stream: {type(path_or_stream)}" - ) - - # For paginated backends, check if the maximum page count is exceeded. - if self.valid and self._backend.is_valid(): - if self._backend.supports_pagination() and isinstance( - self._backend, PaginatedDocumentBackend - ): - self.page_count = self._backend.page_count() - if not self.page_count <= self.limits.max_num_pages: - self.valid = False - elif self.page_count < self.limits.page_range[0]: - self.valid = False - - except (FileNotFoundError, OSError) as e: - self.valid = False - _log.exception( - f"File {self.file.name} not found or cannot be opened.", exc_info=e - ) - # raise - except RuntimeError as e: - self.valid = False - _log.exception( - f"An unexpected error occurred while opening the document {self.file.name}", - exc_info=e, - ) - # raise - - def _init_doc( - self, - backend: Type[AbstractDocumentBackend], - path_or_stream: Union[BytesIO, Path], - ) -> None: - self._backend = backend(self, path_or_stream=path_or_stream) - if not self._backend.is_valid(): - self.valid = False - - -class DocumentFormat(str, Enum): - V2 = "v2" - V1 = "v1" - - -class ConversionResult(BaseModel): - input: InputDocument - - status: ConversionStatus = ConversionStatus.PENDING # failure, success - errors: List[ErrorItem] = [] # structure to keep errors - - pages: List[Page] = [] - assembled: AssembledUnit = AssembledUnit() - timings: Dict[str, ProfilingItem] = {} - - document: DoclingDocument = _EMPTY_DOCLING_DOC - - @property - @deprecated("Use document instead.") - def legacy_document(self): - return docling_document_to_legacy(self.document) - - -class _DummyBackend(AbstractDocumentBackend): - def __init__(self, *args, 
**kwargs): - super().__init__(*args, **kwargs) - - def is_valid(self) -> bool: - return False - - @classmethod - def supported_formats(cls) -> Set[InputFormat]: - return set() - - @classmethod - def supports_pagination(cls) -> bool: - return False - - def unload(self): - return super().unload() - - -class _DocumentConversionInput(BaseModel): - - path_or_stream_iterator: Iterable[Union[Path, str, DocumentStream]] - headers: Optional[Dict[str, str]] = None - limits: Optional[DocumentLimits] = DocumentLimits() - - def docs( - self, format_options: Dict[InputFormat, "FormatOption"] - ) -> Iterable[InputDocument]: - for item in self.path_or_stream_iterator: - obj = ( - resolve_source_to_stream(item, self.headers) - if isinstance(item, str) - else item - ) - format = self._guess_format(obj) - backend: Type[AbstractDocumentBackend] - if format not in format_options.keys(): - _log.error( - f"Input document {obj.name} does not match any allowed format." - ) - backend = _DummyBackend - else: - backend = format_options[format].backend - - if isinstance(obj, Path): - yield InputDocument( - path_or_stream=obj, - format=format, # type: ignore[arg-type] - filename=obj.name, - limits=self.limits, - backend=backend, - ) - elif isinstance(obj, DocumentStream): - yield InputDocument( - path_or_stream=obj.stream, - format=format, # type: ignore[arg-type] - filename=obj.name, - limits=self.limits, - backend=backend, - ) - else: - raise RuntimeError(f"Unexpected obj type in iterator: {type(obj)}") - - def _guess_format(self, obj: Union[Path, DocumentStream]) -> Optional[InputFormat]: - content = b"" # empty binary blob - formats: list[InputFormat] = [] - - if isinstance(obj, Path): - mime = filetype.guess_mime(str(obj)) - if mime is None: - ext = obj.suffix[1:] - mime = _DocumentConversionInput._mime_from_extension(ext) - if mime is None: # must guess from - with obj.open("rb") as f: - content = f.read(1024) # Read first 1KB - - elif isinstance(obj, DocumentStream): - content = 
obj.stream.read(8192) - obj.stream.seek(0) - mime = filetype.guess_mime(content) - if mime is None: - ext = ( - obj.name.rsplit(".", 1)[-1] - if ("." in obj.name and not obj.name.startswith(".")) - else "" - ) - mime = _DocumentConversionInput._mime_from_extension(ext) - - mime = mime or _DocumentConversionInput._detect_html_xhtml(content) - mime = mime or "text/plain" - formats = MimeTypeToFormat.get(mime, []) - if formats: - if len(formats) == 1 and mime not in ("text/plain",): - return formats[0] - else: # ambiguity in formats - return _DocumentConversionInput._guess_from_content( - content, mime, formats - ) - else: - return None - - @staticmethod - def _guess_from_content( - content: bytes, mime: str, formats: list[InputFormat] - ) -> Optional[InputFormat]: - """Guess the input format of a document by checking part of its content.""" - input_format: Optional[InputFormat] = None - content_str = content.decode("utf-8") - - if mime == "application/xml": - match_doctype = re.search(r"<!DOCTYPE [^>]+>", content_str) - if match_doctype: - xml_doctype = match_doctype.group() - if InputFormat.XML_USPTO in formats and any( - item in xml_doctype - for item in ( - "us-patent-application-v4", - "us-patent-grant-v4", - "us-grant-025", - "patent-application-publication", - ) - ): - input_format = InputFormat.XML_USPTO - - if ( - InputFormat.XML_PUBMED in formats - and "/NLM//DTD JATS" in xml_doctype - ): - input_format = InputFormat.XML_PUBMED - - elif mime == "text/plain": - if InputFormat.XML_USPTO in formats and content_str.startswith("PATN\r\n"): - input_format = InputFormat.XML_USPTO - - return input_format - - @staticmethod - def _mime_from_extension(ext): - mime = None - if ext in FormatToExtensions[InputFormat.ASCIIDOC]: - mime = FormatToMimeType[InputFormat.ASCIIDOC][0] - elif ext in FormatToExtensions[InputFormat.HTML]: - mime = FormatToMimeType[InputFormat.HTML][0] - elif ext in FormatToExtensions[InputFormat.MD]: - mime = FormatToMimeType[InputFormat.MD][0] - elif ext in 
FormatToExtensions[InputFormat.JSON_DOCLING]: - mime = FormatToMimeType[InputFormat.JSON_DOCLING][0] - elif ext in FormatToExtensions[InputFormat.PDF]: - mime = FormatToMimeType[InputFormat.PDF][0] - return mime - - @staticmethod - def _detect_html_xhtml( - content: bytes, - ) -> Optional[Literal["application/xhtml+xml", "application/xml", "text/html"]]: - """Guess the mime type of an XHTML, HTML, or XML file from its content. - - Args: - content: A short piece of a document from its beginning. - - Returns: - The mime type of an XHTML, HTML, or XML file, or None if the content does - not match any of these formats. - """ - content_str = content.decode("ascii", errors="ignore").lower() - # Remove XML comments - content_str = re.sub(r"<!--(.*?)-->", "", content_str, flags=re.DOTALL) - content_str = content_str.lstrip() - - if re.match(r"<\?xml", content_str): - if "xhtml" in content_str[:1000]: - return "application/xhtml+xml" - else: - return "application/xml" - - if re.match(r"<!doctype\s+html|<head\s*>|<body\s*>|<html\s*>", content_str): - return "text/html" - - p = re.compile( - r"<!doctype\s+(?P<root>[a-zA-Z_:][a-zA-Z0-9_:.-]*)\s+.*>\s*<(?P=root)\b" - ) - if p.search(content_str): - return "application/xml" - - return None diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/pipeline_options.py b/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/pipeline_options.py deleted file mode 100644 index 3b6401b649679d4235e2b77abffdc40ff615de55..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/pipeline_options.py +++ /dev/null @@ -1,296 +0,0 @@ -import logging -import os -from enum import Enum -from pathlib import Path -from typing import Annotated, Any, Dict, List, Literal, Optional, Union - -from pydantic import AnyUrl, BaseModel, ConfigDict, Field, model_validator -from pydantic_settings import BaseSettings, SettingsConfigDict - -_log = logging.getLogger(__name__) - - -class AcceleratorDevice(str, Enum): - """Devices to run model inference""" - - AUTO = "auto" - CPU = "cpu" - CUDA = "cuda" - MPS = "mps" - - -class 
AcceleratorOptions(BaseSettings): - model_config = SettingsConfigDict( - env_prefix="DOCLING_", env_nested_delimiter="_", populate_by_name=True - ) - - num_threads: int = 4 - device: AcceleratorDevice = AcceleratorDevice.AUTO - - @model_validator(mode="before") - @classmethod - def check_alternative_envvars(cls, data: Any) -> Any: - r""" - Set num_threads from the "alternative" envvar OMP_NUM_THREADS. - The alternative envvar is used only if it is valid and the regular envvar is not set. - - Notice: The standard pydantic settings mechanism with parameter "aliases" does not provide - the same functionality. In case the alias envvar is set and the user tries to override the - parameter in settings initialization, Pydantic treats the parameter provided in __init__() - as an extra input instead of simply overwriting the envvar value for that parameter. - """ - if isinstance(data, dict): - input_num_threads = data.get("num_threads") - - # Check whether to set num_threads from the alternative envvar - if input_num_threads is None: - docling_num_threads = os.getenv("DOCLING_NUM_THREADS") - omp_num_threads = os.getenv("OMP_NUM_THREADS") - if docling_num_threads is None and omp_num_threads is not None: - try: - data["num_threads"] = int(omp_num_threads) - except ValueError: - _log.error( - "Ignoring misformatted envvar OMP_NUM_THREADS '%s'", - omp_num_threads, - ) - return data - - -class TableFormerMode(str, Enum): - """Modes for the TableFormer model.""" - - FAST = "fast" - ACCURATE = "accurate" - - -class TableStructureOptions(BaseModel): - """Options for the table structure.""" - - do_cell_matching: bool = ( - True - # True: Matches predictions back to PDF cells. Can break table output if PDF cells - # are merged across table columns. - # False: Let table structure model define the text cells, ignore PDF cells. 
- ) - mode: TableFormerMode = TableFormerMode.FAST - - -class OcrOptions(BaseModel): - """OCR options.""" - - kind: str - lang: List[str] - force_full_page_ocr: bool = False # If enabled, a full page OCR is always applied - bitmap_area_threshold: float = ( - 0.05 # percentage of the area for a bitmap to be processed with OCR - ) - - -class RapidOcrOptions(OcrOptions): - """Options for the RapidOCR engine.""" - - kind: Literal["rapidocr"] = "rapidocr" - - # English and Chinese are the most commonly used models and have been tested with RapidOCR. - lang: List[str] = [ - "english", - "chinese", - ] # However, language as a parameter is not supported by rapidocr yet and hence changing this option doesn't affect anything. - # For more details on supported languages by RapidOCR visit https://rapidai.github.io/RapidOCRDocs/blog/2022/09/28/%E6%94%AF%E6%8C%81%E8%AF%86%E5%88%AB%E8%AF%AD%E8%A8%80/ - - # For more details on the following options visit https://rapidai.github.io/RapidOCRDocs/install_usage/api/RapidOCR/ - text_score: float = 0.5 # same default as rapidocr - - use_det: Optional[bool] = None # same default as rapidocr - use_cls: Optional[bool] = None # same default as rapidocr - use_rec: Optional[bool] = None # same default as rapidocr - - # class Device(Enum): - # CPU = "CPU" - # CUDA = "CUDA" - # DIRECTML = "DIRECTML" - # AUTO = "AUTO" - - # device: Device = Device.AUTO # Default value is AUTO - - print_verbose: bool = False # same default as rapidocr - - det_model_path: Optional[str] = None # same default as rapidocr - cls_model_path: Optional[str] = None # same default as rapidocr - rec_model_path: Optional[str] = None # same default as rapidocr - rec_keys_path: Optional[str] = None # same default as rapidocr - - model_config = ConfigDict( - extra="forbid", - ) - - -class EasyOcrOptions(OcrOptions): - """Options for the EasyOCR engine.""" - - kind: Literal["easyocr"] = "easyocr" - lang: List[str] = ["fr", "de", "es", "en"] - - use_gpu: Optional[bool] = None - - 
confidence_threshold: float = 0.5 - - model_storage_directory: Optional[str] = None - recog_network: Optional[str] = "standard" - download_enabled: bool = True - - model_config = ConfigDict( - extra="forbid", - protected_namespaces=(), - ) - - -class TesseractCliOcrOptions(OcrOptions): - """Options for the TesseractCli engine.""" - - kind: Literal["tesseract"] = "tesseract" - lang: List[str] = ["fra", "deu", "spa", "eng"] - tesseract_cmd: str = "tesseract" - path: Optional[str] = None - - model_config = ConfigDict( - extra="forbid", - ) - - -class TesseractOcrOptions(OcrOptions): - """Options for the Tesseract engine.""" - - kind: Literal["tesserocr"] = "tesserocr" - lang: List[str] = ["fra", "deu", "spa", "eng"] - path: Optional[str] = None - - model_config = ConfigDict( - extra="forbid", - ) - - -class OcrMacOptions(OcrOptions): - """Options for the Mac OCR engine.""" - - kind: Literal["ocrmac"] = "ocrmac" - lang: List[str] = ["fr-FR", "de-DE", "es-ES", "en-US"] - recognition: str = "accurate" - framework: str = "vision" - - model_config = ConfigDict( - extra="forbid", - ) - - -class PictureDescriptionBaseOptions(BaseModel): - kind: str - batch_size: int = 8 - scale: float = 2 - - bitmap_area_threshold: float = ( - 0.2 # percentage of the area for a bitmap to be processed with the models - ) - - -class PictureDescriptionApiOptions(PictureDescriptionBaseOptions): - kind: Literal["api"] = "api" - - url: AnyUrl = AnyUrl("http://localhost:8000/v1/chat/completions") - headers: Dict[str, str] = {} - params: Dict[str, Any] = {} - timeout: float = 20 - - prompt: str = "Describe this image in a few sentences." - provenance: str = "" - - -class PictureDescriptionVlmOptions(PictureDescriptionBaseOptions): - kind: Literal["vlm"] = "vlm" - - repo_id: str - prompt: str = "Describe this image in a few sentences."
- # Config from here https://huggingface.co/docs/transformers/en/main_classes/text_generation#transformers.GenerationConfig - generation_config: Dict[str, Any] = dict(max_new_tokens=200, do_sample=False) - - @property - def repo_cache_folder(self) -> str: - return self.repo_id.replace("/", "--") - - -smolvlm_picture_description = PictureDescriptionVlmOptions( - repo_id="HuggingFaceTB/SmolVLM-256M-Instruct" -) -# phi_picture_description = PictureDescriptionVlmOptions(repo_id="microsoft/Phi-3-vision-128k-instruct") -granite_picture_description = PictureDescriptionVlmOptions( - repo_id="ibm-granite/granite-vision-3.1-2b-preview", - prompt="What is shown in this image?", -) - - -# Define an enum for the backend options -class PdfBackend(str, Enum): - """Enum of valid PDF backends.""" - - PYPDFIUM2 = "pypdfium2" - DLPARSE_V1 = "dlparse_v1" - DLPARSE_V2 = "dlparse_v2" - - -# Define an enum for the OCR engines -class OcrEngine(str, Enum): - """Enum of valid OCR engines.""" - - EASYOCR = "easyocr" - TESSERACT_CLI = "tesseract_cli" - TESSERACT = "tesseract" - OCRMAC = "ocrmac" - RAPIDOCR = "rapidocr" - - -class PipelineOptions(BaseModel): - """Base pipeline options.""" - - create_legacy_output: bool = ( - True # This default will be set to False in a future version of docling - ) - document_timeout: Optional[float] = None - accelerator_options: AcceleratorOptions = AcceleratorOptions() - - -class PdfPipelineOptions(PipelineOptions): - """Options for the PDF pipeline.""" - - artifacts_path: Optional[Union[Path, str]] = None - do_table_structure: bool = True # True: perform table structure extraction - do_ocr: bool = True # True: perform OCR, replace programmatic PDF text - do_code_enrichment: bool = False # True: perform code OCR - do_formula_enrichment: bool = False # True: perform formula OCR, return LaTeX code - do_picture_classification: bool = False # True: classify pictures in documents - do_picture_description: bool = False # True: describe pictures in documents - 
- table_structure_options: TableStructureOptions = TableStructureOptions() - ocr_options: Union[ - EasyOcrOptions, - TesseractCliOcrOptions, - TesseractOcrOptions, - OcrMacOptions, - RapidOcrOptions, - ] = Field(EasyOcrOptions(), discriminator="kind") - picture_description_options: Annotated[ - Union[PictureDescriptionApiOptions, PictureDescriptionVlmOptions], - Field(discriminator="kind"), - ] = smolvlm_picture_description - - images_scale: float = 1.0 - generate_page_images: bool = False - generate_picture_images: bool = False - generate_table_images: bool = Field( - default=False, - deprecated=( - "Field `generate_table_images` is deprecated. " - "To obtain table images, set `PdfPipelineOptions.generate_page_images = True` " - "before conversion and then use the `TableItem.get_image` function." - ), - ) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/settings.py b/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/settings.py deleted file mode 100644 index 439ffe744b903ff27f576e4b9bbdb0da58a440e4..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/datamodel/settings.py +++ /dev/null @@ -1,67 +0,0 @@ -import sys -from pathlib import Path -from typing import Annotated, Tuple - -from pydantic import BaseModel, PlainValidator -from pydantic_settings import BaseSettings, SettingsConfigDict - - -def _validate_page_range(v: Tuple[int, int]) -> Tuple[int, int]: - if v[0] < 1 or v[1] < v[0]: - raise ValueError( - "Invalid page range: start must be ≥ 1 and end must be ≥ start." 
- ) - return v - - -PageRange = Annotated[Tuple[int, int], PlainValidator(_validate_page_range)] - -DEFAULT_PAGE_RANGE: PageRange = (1, sys.maxsize) - - -class DocumentLimits(BaseModel): - max_num_pages: int = sys.maxsize - max_file_size: int = sys.maxsize - page_range: PageRange = DEFAULT_PAGE_RANGE - - -class BatchConcurrencySettings(BaseModel): - doc_batch_size: int = 2 - doc_batch_concurrency: int = 2 - page_batch_size: int = 4 - page_batch_concurrency: int = 2 - elements_batch_size: int = 16 - - # doc_batch_size: int = 1 - # doc_batch_concurrency: int = 1 - # page_batch_size: int = 1 - # page_batch_concurrency: int = 1 - - # model_concurrency: int = 2 - - # To force models into single core: export OMP_NUM_THREADS=1 - - -class DebugSettings(BaseModel): - visualize_cells: bool = False - visualize_ocr: bool = False - visualize_layout: bool = False - visualize_raw_layout: bool = False - visualize_tables: bool = False - - profile_pipeline_timings: bool = False - - # Path used to output debug information. 
- debug_output_path: str = str(Path.cwd() / "debug") - - -class AppSettings(BaseSettings): - model_config = SettingsConfigDict(env_prefix="DOCLING_", env_nested_delimiter="_") - - perf: BatchConcurrencySettings - debug: DebugSettings - - cache_dir: Path = Path.home() / ".cache" / "docling" - - -settings = AppSettings(perf=BatchConcurrencySettings(), debug=DebugSettings()) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/document_converter.py b/Paper2Video/src/evaluation/PresentQuiz/docling/document_converter.py deleted file mode 100644 index d885dd20dee2ce3efae4566b06356abcd6827ad6..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/document_converter.py +++ /dev/null @@ -1,348 +0,0 @@ -import logging -import math -import sys -import time -from functools import partial -from pathlib import Path -from typing import Dict, Iterable, Iterator, List, Optional, Tuple, Type, Union - -from pydantic import BaseModel, ConfigDict, model_validator, validate_call - -from docling.backend.abstract_backend import AbstractDocumentBackend -from docling.backend.asciidoc_backend import AsciiDocBackend -from docling.backend.docling_parse_v2_backend import DoclingParseV2DocumentBackend -from docling.backend.html_backend import HTMLDocumentBackend -from docling.backend.json.docling_json_backend import DoclingJSONBackend -from docling.backend.md_backend import MarkdownDocumentBackend -from docling.backend.msexcel_backend import MsExcelDocumentBackend -from docling.backend.mspowerpoint_backend import MsPowerpointDocumentBackend -from docling.backend.msword_backend import MsWordDocumentBackend -from docling.backend.xml.pubmed_backend import PubMedDocumentBackend -from docling.backend.xml.uspto_backend import PatentUsptoDocumentBackend -from docling.datamodel.base_models import ( - ConversionStatus, - DoclingComponentType, - DocumentStream, - ErrorItem, - InputFormat, -) -from docling.datamodel.document import ( - ConversionResult, - 
InputDocument, - _DocumentConversionInput, -) -from docling.datamodel.pipeline_options import PipelineOptions -from docling.datamodel.settings import ( - DEFAULT_PAGE_RANGE, - DocumentLimits, - PageRange, - settings, -) -from docling.exceptions import ConversionError -from docling.pipeline.base_pipeline import BasePipeline -from docling.pipeline.simple_pipeline import SimplePipeline -from docling.pipeline.standard_pdf_pipeline import StandardPdfPipeline -from docling.utils.utils import chunkify - -_log = logging.getLogger(__name__) - - -class FormatOption(BaseModel): - pipeline_cls: Type[BasePipeline] - pipeline_options: Optional[PipelineOptions] = None - backend: Type[AbstractDocumentBackend] - - model_config = ConfigDict(arbitrary_types_allowed=True) - - @model_validator(mode="after") - def set_optional_field_default(self) -> "FormatOption": - if self.pipeline_options is None: - self.pipeline_options = self.pipeline_cls.get_default_options() - return self - - -class ExcelFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = MsExcelDocumentBackend - - -class WordFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = MsWordDocumentBackend - - -class PowerpointFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = MsPowerpointDocumentBackend - - -class MarkdownFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = MarkdownDocumentBackend - - -class AsciiDocFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = AsciiDocBackend - - -class HTMLFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = HTMLDocumentBackend - - -class PatentUsptoFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[PatentUsptoDocumentBackend] = 
PatentUsptoDocumentBackend - - -class XMLPubMedFormatOption(FormatOption): - pipeline_cls: Type = SimplePipeline - backend: Type[AbstractDocumentBackend] = PubMedDocumentBackend - - -class ImageFormatOption(FormatOption): - pipeline_cls: Type = StandardPdfPipeline - backend: Type[AbstractDocumentBackend] = DoclingParseV2DocumentBackend - - -class PdfFormatOption(FormatOption): - pipeline_cls: Type = StandardPdfPipeline - backend: Type[AbstractDocumentBackend] = DoclingParseV2DocumentBackend - - -def _get_default_option(format: InputFormat) -> FormatOption: - format_to_default_options = { - InputFormat.XLSX: FormatOption( - pipeline_cls=SimplePipeline, backend=MsExcelDocumentBackend - ), - InputFormat.DOCX: FormatOption( - pipeline_cls=SimplePipeline, backend=MsWordDocumentBackend - ), - InputFormat.PPTX: FormatOption( - pipeline_cls=SimplePipeline, backend=MsPowerpointDocumentBackend - ), - InputFormat.MD: FormatOption( - pipeline_cls=SimplePipeline, backend=MarkdownDocumentBackend - ), - InputFormat.ASCIIDOC: FormatOption( - pipeline_cls=SimplePipeline, backend=AsciiDocBackend - ), - InputFormat.HTML: FormatOption( - pipeline_cls=SimplePipeline, backend=HTMLDocumentBackend - ), - InputFormat.XML_USPTO: FormatOption( - pipeline_cls=SimplePipeline, backend=PatentUsptoDocumentBackend - ), - InputFormat.XML_PUBMED: FormatOption( - pipeline_cls=SimplePipeline, backend=PubMedDocumentBackend - ), - InputFormat.IMAGE: FormatOption( - pipeline_cls=StandardPdfPipeline, backend=DoclingParseV2DocumentBackend - ), - InputFormat.PDF: FormatOption( - pipeline_cls=StandardPdfPipeline, backend=DoclingParseV2DocumentBackend - ), - InputFormat.JSON_DOCLING: FormatOption( - pipeline_cls=SimplePipeline, backend=DoclingJSONBackend - ), - } - if (options := format_to_default_options.get(format)) is not None: - return options - else: - raise RuntimeError(f"No default options configured for {format}") - - -class DocumentConverter: - _default_download_filename = "file" - - def __init__( - 
self, - allowed_formats: Optional[List[InputFormat]] = None, - format_options: Optional[Dict[InputFormat, FormatOption]] = None, - ): - self.allowed_formats = ( - allowed_formats if allowed_formats is not None else [e for e in InputFormat] - ) - self.format_to_options = { - format: ( - _get_default_option(format=format) - if (custom_option := (format_options or {}).get(format)) is None - else custom_option - ) - for format in self.allowed_formats - } - self.initialized_pipelines: Dict[Type[BasePipeline], BasePipeline] = {} - - def initialize_pipeline(self, format: InputFormat): - """Initialize the conversion pipeline for the selected format.""" - pipeline = self._get_pipeline(doc_format=format) - if pipeline is None: - raise ConversionError( - f"No pipeline could be initialized for format {format}" - ) - - @validate_call(config=ConfigDict(strict=True)) - def convert( - self, - source: Union[Path, str, DocumentStream], # TODO review naming - headers: Optional[Dict[str, str]] = None, - raises_on_error: bool = True, - max_num_pages: int = sys.maxsize, - max_file_size: int = sys.maxsize, - page_range: PageRange = DEFAULT_PAGE_RANGE, - ) -> ConversionResult: - all_res = self.convert_all( - source=[source], - raises_on_error=raises_on_error, - max_num_pages=max_num_pages, - max_file_size=max_file_size, - headers=headers, - page_range=page_range, - ) - return next(all_res) - - @validate_call(config=ConfigDict(strict=True)) - def convert_all( - self, - source: Iterable[Union[Path, str, DocumentStream]], # TODO review naming - headers: Optional[Dict[str, str]] = None, - raises_on_error: bool = True, # True: raises on first conversion error; False: does not raise on conv error - max_num_pages: int = sys.maxsize, - max_file_size: int = sys.maxsize, - page_range: PageRange = DEFAULT_PAGE_RANGE, - ) -> Iterator[ConversionResult]: - limits = DocumentLimits( - max_num_pages=max_num_pages, - max_file_size=max_file_size, - page_range=page_range, - ) - conv_input = 
_DocumentConversionInput( - path_or_stream_iterator=source, limits=limits, headers=headers - ) - conv_res_iter = self._convert(conv_input, raises_on_error=raises_on_error) - - had_result = False - for conv_res in conv_res_iter: - had_result = True - if raises_on_error and conv_res.status not in { - ConversionStatus.SUCCESS, - ConversionStatus.PARTIAL_SUCCESS, - }: - raise ConversionError( - f"Conversion failed for: {conv_res.input.file} with status: {conv_res.status}" - ) - else: - yield conv_res - - if not had_result and raises_on_error: - raise ConversionError( - "Conversion failed: the provided file has no recognizable format or is not in the list of allowed formats." - ) - - def _convert( - self, conv_input: _DocumentConversionInput, raises_on_error: bool - ) -> Iterator[ConversionResult]: - start_time = time.monotonic() - - for input_batch in chunkify( - conv_input.docs(self.format_to_options), - settings.perf.doc_batch_size, # pass format_options - ): - _log.info("Going to convert document batch...") - - # parallel processing only within input_batch - # with ThreadPoolExecutor( - # max_workers=settings.perf.doc_batch_concurrency - # ) as pool: - # yield from pool.map(self.process_document, input_batch) - # Note: PDF backends are not thread-safe, thread pool usage was disabled. - - for item in map( - partial(self._process_document, raises_on_error=raises_on_error), - input_batch, - ): - elapsed = time.monotonic() - start_time - start_time = time.monotonic() - _log.info( - f"Finished converting document {item.input.file.name} in {elapsed:.2f} sec." - ) - yield item - - def _get_pipeline(self, doc_format: InputFormat) -> Optional[BasePipeline]: - fopt = self.format_to_options.get(doc_format) - - if fopt is None: - return None - else: - pipeline_class = fopt.pipeline_cls - pipeline_options = fopt.pipeline_options - - if pipeline_options is None: - return None - # TODO: this ignores the case where different options have been defined for the same pipeline class.
- if ( - pipeline_class not in self.initialized_pipelines - or self.initialized_pipelines[pipeline_class].pipeline_options - != pipeline_options - ): - self.initialized_pipelines[pipeline_class] = pipeline_class( - pipeline_options=pipeline_options - ) - return self.initialized_pipelines[pipeline_class] - - def _process_document( - self, in_doc: InputDocument, raises_on_error: bool - ) -> ConversionResult: - - valid = ( - self.allowed_formats is not None and in_doc.format in self.allowed_formats - ) - if valid: - conv_res = self._execute_pipeline(in_doc, raises_on_error=raises_on_error) - else: - error_message = f"File format not allowed: {in_doc.file}" - if raises_on_error: - raise ConversionError(error_message) - else: - error_item = ErrorItem( - component_type=DoclingComponentType.USER_INPUT, - module_name="", - error_message=error_message, - ) - conv_res = ConversionResult( - input=in_doc, status=ConversionStatus.SKIPPED, errors=[error_item] - ) - - return conv_res - - def _execute_pipeline( - self, in_doc: InputDocument, raises_on_error: bool - ) -> ConversionResult: - if in_doc.valid: - pipeline = self._get_pipeline(in_doc.format) - if pipeline is not None: - conv_res = pipeline.execute(in_doc, raises_on_error=raises_on_error) - else: - if raises_on_error: - raise ConversionError( - f"No pipeline could be initialized for {in_doc.file}." - ) - else: - conv_res = ConversionResult( - input=in_doc, - status=ConversionStatus.FAILURE, - ) - else: - if raises_on_error: - raise ConversionError(f"Input document {in_doc.file} is not valid.") - - else: - # invalid doc or not of desired format - conv_res = ConversionResult( - input=in_doc, - status=ConversionStatus.FAILURE, - ) - # TODO add error log why it failed. 
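The cache check in `_get_pipeline` rebuilds a pipeline whenever the requested options differ from the cached instance's. That reuse rule can be sketched standalone (simplified types, no docling imports; `DummyPipeline` is a hypothetical stand-in):

```python
class DummyPipeline:
    # Hypothetical pipeline: only records the options it was built with.
    def __init__(self, pipeline_options):
        self.pipeline_options = pipeline_options


class PipelineCache:
    def __init__(self):
        self._cache = {}

    def get(self, pipeline_cls, options):
        # Reuse the cached instance only while its options are unchanged,
        # mirroring the keyed-by-class check in _get_pipeline above.
        cached = self._cache.get(pipeline_cls)
        if cached is None or cached.pipeline_options != options:
            self._cache[pipeline_cls] = pipeline_cls(pipeline_options=options)
        return self._cache[pipeline_cls]


cache = PipelineCache()
a = cache.get(DummyPipeline, {"do_ocr": True})
b = cache.get(DummyPipeline, {"do_ocr": True})   # same instance reused
c = cache.get(DummyPipeline, {"do_ocr": False})  # options changed: rebuilt
print(a is b, a is c)  # True False
```

Note the limitation the TODO calls out: because the cache is keyed by class only, two format options sharing a pipeline class but carrying different options evict each other.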
- - return conv_res diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/exceptions.py b/Paper2Video/src/evaluation/PresentQuiz/docling/exceptions.py deleted file mode 100644 index 13145b9c0a2d0a66c2380fe4606279a256caaa58..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/exceptions.py +++ /dev/null @@ -1,6 +0,0 @@ -class BaseError(RuntimeError): - pass - - -class ConversionError(BaseError): - pass diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/base_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/base_model.py deleted file mode 100644 index 9cdc0ecbdb40651f1b2c351882a0f92cc99becc0..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/base_model.py +++ /dev/null @@ -1,87 +0,0 @@ -from abc import ABC, abstractmethod -from typing import Any, Generic, Iterable, Optional - -from docling_core.types.doc import BoundingBox, DocItem, DoclingDocument, NodeItem -from typing_extensions import TypeVar - -from docling.datamodel.base_models import ItemAndImageEnrichmentElement, Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.settings import settings - - -class BasePageModel(ABC): - @abstractmethod - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - pass - - -EnrichElementT = TypeVar("EnrichElementT", default=NodeItem) - - -class GenericEnrichmentModel(ABC, Generic[EnrichElementT]): - - elements_batch_size: int = settings.perf.elements_batch_size - - @abstractmethod - def is_processable(self, doc: DoclingDocument, element: NodeItem) -> bool: - pass - - @abstractmethod - def prepare_element( - self, 
conv_res: ConversionResult, element: NodeItem - ) -> Optional[EnrichElementT]: - pass - - @abstractmethod - def __call__( - self, doc: DoclingDocument, element_batch: Iterable[EnrichElementT] - ) -> Iterable[NodeItem]: - pass - - -class BaseEnrichmentModel(GenericEnrichmentModel[NodeItem]): - - def prepare_element( - self, conv_res: ConversionResult, element: NodeItem - ) -> Optional[NodeItem]: - if self.is_processable(doc=conv_res.document, element=element): - return element - return None - - -class BaseItemAndImageEnrichmentModel( - GenericEnrichmentModel[ItemAndImageEnrichmentElement] -): - - images_scale: float - expansion_factor: float = 0.0 - - def prepare_element( - self, conv_res: ConversionResult, element: NodeItem - ) -> Optional[ItemAndImageEnrichmentElement]: - if not self.is_processable(doc=conv_res.document, element=element): - return None - - assert isinstance(element, DocItem) - element_prov = element.prov[0] - - bbox = element_prov.bbox - width = bbox.r - bbox.l - height = bbox.t - bbox.b - - # TODO: move to a utility in the BoundingBox class - expanded_bbox = BoundingBox( - l=bbox.l - width * self.expansion_factor, - t=bbox.t + height * self.expansion_factor, - r=bbox.r + width * self.expansion_factor, - b=bbox.b - height * self.expansion_factor, - coord_origin=bbox.coord_origin, - ) - - page_ix = element_prov.page_no - 1 - cropped_image = conv_res.pages[page_ix].get_image( - scale=self.images_scale, cropbox=expanded_bbox - ) - return ItemAndImageEnrichmentElement(item=element, image=cropped_image) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/base_ocr_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/base_ocr_model.py deleted file mode 100644 index 9afb7ddebe9572ddfc6687d5a4cd0bd0324a8066..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/base_ocr_model.py +++ /dev/null @@ -1,189 +0,0 @@ -import copy -import logging -from abc import abstractmethod -from pathlib 
import Path -from typing import Iterable, List - -import numpy as np -from docling_core.types.doc import BoundingBox, CoordOrigin -from PIL import Image, ImageDraw -from rtree import index -from scipy.ndimage import binary_dilation, find_objects, label - -from docling.datamodel.base_models import Cell, OcrCell, Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import OcrOptions -from docling.datamodel.settings import settings -from docling.models.base_model import BasePageModel - -_log = logging.getLogger(__name__) - - -class BaseOcrModel(BasePageModel): - def __init__(self, enabled: bool, options: OcrOptions): - self.enabled = enabled - self.options = options - - # Computes the optimal number and coordinates of rectangles to OCR on a given page - def get_ocr_rects(self, page: Page) -> List[BoundingBox]: - BITMAP_COVERAGE_TRESHOLD = 0.75 - assert page.size is not None - - def find_ocr_rects(size, bitmap_rects): - image = Image.new( - "1", (round(size.width), round(size.height)) - ) # '1' mode is binary - - # Draw all bitmap rects into a binary image - draw = ImageDraw.Draw(image) - for rect in bitmap_rects: - x0, y0, x1, y1 = rect.as_tuple() - x0, y0, x1, y1 = round(x0), round(y0), round(x1), round(y1) - draw.rectangle([(x0, y0), (x1, y1)], fill=1) - - np_image = np.array(image) - - # Dilate the image by 10 pixels to merge nearby bitmap rectangles - structure = np.ones( - (20, 20) - ) # Create a 20x20 structure element (10 pixels in all directions) - np_image = binary_dilation(np_image > 0, structure=structure) - - # Find the connected components - labeled_image, num_features = label( - np_image > 0 - ) # Label the non-zero (bitmap) regions - - # Find enclosing bounding boxes for each connected component.
- slices = find_objects(labeled_image) - bounding_boxes = [ - BoundingBox( - l=slc[1].start, - t=slc[0].start, - r=slc[1].stop - 1, - b=slc[0].stop - 1, - coord_origin=CoordOrigin.TOPLEFT, - ) - for slc in slices - ] - - # Compute area fraction on page covered by bitmaps - area_frac = np.sum(np_image > 0) / (size.width * size.height) - - return (area_frac, bounding_boxes) # fraction covered # boxes - - if page._backend is not None: - bitmap_rects = page._backend.get_bitmap_rects() - else: - bitmap_rects = [] - coverage, ocr_rects = find_ocr_rects(page.size, bitmap_rects) - - # return full-page rectangle if page is dominantly covered with bitmaps - if self.options.force_full_page_ocr or coverage > max( - BITMAP_COVERAGE_TRESHOLD, self.options.bitmap_area_threshold - ): - return [ - BoundingBox( - l=0, - t=0, - r=page.size.width, - b=page.size.height, - coord_origin=CoordOrigin.TOPLEFT, - ) - ] - # return individual rectangles if the bitmap coverage is above the threshold - elif coverage > self.options.bitmap_area_threshold: - return ocr_rects - else: # overall coverage of bitmaps is too low, drop all bitmap rectangles. - return [] - - # Filters OCR cells by dropping any OCR cell that intersects with an existing programmatic cell. - def _filter_ocr_cells(self, ocr_cells, programmatic_cells): - # Create R-tree index for programmatic cells - p = index.Property() - p.dimension = 2 - idx = index.Index(properties=p) - for i, cell in enumerate(programmatic_cells): - idx.insert(i, cell.bbox.as_tuple()) - - def is_overlapping_with_existing_cells(ocr_cell): - # Query the R-tree to get overlapping rectangles - possible_matches_index = list(idx.intersection(ocr_cell.bbox.as_tuple())) - - return ( - len(possible_matches_index) > 0 - ) # this is a weak criterion but it works. 
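The R-tree query above is, at its core, a plain rectangle-intersection test. The same filtering rule can be sketched without `rtree`, with a brute-force overlap check (fine for small cell counts; box tuples are `(l, t, r, b)` in a top-left origin):

```python
def intersects(a, b):
    # a and b are (l, t, r, b) tuples; touching rectangles count as
    # overlapping, matching the permissive criterion used above.
    return not (a[2] < b[0] or b[2] < a[0] or a[3] < b[1] or b[3] < a[1])


def filter_ocr_cells(ocr_boxes, programmatic_boxes):
    # Drop every OCR box that intersects an existing programmatic box.
    return [
        o for o in ocr_boxes
        if not any(intersects(o, p) for p in programmatic_boxes)
    ]


prog = [(0, 0, 10, 10)]
ocr = [(5, 5, 15, 15), (20, 20, 30, 30)]
print(filter_ocr_cells(ocr, prog))  # [(20, 20, 30, 30)]
```

The R-tree only changes the complexity of the candidate lookup, not the criterion: any intersection at all discards the OCR cell, which is the "weak but workable" heuristic the comment refers to.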
- - filtered_ocr_cells = [ - rect for rect in ocr_cells if not is_overlapping_with_existing_cells(rect) - ] - return filtered_ocr_cells - - def post_process_cells(self, ocr_cells, programmatic_cells): - r""" - Post-process the OCR and programmatic cells and return the final list of cells - """ - if self.options.force_full_page_ocr: - # If full-page OCR is forced, use only the OCR cells - cells = [ - Cell(id=c_ocr.id, text=c_ocr.text, bbox=c_ocr.bbox) - for c_ocr in ocr_cells - ] - return cells - - ## Remove OCR cells which overlap with programmatic cells. - filtered_ocr_cells = self._filter_ocr_cells(ocr_cells, programmatic_cells) - programmatic_cells.extend(filtered_ocr_cells) - return programmatic_cells - - def draw_ocr_rects_and_cells(self, conv_res, page, ocr_rects, show: bool = False): - image = copy.deepcopy(page.image) - scale_x = image.width / page.size.width - scale_y = image.height / page.size.height - - draw = ImageDraw.Draw(image, "RGBA") - - # Draw OCR rectangles as filled yellow rectangles - for rect in ocr_rects: - x0, y0, x1, y1 = rect.as_tuple() - y0 *= scale_y - y1 *= scale_y - x0 *= scale_x - x1 *= scale_x - - shade_color = (255, 255, 0, 40) # transparent yellow - draw.rectangle([(x0, y0), (x1, y1)], fill=shade_color, outline=None) - - # Draw OCR and programmatic cells - for tc in page.cells: - x0, y0, x1, y1 = tc.bbox.as_tuple() - y0 *= scale_y - y1 *= scale_y - x0 *= scale_x - x1 *= scale_x - - if y1 <= y0: - y1, y0 = y0, y1 - - color = "gray" - if isinstance(tc, OcrCell): - color = "magenta" - draw.rectangle([(x0, y0), (x1, y1)], outline=color) - - if show: - image.show() - else: - out_path: Path = ( - Path(settings.debug.debug_output_path) - / f"debug_{conv_res.input.file.stem}" - ) - out_path.mkdir(parents=True, exist_ok=True) - - out_file = out_path / f"ocr_page_{page.page_no:05}.png" - image.save(str(out_file), format="png") - - @abstractmethod - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> 
Iterable[Page]: - pass diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/code_formula_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/code_formula_model.py deleted file mode 100644 index 1a0f0bf010eb0533a8b218aea3ebeeaf17bb7301..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/code_formula_model.py +++ /dev/null @@ -1,251 +0,0 @@ -import re -from pathlib import Path -from typing import Iterable, List, Literal, Optional, Tuple, Union - -import numpy as np -from docling_core.types.doc import ( - CodeItem, - DocItemLabel, - DoclingDocument, - NodeItem, - TextItem, -) -from docling_core.types.doc.labels import CodeLanguageLabel -from PIL import Image -from pydantic import BaseModel - -from docling.datamodel.base_models import ItemAndImageEnrichmentElement -from docling.datamodel.pipeline_options import AcceleratorOptions -from docling.models.base_model import BaseItemAndImageEnrichmentModel -from docling.utils.accelerator_utils import decide_device - - -class CodeFormulaModelOptions(BaseModel): - """ - Configuration options for the CodeFormulaModel. - - Attributes - ---------- - kind : str - Type of the model. Fixed value "code_formula". - do_code_enrichment : bool - True if code enrichment is enabled, False otherwise. - do_formula_enrichment : bool - True if formula enrichment is enabled, False otherwise. - """ - - kind: Literal["code_formula"] = "code_formula" - do_code_enrichment: bool = True - do_formula_enrichment: bool = True - - -class CodeFormulaModel(BaseItemAndImageEnrichmentModel): - """ - Model for processing and enriching documents with code and formula predictions. - - Attributes - ---------- - enabled : bool - True if the model is enabled, False otherwise. - options : CodeFormulaModelOptions - Configuration options for the CodeFormulaModel. - code_formula_model : CodeFormulaPredictor - The predictor model for code and formula processing. 
- - Methods - ------- - __init__(self, enabled, artifacts_path, accelerator_options, code_formula_options) - Initializes the CodeFormulaModel with the given configuration options. - is_processable(self, doc, element) - Determines if a given element in a document can be processed by the model. - __call__(self, doc, element_batch) - Processes the given batch of elements and enriches them with predictions. - """ - - _model_repo_folder = "ds4sd--CodeFormula" - elements_batch_size = 5 - images_scale = 1.66 # = 120 dpi, aligned with training data resolution - expansion_factor = 0.03 - - def __init__( - self, - enabled: bool, - artifacts_path: Optional[Path], - options: CodeFormulaModelOptions, - accelerator_options: AcceleratorOptions, - ): - """ - Initializes the CodeFormulaModel with the given configuration. - - Parameters - ---------- - enabled : bool - True if the model is enabled, False otherwise. - artifacts_path : Path - Path to the directory containing the model artifacts. - options : CodeFormulaModelOptions - Configuration options for the model. - accelerator_options : AcceleratorOptions - Options specifying the device and number of threads for acceleration. 
-        """
-        self.enabled = enabled
-        self.options = options
-
-        if self.enabled:
-            device = decide_device(accelerator_options.device)
-
-            from docling_ibm_models.code_formula_model.code_formula_predictor import (
-                CodeFormulaPredictor,
-            )
-
-            if artifacts_path is None:
-                artifacts_path = self.download_models()
-            else:
-                artifacts_path = artifacts_path / self._model_repo_folder
-
-            self.code_formula_model = CodeFormulaPredictor(
-                artifacts_path=str(artifacts_path),
-                device=device,
-                num_threads=accelerator_options.num_threads,
-            )
-
-    @staticmethod
-    def download_models(
-        local_dir: Optional[Path] = None,
-        force: bool = False,
-        progress: bool = False,
-    ) -> Path:
-        from huggingface_hub import snapshot_download
-        from huggingface_hub.utils import disable_progress_bars
-
-        if not progress:
-            disable_progress_bars()
-        download_path = snapshot_download(
-            repo_id="ds4sd/CodeFormula",
-            force_download=force,
-            local_dir=local_dir,
-            revision="v1.0.1",
-        )
-
-        return Path(download_path)
-
-    def is_processable(self, doc: DoclingDocument, element: NodeItem) -> bool:
-        """
-        Determines if a given element in a document can be processed by the model.
-
-        Parameters
-        ----------
-        doc : DoclingDocument
-            The document being processed.
-        element : NodeItem
-            The element within the document to check.
-
-        Returns
-        -------
-        bool
-            True if the element can be processed, False otherwise.
-        """
-        return self.enabled and (
-            (isinstance(element, CodeItem) and self.options.do_code_enrichment)
-            or (
-                isinstance(element, TextItem)
-                and element.label == DocItemLabel.FORMULA
-                and self.options.do_formula_enrichment
-            )
-        )
-
-    def _extract_code_language(self, input_string: str) -> Tuple[str, Optional[str]]:
-        """Extracts a programming language from the beginning of a string.
-
-        This function checks if the input string starts with a pattern of the form
-        ``<_some_language_>``. If it does, it extracts the language string and returns
-        a tuple of (remainder, language). Otherwise, it returns the original string
-        and `None`.
-
-        Args:
-            input_string (str): The input string, which may start with ``<_language_>``.
-
-        Returns:
-            Tuple[str, Optional[str]]:
-                A tuple where:
-                - The first element is either:
-                    - The remainder of the string (everything after ``<_language_>``),
-                      if a match is found; or
-                    - The original string, if no match is found.
-                - The second element is the extracted language if a match is found;
-                  otherwise, `None`.
-        """
-        pattern = r"^<_([^>]+)_>\s*(.*)"
-        match = re.match(pattern, input_string, flags=re.DOTALL)
-        if match:
-            language = str(match.group(1))  # the captured programming language
-            remainder = str(match.group(2))  # everything after the <_language_>
-            return remainder, language
-        else:
-            return input_string, None
-
-    def _get_code_language_enum(self, value: Optional[str]) -> CodeLanguageLabel:
-        """
-        Converts a string to a corresponding `CodeLanguageLabel` enum member.
-
-        If the provided string does not match any value in `CodeLanguageLabel`,
-        it defaults to `CodeLanguageLabel.UNKNOWN`.
-
-        Args:
-            value (Optional[str]): The string representation of the code language or None.
-
-        Returns:
-            CodeLanguageLabel: The corresponding enum member if the value is valid,
-            otherwise `CodeLanguageLabel.UNKNOWN`.
-        """
-        if not isinstance(value, str):
-            return CodeLanguageLabel.UNKNOWN
-
-        try:
-            return CodeLanguageLabel(value)
-        except ValueError:
-            return CodeLanguageLabel.UNKNOWN
-
-    def __call__(
-        self,
-        doc: DoclingDocument,
-        element_batch: Iterable[ItemAndImageEnrichmentElement],
-    ) -> Iterable[NodeItem]:
-        """
-        Processes the given batch of elements and enriches them with predictions.
-
-        Parameters
-        ----------
-        doc : DoclingDocument
-            The document being processed.
-        element_batch : Iterable[ItemAndImageEnrichmentElement]
-            A batch of elements to be processed.
-
-        Returns
-        -------
-        Iterable[NodeItem]
-            An iterable of enriched elements.
-        """
-        if not self.enabled:
-            for element in element_batch:
-                yield element.item
-            return
-
-        labels: List[str] = []
-        images: List[Union[Image.Image, np.ndarray]] = []
-        elements: List[TextItem] = []
-        for el in element_batch:
-            assert isinstance(el.item, TextItem)
-            elements.append(el.item)
-            labels.append(el.item.label)
-            images.append(el.image)
-
-        outputs = self.code_formula_model.predict(images, labels)
-
-        for item, output in zip(elements, outputs):
-            if isinstance(item, CodeItem):
-                output, code_language = self._extract_code_language(output)
-                item.code_language = self._get_code_language_enum(code_language)
-            item.text = output
-
-            yield item
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/document_picture_classifier.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/document_picture_classifier.py
deleted file mode 100644
index 6e71246b019809a9d4a60f57c3ed0669c62cc178..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/document_picture_classifier.py
+++ /dev/null
@@ -1,190 +0,0 @@
-from pathlib import Path
-from typing import Iterable, List, Literal, Optional, Tuple, Union
-
-import numpy as np
-from docling_core.types.doc import (
-    DoclingDocument,
-    NodeItem,
-    PictureClassificationClass,
-    PictureClassificationData,
-    PictureItem,
-)
-from PIL import Image
-from pydantic import BaseModel
-
-from docling.datamodel.pipeline_options import AcceleratorOptions
-from docling.models.base_model import BaseEnrichmentModel
-from docling.utils.accelerator_utils import decide_device
-
-
-class DocumentPictureClassifierOptions(BaseModel):
-    """
-    Options for configuring the DocumentPictureClassifier.
-
-    Attributes
-    ----------
-    kind : Literal["document_picture_classifier"]
-        Identifier for the type of classifier.
-    """
-
-    kind: Literal["document_picture_classifier"] = "document_picture_classifier"
-
-
-class DocumentPictureClassifier(BaseEnrichmentModel):
-    """
-    A model for classifying pictures in documents.
-
-    This class enriches document pictures with predicted classifications
-    based on a predefined set of classes.
-
-    Attributes
-    ----------
-    enabled : bool
-        Whether the classifier is enabled for use.
-    options : DocumentPictureClassifierOptions
-        Configuration options for the classifier.
-    document_picture_classifier : DocumentFigureClassifierPredictor
-        The underlying prediction model, loaded if the classifier is enabled.
-
-    Methods
-    -------
-    __init__(enabled, artifacts_path, options, accelerator_options)
-        Initializes the classifier with specified configurations.
-    is_processable(doc, element)
-        Checks if the given element can be processed by the classifier.
-    __call__(doc, element_batch)
-        Processes a batch of elements and adds classification annotations.
-    """
-
-    _model_repo_folder = "ds4sd--DocumentFigureClassifier"
-    images_scale = 2
-
-    def __init__(
-        self,
-        enabled: bool,
-        artifacts_path: Optional[Path],
-        options: DocumentPictureClassifierOptions,
-        accelerator_options: AcceleratorOptions,
-    ):
-        """
-        Initializes the DocumentPictureClassifier.
-
-        Parameters
-        ----------
-        enabled : bool
-            Indicates whether the classifier is enabled.
-        artifacts_path : Optional[Path]
-            Path to the directory containing model artifacts.
-        options : DocumentPictureClassifierOptions
-            Configuration options for the classifier.
-        accelerator_options : AcceleratorOptions
-            Options for configuring the device and parallelism.
-        """
-        self.enabled = enabled
-        self.options = options
-
-        if self.enabled:
-            device = decide_device(accelerator_options.device)
-            from docling_ibm_models.document_figure_classifier_model.document_figure_classifier_predictor import (
-                DocumentFigureClassifierPredictor,
-            )
-
-            if artifacts_path is None:
-                artifacts_path = self.download_models()
-            else:
-                artifacts_path = artifacts_path / self._model_repo_folder
-
-            self.document_picture_classifier = DocumentFigureClassifierPredictor(
-                artifacts_path=str(artifacts_path),
-                device=device,
-                num_threads=accelerator_options.num_threads,
-            )
-
-    @staticmethod
-    def download_models(
-        local_dir: Optional[Path] = None, force: bool = False, progress: bool = False
-    ) -> Path:
-        from huggingface_hub import snapshot_download
-        from huggingface_hub.utils import disable_progress_bars
-
-        if not progress:
-            disable_progress_bars()
-        download_path = snapshot_download(
-            repo_id="ds4sd/DocumentFigureClassifier",
-            force_download=force,
-            local_dir=local_dir,
-            revision="v1.0.0",
-        )
-
-        return Path(download_path)
-
-    def is_processable(self, doc: DoclingDocument, element: NodeItem) -> bool:
-        """
-        Determines if the given element can be processed by the classifier.
-
-        Parameters
-        ----------
-        doc : DoclingDocument
-            The document containing the element.
-        element : NodeItem
-            The element to be checked.
-
-        Returns
-        -------
-        bool
-            True if the element is a PictureItem and processing is enabled; False otherwise.
-        """
-        return self.enabled and isinstance(element, PictureItem)
-
-    def __call__(
-        self,
-        doc: DoclingDocument,
-        element_batch: Iterable[NodeItem],
-    ) -> Iterable[NodeItem]:
-        """
-        Processes a batch of elements and enriches them with classification predictions.
-
-        Parameters
-        ----------
-        doc : DoclingDocument
-            The document containing the elements to be processed.
-        element_batch : Iterable[NodeItem]
-            A batch of pictures to classify.
-
-        Returns
-        -------
-        Iterable[NodeItem]
-            An iterable of NodeItem objects after processing. The field
-            'data.classification' is added containing the classification for each picture.
-        """
-        if not self.enabled:
-            for element in element_batch:
-                yield element
-            return
-
-        images: List[Union[Image.Image, np.ndarray]] = []
-        elements: List[PictureItem] = []
-        for el in element_batch:
-            assert isinstance(el, PictureItem)
-            elements.append(el)
-            img = el.get_image(doc)
-            assert img is not None
-            images.append(img)
-
-        outputs = self.document_picture_classifier.predict(images)
-
-        for element, output in zip(elements, outputs):
-            element.annotations.append(
-                PictureClassificationData(
-                    provenance="DocumentPictureClassifier",
-                    predicted_classes=[
-                        PictureClassificationClass(
-                            class_name=pred[0],
-                            confidence=pred[1],
-                        )
-                        for pred in output
-                    ],
-                )
-            )
-
-            yield element
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/ds_glm_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/ds_glm_model.py
deleted file mode 100644
index 5d4c6eee73a0308351288dedc9ff8d86f221b395..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/ds_glm_model.py
+++ /dev/null
@@ -1,386 +0,0 @@
-import copy
-import random
-from pathlib import Path
-from typing import List, Union
-
-from deepsearch_glm.andromeda_nlp import nlp_model
-from docling_core.types.doc import (
-    BoundingBox,
-    CoordOrigin,
-    DocItemLabel,
-    DoclingDocument,
-)
-from docling_core.types.legacy_doc.base import BoundingBox as DsBoundingBox
-from docling_core.types.legacy_doc.base import (
-    Figure,
-    PageDimensions,
-    PageReference,
-    Prov,
-    Ref,
-)
-from docling_core.types.legacy_doc.base import Table as DsSchemaTable
-from docling_core.types.legacy_doc.base import TableCell
-from docling_core.types.legacy_doc.document import BaseText
-from docling_core.types.legacy_doc.document import (
-    CCSDocumentDescription as DsDocumentDescription,
-)
-from docling_core.types.legacy_doc.document import CCSFileInfoObject as DsFileInfoObject
-from docling_core.types.legacy_doc.document import ExportedCCSDocument as DsDocument
-from PIL import ImageDraw
-from pydantic import BaseModel, ConfigDict, TypeAdapter
-
-from docling.datamodel.base_models import (
-    Cluster,
-    ContainerElement,
-    FigureElement,
-    Table,
-    TextElement,
-)
-from docling.datamodel.document import ConversionResult, layout_label_to_ds_type
-from docling.datamodel.settings import settings
-from docling.utils.glm_utils import to_docling_document
-from docling.utils.profiling import ProfilingScope, TimeRecorder
-from docling.utils.utils import create_hash
-
-
-class GlmOptions(BaseModel):
-    model_config = ConfigDict(protected_namespaces=())
-
-    model_names: str = ""  # e.g. "language;term;reference"
-
-
-class GlmModel:
-    def __init__(self, options: GlmOptions):
-        self.options = options
-
-        self.model = nlp_model(loglevel="error", text_ordering=True)
-
-    def _to_legacy_document(self, conv_res) -> DsDocument:
-        title = ""
-        desc: DsDocumentDescription = DsDocumentDescription(logs=[])
-
-        page_hashes = [
-            PageReference(
-                hash=create_hash(conv_res.input.document_hash + ":" + str(p.page_no)),
-                page=p.page_no + 1,
-                model="default",
-            )
-            for p in conv_res.pages
-        ]
-
-        file_info = DsFileInfoObject(
-            filename=conv_res.input.file.name,
-            document_hash=conv_res.input.document_hash,
-            num_pages=conv_res.input.page_count,
-            page_hashes=page_hashes,
-        )
-
-        main_text: List[Union[Ref, BaseText]] = []
-        page_headers: List[Union[Ref, BaseText]] = []
-        page_footers: List[Union[Ref, BaseText]] = []
-
-        tables: List[DsSchemaTable] = []
-        figures: List[Figure] = []
-
-        page_no_to_page = {p.page_no: p for p in conv_res.pages}
-
-        for element in conv_res.assembled.body:
-            # Convert bboxes to lower-left origin.
-            target_bbox = DsBoundingBox(
-                element.cluster.bbox.to_bottom_left_origin(
-                    page_no_to_page[element.page_no].size.height
-                ).as_tuple()
-            )
-
-            if isinstance(element, TextElement):
-                main_text.append(
-                    BaseText(
-                        text=element.text,
-                        obj_type=layout_label_to_ds_type.get(element.label),
-                        name=element.label,
-                        prov=[
-                            Prov(
-                                bbox=target_bbox,
-                                page=element.page_no + 1,
-                                span=[0, len(element.text)],
-                            )
-                        ],
-                    )
-                )
-            elif isinstance(element, Table):
-                index = len(tables)
-                ref_str = f"#/tables/{index}"
-                main_text.append(
-                    Ref(
-                        name=element.label,
-                        obj_type=layout_label_to_ds_type.get(element.label),
-                        ref=ref_str,
-                    ),
-                )
-
-                # Initialise empty table data grid (only empty cells)
-                table_data = [
-                    [
-                        TableCell(
-                            text="",
-                            # bbox=[0,0,0,0],
-                            spans=[[i, j]],
-                            obj_type="body",
-                        )
-                        for j in range(element.num_cols)
-                    ]
-                    for i in range(element.num_rows)
-                ]
-
-                # Overwrite cells in table data for which there is actual cell content.
-                for cell in element.table_cells:
-                    for i in range(
-                        min(cell.start_row_offset_idx, element.num_rows),
-                        min(cell.end_row_offset_idx, element.num_rows),
-                    ):
-                        for j in range(
-                            min(cell.start_col_offset_idx, element.num_cols),
-                            min(cell.end_col_offset_idx, element.num_cols),
-                        ):
-                            celltype = "body"
-                            if cell.column_header:
-                                celltype = "col_header"
-                            elif cell.row_header:
-                                celltype = "row_header"
-                            elif cell.row_section:
-                                celltype = "row_section"
-
-                            def make_spans(cell):
-                                for rspan in range(
-                                    min(cell.start_row_offset_idx, element.num_rows),
-                                    min(cell.end_row_offset_idx, element.num_rows),
-                                ):
-                                    for cspan in range(
-                                        min(
-                                            cell.start_col_offset_idx, element.num_cols
-                                        ),
-                                        min(cell.end_col_offset_idx, element.num_cols),
-                                    ):
-                                        yield [rspan, cspan]
-
-                            spans = list(make_spans(cell))
-                            if cell.bbox is not None:
-                                bbox = cell.bbox.to_bottom_left_origin(
-                                    page_no_to_page[element.page_no].size.height
-                                ).as_tuple()
-                            else:
-                                bbox = None
-
-                            table_data[i][j] = TableCell(
-                                text=cell.text,
-                                bbox=bbox,
-                                # col=j,
-                                # row=i,
-                                spans=spans,
-                                obj_type=celltype,
-                                # col_span=[cell.start_col_offset_idx, cell.end_col_offset_idx],
-                                # row_span=[cell.start_row_offset_idx, cell.end_row_offset_idx]
-                            )
-
-                tables.append(
-                    DsSchemaTable(
-                        num_cols=element.num_cols,
-                        num_rows=element.num_rows,
-                        obj_type=layout_label_to_ds_type.get(element.label),
-                        data=table_data,
-                        prov=[
-                            Prov(
-                                bbox=target_bbox,
-                                page=element.page_no + 1,
-                                span=[0, 0],
-                            )
-                        ],
-                    )
-                )
-
-            elif isinstance(element, FigureElement):
-                index = len(figures)
-                ref_str = f"#/figures/{index}"
-                main_text.append(
-                    Ref(
-                        name=element.label,
-                        obj_type=layout_label_to_ds_type.get(element.label),
-                        ref=ref_str,
-                    ),
-                )
-                figures.append(
-                    Figure(
-                        prov=[
-                            Prov(
-                                bbox=target_bbox,
-                                page=element.page_no + 1,
-                                span=[0, 0],
-                            )
-                        ],
-                        obj_type=layout_label_to_ds_type.get(element.label),
-                        payload={
-                            "children": TypeAdapter(List[Cluster]).dump_python(
-                                element.cluster.children
-                            )
-                        },  # hack to channel child clusters through GLM
-                    )
-                )
-            elif isinstance(element, ContainerElement):
-                main_text.append(
-                    BaseText(
-                        text="",
-                        payload={
-                            "children": TypeAdapter(List[Cluster]).dump_python(
-                                element.cluster.children
-                            )
-                        },  # hack to channel child clusters through GLM
-                        obj_type=layout_label_to_ds_type.get(element.label),
-                        name=element.label,
-                        prov=[
-                            Prov(
-                                bbox=target_bbox,
-                                page=element.page_no + 1,
-                                span=[0, 0],
-                            )
-                        ],
-                    )
-                )
-
-        # We can throw in headers and footers at the end of the legacy doc
-        # since the reading-order will re-sort it later.
-        for element in conv_res.assembled.headers:
-            # Convert bboxes to lower-left origin.
-            target_bbox = DsBoundingBox(
-                element.cluster.bbox.to_bottom_left_origin(
-                    page_no_to_page[element.page_no].size.height
-                ).as_tuple()
-            )
-
-            if isinstance(element, TextElement):
-
-                tel = BaseText(
-                    text=element.text,
-                    obj_type=layout_label_to_ds_type.get(element.label),
-                    name=element.label,
-                    prov=[
-                        Prov(
-                            bbox=target_bbox,
-                            page=element.page_no + 1,
-                            span=[0, len(element.text)],
-                        )
-                    ],
-                )
-                if element.label == DocItemLabel.PAGE_HEADER:
-                    index = len(page_headers)
-                    ref_str = f"#/page-headers/{index}"
-                    main_text.append(
-                        Ref(
-                            name=element.label,
-                            obj_type=layout_label_to_ds_type.get(element.label),
-                            ref=ref_str,
-                        ),
-                    )
-                    page_headers.append(tel)
-                elif element.label == DocItemLabel.PAGE_FOOTER:
-                    index = len(page_footers)
-                    ref_str = f"#/page-footers/{index}"
-                    main_text.append(
-                        Ref(
-                            name=element.label,
-                            obj_type=layout_label_to_ds_type.get(element.label),
-                            ref=ref_str,
-                        ),
-                    )
-                    page_footers.append(tel)
-
-        page_dimensions = [
-            PageDimensions(page=p.page_no + 1, height=p.size.height, width=p.size.width)
-            for p in conv_res.pages
-            if p.size is not None
-        ]
-
-        ds_doc: DsDocument = DsDocument(
-            name=title,
-            description=desc,
-            file_info=file_info,
-            main_text=main_text,
-            tables=tables,
-            figures=figures,
-            page_dimensions=page_dimensions,
-            page_headers=page_headers,
-            page_footers=page_footers,
-        )
-
-        return ds_doc
-
-    def __call__(self, conv_res: ConversionResult) -> DoclingDocument:
-        with TimeRecorder(conv_res, "glm", scope=ProfilingScope.DOCUMENT):
-            ds_doc = self._to_legacy_document(conv_res)
-            ds_doc_dict = ds_doc.model_dump(by_alias=True, exclude_none=True)
-
-            glm_doc = self.model.apply_on_doc(ds_doc_dict)
-
-            docling_doc: DoclingDocument = to_docling_document(glm_doc)  # Experimental
-
-        # DEBUG code:
-        def draw_clusters_and_cells(ds_document, page_no, show: bool = False):
-            clusters_to_draw = []
-            image = copy.deepcopy(conv_res.pages[page_no].image)
-            for ix, elem in enumerate(ds_document.main_text):
-                if isinstance(elem, BaseText):
-                    prov = elem.prov[0]  # type: ignore
-                elif isinstance(elem, Ref):
-                    _, arr, index = elem.ref.split("/")
-                    index = int(index)  # type: ignore
-                    if arr == "tables":
-                        prov = ds_document.tables[index].prov[0]
-                    elif arr == "figures":
-                        prov = ds_document.pictures[index].prov[0]
-                    else:
-                        prov = None
-
-                if prov and prov.page == page_no:
-                    clusters_to_draw.append(
-                        Cluster(
-                            id=ix,
-                            label=elem.name,
-                            bbox=BoundingBox.from_tuple(
-                                coord=prov.bbox,  # type: ignore
-                                origin=CoordOrigin.BOTTOMLEFT,
-                            ).to_top_left_origin(conv_res.pages[page_no].size.height),
-                        )
-                    )
-
-            draw = ImageDraw.Draw(image)
-            for c in clusters_to_draw:
-                x0, y0, x1, y1 = c.bbox.as_tuple()
-                draw.rectangle([(x0, y0), (x1, y1)], outline="red")
-                draw.text((x0 + 2, y0 + 2), f"{c.id}:{c.label}", fill=(255, 0, 0, 255))
-
-                cell_color = (
-                    random.randint(30, 140),
-                    random.randint(30, 140),
-                    random.randint(30, 140),
-                )
-                for tc in c.cells:  # [:1]:
-                    x0, y0, x1, y1 = tc.bbox.as_tuple()
-                    draw.rectangle([(x0, y0), (x1, y1)], outline=cell_color)
-
-            if show:
-                image.show()
-            else:
-                out_path: Path = (
-                    Path(settings.debug.debug_output_path)
-                    / f"debug_{conv_res.input.file.stem}"
-                )
-                out_path.mkdir(parents=True, exist_ok=True)
-
-                out_file = out_path / f"doc_page_{page_no:05}.png"
-                image.save(str(out_file), format="png")
-
-        # for item in ds_doc.page_dimensions:
-        #     page_no = item.page
-        #     draw_clusters_and_cells(ds_doc, page_no)
-
-        return docling_doc
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/easyocr_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/easyocr_model.py
deleted file mode 100644
index 0eccb9885d7a020c02d04798d60a24ca1abdb014..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/easyocr_model.py
+++ /dev/null
@@ -1,177 +0,0 @@
-import logging
-import warnings
-import zipfile
-from pathlib import Path
-from typing import Iterable, List, Optional
-
-import numpy
-from docling_core.types.doc import BoundingBox, CoordOrigin
-
-from docling.datamodel.base_models import Cell, OcrCell, Page
-from docling.datamodel.document import ConversionResult
-from docling.datamodel.pipeline_options import (
-    AcceleratorDevice,
-    AcceleratorOptions,
-    EasyOcrOptions,
-)
-from docling.datamodel.settings import settings
-from docling.models.base_ocr_model import BaseOcrModel
-from docling.utils.accelerator_utils import decide_device
-from docling.utils.profiling import TimeRecorder
-from docling.utils.utils import download_url_with_progress
-
-_log = logging.getLogger(__name__)
-
-
-class EasyOcrModel(BaseOcrModel):
-    _model_repo_folder = "EasyOcr"
-
-    def __init__(
-        self,
-        enabled: bool,
-        artifacts_path: Optional[Path],
-        options: EasyOcrOptions,
-        accelerator_options: AcceleratorOptions,
-    ):
-        super().__init__(enabled=enabled, options=options)
-        self.options: EasyOcrOptions
-
-        self.scale = 3  # multiplier for 72 dpi == 216 dpi.
-
-        if self.enabled:
-            try:
-                import easyocr
-            except ImportError:
-                raise ImportError(
-                    "EasyOCR is not installed. Please install it via `pip install easyocr` to use this OCR engine. "
-                    "Alternatively, Docling has support for other OCR engines. See the documentation."
-                )
-
-            if self.options.use_gpu is None:
-                device = decide_device(accelerator_options.device)
-                # Enable easyocr GPU if running on CUDA, MPS
-                use_gpu = any(
-                    [
-                        device.startswith(x)
-                        for x in [
-                            AcceleratorDevice.CUDA.value,
-                            AcceleratorDevice.MPS.value,
-                        ]
-                    ]
-                )
-            else:
-                warnings.warn(
-                    "Deprecated field. Better to set the `accelerator_options.device` in `pipeline_options`. "
-                    "When `use_gpu and accelerator_options.device == AcceleratorDevice.CUDA` the GPU is used "
-                    "to run EasyOCR. Otherwise, EasyOCR runs on CPU."
-                )
-                use_gpu = self.options.use_gpu
-
-            download_enabled = self.options.download_enabled
-            model_storage_directory = self.options.model_storage_directory
-            if artifacts_path is not None and model_storage_directory is None:
-                download_enabled = False
-                model_storage_directory = str(artifacts_path / self._model_repo_folder)
-
-            self.reader = easyocr.Reader(
-                lang_list=self.options.lang,
-                gpu=use_gpu,
-                model_storage_directory=model_storage_directory,
-                recog_network=self.options.recog_network,
-                download_enabled=download_enabled,
-                verbose=False,
-            )
-
-    @staticmethod
-    def download_models(
-        detection_models: List[str] = ["craft"],
-        recognition_models: List[str] = ["english_g2", "latin_g2"],
-        local_dir: Optional[Path] = None,
-        force: bool = False,
-        progress: bool = False,
-    ) -> Path:
-        # Models are located in https://github.com/JaidedAI/EasyOCR/blob/master/easyocr/config.py
-        from easyocr.config import detection_models as det_models_dict
-        from easyocr.config import recognition_models as rec_models_dict
-
-        if local_dir is None:
-            local_dir = settings.cache_dir / "models" / EasyOcrModel._model_repo_folder
-
-        local_dir.mkdir(parents=True, exist_ok=True)
-
-        # Collect models to download
-        download_list = []
-        for model_name in detection_models:
-            if model_name in det_models_dict:
-                download_list.append(det_models_dict[model_name])
-        for model_name in recognition_models:
-            if model_name in rec_models_dict["gen2"]:
-                download_list.append(rec_models_dict["gen2"][model_name])
-
-        # Download models
-        for model_details in download_list:
-            buf = download_url_with_progress(model_details["url"], progress=progress)
-            with zipfile.ZipFile(buf, "r") as zip_ref:
-                zip_ref.extractall(local_dir)
-
-        return local_dir
-
-    def __call__(
-        self, conv_res: ConversionResult, page_batch: Iterable[Page]
-    ) -> Iterable[Page]:
-
-        if not self.enabled:
-            yield from page_batch
-            return
-
-        for page in page_batch:
-
-            assert page._backend is not None
-            if not page._backend.is_valid():
-                yield page
-            else:
-                with TimeRecorder(conv_res, "ocr"):
-                    ocr_rects = self.get_ocr_rects(page)
-
-                    all_ocr_cells = []
-                    for ocr_rect in ocr_rects:
-                        # Skip zero area boxes
-                        if ocr_rect.area() == 0:
-                            continue
-                        high_res_image = page._backend.get_page_image(
-                            scale=self.scale, cropbox=ocr_rect
-                        )
-                        im = numpy.array(high_res_image)
-                        result = self.reader.readtext(im)
-
-                        del high_res_image
-                        del im
-
-                        cells = [
-                            OcrCell(
-                                id=ix,
-                                text=line[1],
-                                confidence=line[2],
-                                bbox=BoundingBox.from_tuple(
-                                    coord=(
-                                        (line[0][0][0] / self.scale) + ocr_rect.l,
-                                        (line[0][0][1] / self.scale) + ocr_rect.t,
-                                        (line[0][2][0] / self.scale) + ocr_rect.l,
-                                        (line[0][2][1] / self.scale) + ocr_rect.t,
-                                    ),
-                                    origin=CoordOrigin.TOPLEFT,
-                                ),
-                            )
-                            for ix, line in enumerate(result)
-                            if line[2] >= self.options.confidence_threshold
-                        ]
-                        all_ocr_cells.extend(cells)
-
-                    # Post-process the cells
-                    page.cells = self.post_process_cells(all_ocr_cells, page.cells)
-
-                # DEBUG code:
-                if settings.debug.visualize_ocr:
-                    self.draw_ocr_rects_and_cells(conv_res, page, ocr_rects)
-
-            yield page
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/layout_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/layout_model.py
deleted file mode 100644
index b3cbd954a2f82873c080be3ff263fa560fb7e70b..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/layout_model.py
+++ /dev/null
@@ -1,197 +0,0 @@
-import copy
-import logging
-import warnings
-from pathlib import Path
-from typing import Iterable, Optional, Union
-
-from docling_core.types.doc import DocItemLabel
-from docling_ibm_models.layoutmodel.layout_predictor import LayoutPredictor
-from PIL import Image
-
-from docling.datamodel.base_models import BoundingBox, Cluster, LayoutPrediction, Page
-from docling.datamodel.document import ConversionResult
-from docling.datamodel.pipeline_options import AcceleratorOptions
-from docling.datamodel.settings import settings
-from docling.models.base_model import BasePageModel
-from docling.utils.accelerator_utils import decide_device
-from docling.utils.layout_postprocessor import LayoutPostprocessor
-from docling.utils.profiling import TimeRecorder
-from docling.utils.visualization import draw_clusters
-
-_log = logging.getLogger(__name__)
-
-
-class LayoutModel(BasePageModel):
-    _model_repo_folder = "ds4sd--docling-models"
-    _model_path = "model_artifacts/layout"
-
-    TEXT_ELEM_LABELS = [
-        DocItemLabel.TEXT,
-        DocItemLabel.FOOTNOTE,
-        DocItemLabel.CAPTION,
-        DocItemLabel.CHECKBOX_UNSELECTED,
-        DocItemLabel.CHECKBOX_SELECTED,
-        DocItemLabel.SECTION_HEADER,
-        DocItemLabel.PAGE_HEADER,
-        DocItemLabel.PAGE_FOOTER,
-        DocItemLabel.CODE,
-        DocItemLabel.LIST_ITEM,
-        DocItemLabel.FORMULA,
-    ]
-    PAGE_HEADER_LABELS = [DocItemLabel.PAGE_HEADER, DocItemLabel.PAGE_FOOTER]
-
-    TABLE_LABELS = [DocItemLabel.TABLE, DocItemLabel.DOCUMENT_INDEX]
-    FIGURE_LABEL = DocItemLabel.PICTURE
-    FORMULA_LABEL = DocItemLabel.FORMULA
-    CONTAINER_LABELS = [DocItemLabel.FORM, DocItemLabel.KEY_VALUE_REGION]
-
-    def __init__(
-        self, artifacts_path: Optional[Path], accelerator_options: AcceleratorOptions
-    ):
-        device = decide_device(accelerator_options.device)
-
-        if artifacts_path is None:
-            artifacts_path = self.download_models() / self._model_path
-        else:
-            # will become the default in the future
-            if (artifacts_path / self._model_repo_folder).exists():
-                artifacts_path = (
-                    artifacts_path / self._model_repo_folder / self._model_path
-                )
-            elif (artifacts_path / self._model_path).exists():
-                warnings.warn(
-                    "The usage of artifacts_path containing directly "
-                    f"{self._model_path} is deprecated. Please point "
-                    "the artifacts_path to the parent containing "
-                    f"the {self._model_repo_folder} folder.",
-                    DeprecationWarning,
-                    stacklevel=3,
-                )
-                artifacts_path = artifacts_path / self._model_path
-
-        self.layout_predictor = LayoutPredictor(
-            artifact_path=str(artifacts_path),
-            device=device,
-            num_threads=accelerator_options.num_threads,
-        )
-
-    @staticmethod
-    def download_models(
-        local_dir: Optional[Path] = None,
-        force: bool = False,
-        progress: bool = False,
-    ) -> Path:
-        from huggingface_hub import snapshot_download
-        from huggingface_hub.utils import disable_progress_bars
-
-        if not progress:
-            disable_progress_bars()
-        download_path = snapshot_download(
-            repo_id="ds4sd/docling-models",
-            force_download=force,
-            local_dir=local_dir,
-            revision="v2.1.0",
-        )
-
-        return Path(download_path)
-
-    def draw_clusters_and_cells_side_by_side(
-        self, conv_res, page, clusters, mode_prefix: str, show: bool = False
-    ):
-        """
-        Draws a page image side by side with clusters filtered into two categories:
-        - Left: Clusters excluding FORM, KEY_VALUE_REGION, and PICTURE.
-        - Right: Clusters including FORM, KEY_VALUE_REGION, and PICTURE.
-        Includes label names and confidence scores for each cluster.
-        """
-        scale_x = page.image.width / page.size.width
-        scale_y = page.image.height / page.size.height
-
-        # Filter clusters for left and right images
-        exclude_labels = {
-            DocItemLabel.FORM,
-            DocItemLabel.KEY_VALUE_REGION,
-            DocItemLabel.PICTURE,
-        }
-        left_clusters = [c for c in clusters if c.label not in exclude_labels]
-        right_clusters = [c for c in clusters if c.label in exclude_labels]
-        # Create a deep copy of the original image for both sides
-        left_image = copy.deepcopy(page.image)
-        right_image = copy.deepcopy(page.image)
-
-        # Draw clusters on both images
-        draw_clusters(left_image, left_clusters, scale_x, scale_y)
-        draw_clusters(right_image, right_clusters, scale_x, scale_y)
-        # Combine the images side by side
-        combined_width = left_image.width * 2
-        combined_height = left_image.height
-        combined_image = Image.new("RGB", (combined_width, combined_height))
-        combined_image.paste(left_image, (0, 0))
-        combined_image.paste(right_image, (left_image.width, 0))
-        if show:
-            combined_image.show()
-        else:
-            out_path: Path = (
-                Path(settings.debug.debug_output_path)
-                / f"debug_{conv_res.input.file.stem}"
-            )
-            out_path.mkdir(parents=True, exist_ok=True)
-            out_file = out_path / f"{mode_prefix}_layout_page_{page.page_no:05}.png"
-            combined_image.save(str(out_file), format="png")
-
-    def __call__(
-        self, conv_res: ConversionResult, page_batch: Iterable[Page]
-    ) -> Iterable[Page]:
-
-        for page in page_batch:
-            assert page._backend is not None
-            if not page._backend.is_valid():
-                yield page
-            else:
-                with TimeRecorder(conv_res, "layout"):
-                    assert page.size is not None
-                    page_image = page.get_image(scale=1.0)
-                    assert page_image is not None
-
-                    clusters = []
-                    for ix, pred_item in enumerate(
-                        self.layout_predictor.predict(page_image)
-                    ):
-                        label = DocItemLabel(
-                            pred_item["label"]
-                            .lower()
-                            .replace(" ", "_")
-                            .replace("-", "_")
-                        )  # Temporary, until docling-ibm-model uses docling-core types
-                        cluster = Cluster(
-                            id=ix,
-                            label=label,
-                            confidence=pred_item["confidence"],
-                            bbox=BoundingBox.model_validate(pred_item),
-                            cells=[],
-                        )
-                        clusters.append(cluster)
-
-                    if settings.debug.visualize_raw_layout:
-                        self.draw_clusters_and_cells_side_by_side(
-                            conv_res, page, clusters, mode_prefix="raw"
-                        )
-
-                    # Apply postprocessing
-
-                    processed_clusters, processed_cells = LayoutPostprocessor(
-                        page.cells, clusters, page.size
-                    ).postprocess()
-                    # processed_clusters, processed_cells = clusters, page.cells
-
-                    page.cells = processed_cells
-                    page.predictions.layout = LayoutPrediction(
-                        clusters=processed_clusters
-                    )
-
-                if settings.debug.visualize_layout:
-                    self.draw_clusters_and_cells_side_by_side(
-                        conv_res, page, processed_clusters, mode_prefix="postprocessed"
-                    )
-
-                yield page
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/ocr_mac_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/ocr_mac_model.py
deleted file mode 100644
index 38bcf1ca724ee286026d0861de069b2c7d4652f8..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/ocr_mac_model.py
+++ /dev/null
@@ -1,118 +0,0 @@
-import logging
-import tempfile
-from typing import Iterable, Optional, Tuple
-
-from docling_core.types.doc import BoundingBox, CoordOrigin
-
-from docling.datamodel.base_models import OcrCell, Page
-from docling.datamodel.document import ConversionResult
-from docling.datamodel.pipeline_options import OcrMacOptions
-from docling.datamodel.settings import settings
-from docling.models.base_ocr_model import BaseOcrModel
-from docling.utils.profiling import TimeRecorder
-
-_log = logging.getLogger(__name__)
-
-
-class OcrMacModel(BaseOcrModel):
-    def __init__(self, enabled: bool, options: OcrMacOptions):
-        super().__init__(enabled=enabled, options=options)
-        self.options: OcrMacOptions
-
-        self.scale = 3  # multiplier for 72 dpi == 216 dpi.
-
-        if self.enabled:
-            install_errmsg = (
-                "ocrmac is not correctly installed. "
-                "Please install it via `pip install ocrmac` to use this OCR engine. "
-                "Alternatively, Docling has support for other OCR engines. See the documentation: "
-                "https://ds4sd.github.io/docling/installation/"
-            )
-            try:
-                from ocrmac import ocrmac
-            except ImportError:
-                raise ImportError(install_errmsg)
-
-            self.reader_RIL = ocrmac.OCR
-
-    def __call__(
-        self, conv_res: ConversionResult, page_batch: Iterable[Page]
-    ) -> Iterable[Page]:
-
-        if not self.enabled:
-            yield from page_batch
-            return
-
-        for page in page_batch:
-            assert page._backend is not None
-            if not page._backend.is_valid():
-                yield page
-            else:
-                with TimeRecorder(conv_res, "ocr"):
-
-                    ocr_rects = self.get_ocr_rects(page)
-
-                    all_ocr_cells = []
-                    for ocr_rect in ocr_rects:
-                        # Skip zero area boxes
-                        if ocr_rect.area() == 0:
-                            continue
-                        high_res_image = page._backend.get_page_image(
-                            scale=self.scale, cropbox=ocr_rect
-                        )
-
-                        with tempfile.NamedTemporaryFile(
-                            suffix=".png", mode="w"
-                        ) as image_file:
-                            fname = image_file.name
-                            high_res_image.save(fname)
-
-                            boxes = self.reader_RIL(
-                                fname,
-                                recognition_level=self.options.recognition,
-                                framework=self.options.framework,
-                                language_preference=self.options.lang,
-                            ).recognize()
-
-                        im_width, im_height = high_res_image.size
-                        cells = []
-                        for ix, (text, confidence, box) in enumerate(boxes):
-                            x = float(box[0])
-                            y = float(box[1])
-                            w = float(box[2])
-                            h = float(box[3])
-
-                            x1 = x * im_width
-                            y2 = (1 - y) * im_height
-
-                            x2 = x1 + w * im_width
-                            y1 = y2 - h * im_height
-
-                            left = x1 / self.scale
-                            top = y1 / self.scale
-                            right = x2 / self.scale
-                            bottom = y2 / self.scale
-
-                            cells.append(
-                                OcrCell(
-                                    id=ix,
-                                    text=text,
-                                    confidence=confidence,
-                                    bbox=BoundingBox.from_tuple(
-                                        coord=(left, top, right, bottom),
-                                        origin=CoordOrigin.TOPLEFT,
-                                    ),
-                                )
-                            )
-
-                        # del high_res_image
-                        all_ocr_cells.extend(cells)
-
-                    # Post-process the cells
-                    page.cells = self.post_process_cells(all_ocr_cells, page.cells)
-
-                # DEBUG code:
-                if settings.debug.visualize_ocr:
-                    self.draw_ocr_rects_and_cells(conv_res, page, ocr_rects)
-
-            yield page
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/page_assemble_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/page_assemble_model.py
deleted file mode 100644
index 4acf8c95851cedda738949efe50e1833f951460e..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/page_assemble_model.py
+++ /dev/null
@@ -1,152 +0,0 @@
-import logging
-import re
-from typing import Iterable, List
-
-from pydantic import BaseModel
-
-from docling.datamodel.base_models import (
-    AssembledUnit,
-    ContainerElement,
-    FigureElement,
-    Page,
-    PageElement,
-    Table,
-    TextElement,
-)
-from docling.datamodel.document import ConversionResult
-from docling.models.base_model import BasePageModel
-from docling.models.layout_model import LayoutModel
-from docling.utils.profiling import TimeRecorder
-
-_log = logging.getLogger(__name__)
-
-
-class PageAssembleOptions(BaseModel):
-    pass
-
-
-class PageAssembleModel(BasePageModel):
-    def __init__(self, options: PageAssembleOptions):
-        self.options = options
-
-    def sanitize_text(self, lines):
-        if len(lines) <= 1:
-            return " ".join(lines)
-
-        for ix, line in enumerate(lines[1:]):
-            prev_line = lines[ix]
-
-            if prev_line.endswith("-"):
-                prev_words = re.findall(r"\b[\w]+\b", prev_line)
-                line_words = re.findall(r"\b[\w]+\b", line)
-
-                if (
-                    len(prev_words)
-                    and len(line_words)
-                    and prev_words[-1].isalnum()
-                    and line_words[0].isalnum()
-                ):
-                    lines[ix] = prev_line[:-1]
-            else:
-                lines[ix] += " "
-
-        sanitized_text = "".join(lines)
-
-        return sanitized_text.strip()  # Strip any leading or trailing whitespace
-
-    def __call__(
-        self, conv_res: ConversionResult, page_batch: Iterable[Page]
-    ) -> Iterable[Page]:
-        for page in page_batch:
-            assert page._backend is not None
-            if not page._backend.is_valid():
-                yield page
-            else:
-                with TimeRecorder(conv_res, "page_assemble"):
-
-                    assert page.predictions.layout is not None
-
-                    # assembles some JSON output page by page.
-
-                    elements: List[PageElement] = []
-                    headers: List[PageElement] = []
-                    body: List[PageElement] = []
-
-                    for cluster in page.predictions.layout.clusters:
-                        # _log.info("Cluster label seen:", cluster.label)
-                        if cluster.label in LayoutModel.TEXT_ELEM_LABELS:
-
-                            textlines = [
-                                cell.text.replace("\x02", "-").strip()
-                                for cell in cluster.cells
-                                if len(cell.text.strip()) > 0
-                            ]
-                            text = self.sanitize_text(textlines)
-                            text_el = TextElement(
-                                label=cluster.label,
-                                id=cluster.id,
-                                text=text,
-                                page_no=page.page_no,
-                                cluster=cluster,
-                            )
-                            elements.append(text_el)
-
-                            if cluster.label in LayoutModel.PAGE_HEADER_LABELS:
-                                headers.append(text_el)
-                            else:
-                                body.append(text_el)
-                        elif cluster.label in LayoutModel.TABLE_LABELS:
-                            tbl = None
-                            if page.predictions.tablestructure:
-                                tbl = page.predictions.tablestructure.table_map.get(
-                                    cluster.id, None
-                                )
-                            if (
-                                not tbl
-                            ):  # fallback: add table without structure, if it isn't present
-                                tbl = Table(
-                                    label=cluster.label,
-                                    id=cluster.id,
-                                    text="",
-                                    otsl_seq=[],
-                                    table_cells=[],
-                                    cluster=cluster,
-                                    page_no=page.page_no,
-                                )
-
-                            elements.append(tbl)
-                            body.append(tbl)
-                        elif cluster.label == LayoutModel.FIGURE_LABEL:
-                            fig = None
-                            if page.predictions.figures_classification:
-                                fig = page.predictions.figures_classification.figure_map.get(
-                                    cluster.id, None
-                                )
-                            if (
-                                not fig
-                            ):  # fallback: add figure without classification, if it isn't present
-                                fig = FigureElement(
-                                    label=cluster.label,
-                                    id=cluster.id,
-                                    text="",
-                                    data=None,
-                                    cluster=cluster,
-                                    page_no=page.page_no,
-                                )
-                            elements.append(fig)
-                            body.append(fig)
-                        elif cluster.label in LayoutModel.CONTAINER_LABELS:
-                            container_el = ContainerElement(
-                                label=cluster.label,
-                                id=cluster.id,
-                                page_no=page.page_no,
-                                cluster=cluster,
-                            )
-                            elements.append(container_el)
-                            body.append(container_el)
-
-                    page.assembled = AssembledUnit(
elements=elements, headers=headers, body=body - ) - - yield page diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/page_preprocessing_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/page_preprocessing_model.py deleted file mode 100644 index 63f1a4f6e2722a9bd42058839d1c32c0d00c3bdd..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/page_preprocessing_model.py +++ /dev/null @@ -1,79 +0,0 @@ -from pathlib import Path -from typing import Iterable, Optional - -from PIL import ImageDraw -from pydantic import BaseModel - -from docling.datamodel.base_models import Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.settings import settings -from docling.models.base_model import BasePageModel -from docling.utils.profiling import TimeRecorder - - -class PagePreprocessingOptions(BaseModel): - images_scale: Optional[float] - - -class PagePreprocessingModel(BasePageModel): - def __init__(self, options: PagePreprocessingOptions): - self.options = options - - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - for page in page_batch: - assert page._backend is not None - if not page._backend.is_valid(): - yield page - else: - with TimeRecorder(conv_res, "page_parse"): - page = self._populate_page_images(page) - page = self._parse_page_cells(conv_res, page) - yield page - - # Generate the page image and store it in the page object - def _populate_page_images(self, page: Page) -> Page: - # default scale - page.get_image( - scale=1.0 - ) # puts the page image on the image cache at default scale - - images_scale = self.options.images_scale - # user requested scales - if images_scale is not None: - page._default_image_scale = images_scale - page.get_image( - scale=images_scale - ) # this will trigger storing the image in the internal cache - - return page - - # Extract and populate the page cells and store it in the 
page object - def _parse_page_cells(self, conv_res: ConversionResult, page: Page) -> Page: - assert page._backend is not None - - page.cells = list(page._backend.get_text_cells()) - - # DEBUG code: - def draw_text_boxes(image, cells, show: bool = False): - draw = ImageDraw.Draw(image) - for c in cells: - x0, y0, x1, y1 = c.bbox.as_tuple() - draw.rectangle([(x0, y0), (x1, y1)], outline="red") - if show: - image.show() - else: - out_path: Path = ( - Path(settings.debug.debug_output_path) - / f"debug_{conv_res.input.file.stem}" - ) - out_path.mkdir(parents=True, exist_ok=True) - - out_file = out_path / f"cells_page_{page.page_no:05}.png" - image.save(str(out_file), format="png") - - if settings.debug.visualize_cells: - draw_text_boxes(page.get_image(scale=1.0), page.cells) - - return page diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_api_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_api_model.py deleted file mode 100644 index 86b7694411d22d89d0d013cc89702a422923fa7e..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_api_model.py +++ /dev/null @@ -1,101 +0,0 @@ -import base64 -import io -import logging -from typing import Iterable, List, Optional - -import requests -from PIL import Image -from pydantic import BaseModel, ConfigDict - -from docling.datamodel.pipeline_options import PictureDescriptionApiOptions -from docling.models.picture_description_base_model import PictureDescriptionBaseModel - -_log = logging.getLogger(__name__) - - -class ChatMessage(BaseModel): - role: str - content: str - - -class ResponseChoice(BaseModel): - index: int - message: ChatMessage - finish_reason: str - - -class ResponseUsage(BaseModel): - prompt_tokens: int - completion_tokens: int - total_tokens: int - - -class ApiResponse(BaseModel): - model_config = ConfigDict( - protected_namespaces=(), - ) - - id: str - model: Optional[str] = None 
# returned by openai - choices: List[ResponseChoice] - created: int - usage: ResponseUsage - - -class PictureDescriptionApiModel(PictureDescriptionBaseModel): - # elements_batch_size = 4 - - def __init__(self, enabled: bool, options: PictureDescriptionApiOptions): - super().__init__(enabled=enabled, options=options) - self.options: PictureDescriptionApiOptions - - if self.enabled: - if options.url.host != "localhost": - raise NotImplementedError( - "The options try to connect to remote APIs which are not yet allowed." - ) - - def _annotate_images(self, images: Iterable[Image.Image]) -> Iterable[str]: - # Note: technically we could make a batch request here, - # but not all APIs will allow for it. For example, vllm won't allow more than 1. - for image in images: - img_io = io.BytesIO() - image.save(img_io, "PNG") - image_base64 = base64.b64encode(img_io.getvalue()).decode("utf-8") - - messages = [ - { - "role": "user", - "content": [ - { - "type": "text", - "text": self.options.prompt, - }, - { - "type": "image_url", - "image_url": { - "url": f"data:image/png;base64,{image_base64}" - }, - }, - ], - } - ] - - payload = { - "messages": messages, - **self.options.params, - } - - r = requests.post( - str(self.options.url), - headers=self.options.headers, - json=payload, - timeout=self.options.timeout, - ) - if not r.ok: - _log.error(f"Error calling the API. 
Reponse was {r.text}") - r.raise_for_status() - - api_resp = ApiResponse.model_validate_json(r.text) - generated_text = api_resp.choices[0].message.content.strip() - yield generated_text diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_base_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_base_model.py deleted file mode 100644 index b653e0e3e44e21154ee8491bfae9688e44c1a1e3..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_base_model.py +++ /dev/null @@ -1,64 +0,0 @@ -import logging -from pathlib import Path -from typing import Any, Iterable, List, Optional, Union - -from docling_core.types.doc import ( - DoclingDocument, - NodeItem, - PictureClassificationClass, - PictureItem, -) -from docling_core.types.doc.document import ( # TODO: move import to docling_core.types.doc - PictureDescriptionData, -) -from PIL import Image - -from docling.datamodel.pipeline_options import PictureDescriptionBaseOptions -from docling.models.base_model import ( - BaseItemAndImageEnrichmentModel, - ItemAndImageEnrichmentElement, -) - - -class PictureDescriptionBaseModel(BaseItemAndImageEnrichmentModel): - images_scale: float = 2.0 - - def __init__( - self, - enabled: bool, - options: PictureDescriptionBaseOptions, - ): - self.enabled = enabled - self.options = options - self.provenance = "not-implemented" - - def is_processable(self, doc: DoclingDocument, element: NodeItem) -> bool: - return self.enabled and isinstance(element, PictureItem) - - def _annotate_images(self, images: Iterable[Image.Image]) -> Iterable[str]: - raise NotImplementedError - - def __call__( - self, - doc: DoclingDocument, - element_batch: Iterable[ItemAndImageEnrichmentElement], - ) -> Iterable[NodeItem]: - if not self.enabled: - for element in element_batch: - yield element.item - return - - images: List[Image.Image] = [] - elements: List[PictureItem] = [] - for el in 
element_batch: - assert isinstance(el.item, PictureItem) - elements.append(el.item) - images.append(el.image) - - outputs = self._annotate_images(images) - - for item, output in zip(elements, outputs): - item.annotations.append( - PictureDescriptionData(text=output, provenance=self.provenance) - ) - yield item diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_vlm_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_vlm_model.py deleted file mode 100644 index 9fa4826da01f85947239d814824970dee71790e1..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/picture_description_vlm_model.py +++ /dev/null @@ -1,109 +0,0 @@ -from pathlib import Path -from typing import Iterable, Optional, Union - -from PIL import Image - -from docling.datamodel.pipeline_options import ( - AcceleratorOptions, - PictureDescriptionVlmOptions, -) -from docling.models.picture_description_base_model import PictureDescriptionBaseModel -from docling.utils.accelerator_utils import decide_device - - -class PictureDescriptionVlmModel(PictureDescriptionBaseModel): - - def __init__( - self, - enabled: bool, - artifacts_path: Optional[Union[Path, str]], - options: PictureDescriptionVlmOptions, - accelerator_options: AcceleratorOptions, - ): - super().__init__(enabled=enabled, options=options) - self.options: PictureDescriptionVlmOptions - - if self.enabled: - - if artifacts_path is None: - artifacts_path = self.download_models(repo_id=self.options.repo_id) - else: - artifacts_path = Path(artifacts_path) / self.options.repo_cache_folder - - self.device = decide_device(accelerator_options.device) - - try: - import torch - from transformers import AutoModelForVision2Seq, AutoProcessor - except ImportError: - raise ImportError( - "transformers >=4.46 is not installed. Please install Docling with the required extras `pip install docling[vlm]`." 
- ) - - # Initialize processor and model - self.processor = AutoProcessor.from_pretrained(self.options.repo_id) - self.model = AutoModelForVision2Seq.from_pretrained( - self.options.repo_id, - torch_dtype=torch.bfloat16, - _attn_implementation=( - "flash_attention_2" if self.device.startswith("cuda") else "eager" - ), - ).to(self.device) - - self.provenance = f"{self.options.repo_id}" - - @staticmethod - def download_models( - repo_id: str, - local_dir: Optional[Path] = None, - force: bool = False, - progress: bool = False, - ) -> Path: - from huggingface_hub import snapshot_download - from huggingface_hub.utils import disable_progress_bars - - if not progress: - disable_progress_bars() - download_path = snapshot_download( - repo_id=repo_id, - force_download=force, - local_dir=local_dir, - ) - - return Path(download_path) - - def _annotate_images(self, images: Iterable[Image.Image]) -> Iterable[str]: - from transformers import GenerationConfig - - # Create input messages - messages = [ - { - "role": "user", - "content": [ - {"type": "image"}, - {"type": "text", "text": self.options.prompt}, - ], - }, - ] - - # TODO: do batch generation - - for image in images: - # Prepare inputs - prompt = self.processor.apply_chat_template( - messages, add_generation_prompt=True - ) - inputs = self.processor(text=prompt, images=[image], return_tensors="pt") - inputs = inputs.to(self.device) - - # Generate outputs - generated_ids = self.model.generate( - **inputs, - generation_config=GenerationConfig(**self.options.generation_config), - ) - generated_texts = self.processor.batch_decode( - generated_ids[:, inputs["input_ids"].shape[1] :], - skip_special_tokens=True, - ) - - yield generated_texts[0].strip() diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/rapid_ocr_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/rapid_ocr_model.py deleted file mode 100644 index fa3fbedf7ceffce9617b51ce671cfcc716a5f945..0000000000000000000000000000000000000000 --- 
a/Paper2Video/src/evaluation/PresentQuiz/docling/models/rapid_ocr_model.py +++ /dev/null @@ -1,128 +0,0 @@ -import logging -from typing import Iterable - -import numpy -from docling_core.types.doc import BoundingBox, CoordOrigin - -from docling.datamodel.base_models import OcrCell, Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import ( - AcceleratorDevice, - AcceleratorOptions, - RapidOcrOptions, -) -from docling.datamodel.settings import settings -from docling.models.base_ocr_model import BaseOcrModel -from docling.utils.accelerator_utils import decide_device -from docling.utils.profiling import TimeRecorder - -_log = logging.getLogger(__name__) - - -class RapidOcrModel(BaseOcrModel): - def __init__( - self, - enabled: bool, - options: RapidOcrOptions, - accelerator_options: AcceleratorOptions, - ): - super().__init__(enabled=enabled, options=options) - self.options: RapidOcrOptions - - self.scale = 3 # multiplier for 72 dpi == 216 dpi. - - if self.enabled: - try: - from rapidocr_onnxruntime import RapidOCR # type: ignore - except ImportError: - raise ImportError( - "RapidOCR is not installed. Please install it via `pip install rapidocr_onnxruntime` to use this OCR engine. " - "Alternatively, Docling has support for other OCR engines. See the documentation." 
- ) - - # Decide the accelerator devices - device = decide_device(accelerator_options.device) - use_cuda = str(AcceleratorDevice.CUDA.value).lower() in device - use_dml = accelerator_options.device == AcceleratorDevice.AUTO - intra_op_num_threads = accelerator_options.num_threads - - self.reader = RapidOCR( - text_score=self.options.text_score, - cls_use_cuda=use_cuda, - rec_use_cuda=use_cuda, - det_use_cuda=use_cuda, - det_use_dml=use_dml, - cls_use_dml=use_dml, - rec_use_dml=use_dml, - intra_op_num_threads=intra_op_num_threads, - print_verbose=self.options.print_verbose, - det_model_path=self.options.det_model_path, - cls_model_path=self.options.cls_model_path, - rec_model_path=self.options.rec_model_path, - rec_keys_path=self.options.rec_keys_path, - ) - - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - - if not self.enabled: - yield from page_batch - return - - for page in page_batch: - - assert page._backend is not None - if not page._backend.is_valid(): - yield page - else: - with TimeRecorder(conv_res, "ocr"): - ocr_rects = self.get_ocr_rects(page) - - all_ocr_cells = [] - for ocr_rect in ocr_rects: - # Skip zero area boxes - if ocr_rect.area() == 0: - continue - high_res_image = page._backend.get_page_image( - scale=self.scale, cropbox=ocr_rect - ) - im = numpy.array(high_res_image) - result, _ = self.reader( - im, - use_det=self.options.use_det, - use_cls=self.options.use_cls, - use_rec=self.options.use_rec, - ) - - del high_res_image - del im - - if result is not None: - cells = [ - OcrCell( - id=ix, - text=line[1], - confidence=line[2], - bbox=BoundingBox.from_tuple( - coord=( - (line[0][0][0] / self.scale) + ocr_rect.l, - (line[0][0][1] / self.scale) + ocr_rect.t, - (line[0][2][0] / self.scale) + ocr_rect.l, - (line[0][2][1] / self.scale) + ocr_rect.t, - ), - origin=CoordOrigin.TOPLEFT, - ), - ) - for ix, line in enumerate(result) - ] - all_ocr_cells.extend(cells) - - # Post-process the cells - 
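The cell construction above maps boxes detected on the 3x-upscaled crop back into 72 dpi page coordinates: divide each coordinate by the render scale, then offset by the OCR rectangle's origin on the page. A minimal standalone sketch of that mapping (the function name and tuple layout are illustrative, not part of docling's API):

```python
def rescale_box(x0, y0, x1, y1, scale, rect_l, rect_t):
    """Map a box from a `scale`-times upscaled crop back to page coordinates.

    (x0, y0) is the top-left corner, (x1, y1) the bottom-right corner,
    and (rect_l, rect_t) is the crop rectangle's origin on the page.
    """
    return (
        x0 / scale + rect_l,
        y0 / scale + rect_t,
        x1 / scale + rect_l,
        y1 / scale + rect_t,
    )
```

For example, a box spanning (30, 60)-(330, 210) on a crop rendered at scale 3 whose page origin is (100, 200) lands at (110, 220, 210, 270) in page space.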
page.cells = self.post_process_cells(all_ocr_cells, page.cells) - - # DEBUG code: - if settings.debug.visualize_ocr: - self.draw_ocr_rects_and_cells(conv_res, page, ocr_rects) - - yield page diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/table_structure_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/table_structure_model.py deleted file mode 100644 index 649791572b41a084aaba8640c62438124965e287..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/table_structure_model.py +++ /dev/null @@ -1,288 +0,0 @@ -import copy -import warnings -from pathlib import Path -from typing import Iterable, Optional, Union - -import numpy -from docling_core.types.doc import BoundingBox, DocItemLabel, TableCell -from docling_ibm_models.tableformer.data_management.tf_predictor import TFPredictor -from PIL import ImageDraw - -from docling.datamodel.base_models import Page, Table, TableStructurePrediction -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import ( - AcceleratorDevice, - AcceleratorOptions, - TableFormerMode, - TableStructureOptions, -) -from docling.datamodel.settings import settings -from docling.models.base_model import BasePageModel -from docling.utils.accelerator_utils import decide_device -from docling.utils.profiling import TimeRecorder - - -class TableStructureModel(BasePageModel): - _model_repo_folder = "ds4sd--docling-models" - _model_path = "model_artifacts/tableformer" - - def __init__( - self, - enabled: bool, - artifacts_path: Optional[Path], - options: TableStructureOptions, - accelerator_options: AcceleratorOptions, - ): - self.options = options - self.do_cell_matching = self.options.do_cell_matching - self.mode = self.options.mode - - self.enabled = enabled - if self.enabled: - - if artifacts_path is None: - artifacts_path = self.download_models() / self._model_path - else: - # will become the default in the future - if 
(artifacts_path / self._model_repo_folder).exists(): - artifacts_path = ( - artifacts_path / self._model_repo_folder / self._model_path - ) - elif (artifacts_path / self._model_path).exists(): - warnings.warn( - "The usage of artifacts_path containing directly " - f"{self._model_path} is deprecated. Please point " - "the artifacts_path to the parent containing " - f"the {self._model_repo_folder} folder.", - DeprecationWarning, - stacklevel=3, - ) - artifacts_path = artifacts_path / self._model_path - - if self.mode == TableFormerMode.ACCURATE: - artifacts_path = artifacts_path / "accurate" - else: - artifacts_path = artifacts_path / "fast" - - # Third Party - import docling_ibm_models.tableformer.common as c - - device = decide_device(accelerator_options.device) - - # Disable MPS here, until we know why it makes things slower. - if device == AcceleratorDevice.MPS.value: - device = AcceleratorDevice.CPU.value - - self.tm_config = c.read_config(f"{artifacts_path}/tm_config.json") - self.tm_config["model"]["save_dir"] = artifacts_path - self.tm_model_type = self.tm_config["model"]["type"] - - self.tf_predictor = TFPredictor( - self.tm_config, device, accelerator_options.num_threads - ) - self.scale = 2.0 # Scale up table input images to 144 dpi - - @staticmethod - def download_models( - local_dir: Optional[Path] = None, force: bool = False, progress: bool = False - ) -> Path: - from huggingface_hub import snapshot_download - from huggingface_hub.utils import disable_progress_bars - - if not progress: - disable_progress_bars() - download_path = snapshot_download( - repo_id="ds4sd/docling-models", - force_download=force, - local_dir=local_dir, - revision="v2.1.0", - ) - - return Path(download_path) - - def draw_table_and_cells( - self, - conv_res: ConversionResult, - page: Page, - tbl_list: Iterable[Table], - show: bool = False, - ): - assert page._backend is not None - assert page.size is not None - - image = ( - page._backend.get_page_image() - ) # make new image to 
avoid drawing on the saved ones - - scale_x = image.width / page.size.width - scale_y = image.height / page.size.height - - draw = ImageDraw.Draw(image) - - for table_element in tbl_list: - x0, y0, x1, y1 = table_element.cluster.bbox.as_tuple() - y0 *= scale_x - y1 *= scale_y - x0 *= scale_x - x1 *= scale_x - - draw.rectangle([(x0, y0), (x1, y1)], outline="red") - - for cell in table_element.cluster.cells: - x0, y0, x1, y1 = cell.bbox.as_tuple() - x0 *= scale_x - x1 *= scale_x - y0 *= scale_x - y1 *= scale_y - - draw.rectangle([(x0, y0), (x1, y1)], outline="green") - - for tc in table_element.table_cells: - if tc.bbox is not None: - x0, y0, x1, y1 = tc.bbox.as_tuple() - x0 *= scale_x - x1 *= scale_x - y0 *= scale_x - y1 *= scale_y - - if tc.column_header: - width = 3 - else: - width = 1 - draw.rectangle([(x0, y0), (x1, y1)], outline="blue", width=width) - draw.text( - (x0 + 3, y0 + 3), - text=f"{tc.start_row_offset_idx}, {tc.start_col_offset_idx}", - fill="black", - ) - if show: - image.show() - else: - out_path: Path = ( - Path(settings.debug.debug_output_path) - / f"debug_{conv_res.input.file.stem}" - ) - out_path.mkdir(parents=True, exist_ok=True) - - out_file = out_path / f"table_struct_page_{page.page_no:05}.png" - image.save(str(out_file), format="png") - - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - - if not self.enabled: - yield from page_batch - return - - for page in page_batch: - assert page._backend is not None - if not page._backend.is_valid(): - yield page - else: - with TimeRecorder(conv_res, "table_structure"): - - assert page.predictions.layout is not None - assert page.size is not None - - page.predictions.tablestructure = ( - TableStructurePrediction() - ) # dummy - - in_tables = [ - ( - cluster, - [ - round(cluster.bbox.l) * self.scale, - round(cluster.bbox.t) * self.scale, - round(cluster.bbox.r) * self.scale, - round(cluster.bbox.b) * self.scale, - ], - ) - for cluster in 
page.predictions.layout.clusters - if cluster.label - in [DocItemLabel.TABLE, DocItemLabel.DOCUMENT_INDEX] - ] - if not len(in_tables): - yield page - continue - - page_input = { - "width": page.size.width * self.scale, - "height": page.size.height * self.scale, - "image": numpy.asarray(page.get_image(scale=self.scale)), - } - - table_clusters, table_bboxes = zip(*in_tables) - - if len(table_bboxes): - for table_cluster, tbl_box in in_tables: - - tokens = [] - for c in table_cluster.cells: - # Only allow non empty stings (spaces) into the cells of a table - if len(c.text.strip()) > 0: - new_cell = copy.deepcopy(c) - new_cell.bbox = new_cell.bbox.scaled( - scale=self.scale - ) - - tokens.append(new_cell.model_dump()) - page_input["tokens"] = tokens - - tf_output = self.tf_predictor.multi_table_predict( - page_input, [tbl_box], do_matching=self.do_cell_matching - ) - table_out = tf_output[0] - table_cells = [] - for element in table_out["tf_responses"]: - - if not self.do_cell_matching: - the_bbox = BoundingBox.model_validate( - element["bbox"] - ).scaled(1 / self.scale) - text_piece = page._backend.get_text_in_rect( - the_bbox - ) - element["bbox"]["token"] = text_piece - - tc = TableCell.model_validate(element) - if self.do_cell_matching and tc.bbox is not None: - tc.bbox = tc.bbox.scaled(1 / self.scale) - table_cells.append(tc) - - assert "predict_details" in table_out - - # Retrieving cols/rows, after post processing: - num_rows = table_out["predict_details"].get("num_rows", 0) - num_cols = table_out["predict_details"].get("num_cols", 0) - otsl_seq = ( - table_out["predict_details"] - .get("prediction", {}) - .get("rs_seq", []) - ) - - tbl = Table( - otsl_seq=otsl_seq, - table_cells=table_cells, - num_rows=num_rows, - num_cols=num_cols, - id=table_cluster.id, - page_no=page.page_no, - cluster=table_cluster, - label=table_cluster.label, - ) - - page.predictions.tablestructure.table_map[ - table_cluster.id - ] = tbl - - # For debugging purposes: - if 
settings.debug.visualize_tables: - self.draw_table_and_cells( - conv_res, - page, - page.predictions.tablestructure.table_map.values(), - ) - - yield page diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/tesseract_ocr_cli_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/tesseract_ocr_cli_model.py deleted file mode 100644 index cdc5671d7f59d75cd37faf4fd8eaee7d803643d1..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/tesseract_ocr_cli_model.py +++ /dev/null @@ -1,252 +0,0 @@ -import csv -import io -import logging -import os -import tempfile -from subprocess import DEVNULL, PIPE, Popen -from typing import Iterable, List, Optional, Tuple - -import pandas as pd -from docling_core.types.doc import BoundingBox, CoordOrigin - -from docling.datamodel.base_models import Cell, OcrCell, Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import TesseractCliOcrOptions -from docling.datamodel.settings import settings -from docling.models.base_ocr_model import BaseOcrModel -from docling.utils.ocr_utils import map_tesseract_script -from docling.utils.profiling import TimeRecorder - -_log = logging.getLogger(__name__) - - -class TesseractOcrCliModel(BaseOcrModel): - def __init__(self, enabled: bool, options: TesseractCliOcrOptions): - super().__init__(enabled=enabled, options=options) - self.options: TesseractCliOcrOptions - - self.scale = 3 # multiplier for 72 dpi == 216 dpi. - - self._name: Optional[str] = None - self._version: Optional[str] = None - self._tesseract_languages: Optional[List[str]] = None - self._script_prefix: Optional[str] = None - - if self.enabled: - try: - self._get_name_and_version() - self._set_languages_and_prefix() - - except Exception as exc: - raise RuntimeError( - f"Tesseract is not available, aborting: {exc} " - "Install tesseract on your system and the tesseract binary is discoverable. 
" - "The actual command for Tesseract can be specified in `pipeline_options.ocr_options.tesseract_cmd='tesseract'`. " - "Alternatively, Docling has support for other OCR engines. See the documentation." - ) - - def _get_name_and_version(self) -> Tuple[str, str]: - - if self._name != None and self._version != None: - return self._name, self._version # type: ignore - - cmd = [self.options.tesseract_cmd, "--version"] - - proc = Popen(cmd, stdout=PIPE, stderr=PIPE) - stdout, stderr = proc.communicate() - - proc.wait() - - # HACK: Windows versions of Tesseract output the version to stdout, Linux versions - # to stderr, so check both. - version_line = ( - (stdout.decode("utf8").strip() or stderr.decode("utf8").strip()) - .split("\n")[0] - .strip() - ) - - # If everything else fails... - if not version_line: - version_line = "tesseract XXX" - - name, version = version_line.split(" ") - - self._name = name - self._version = version - - return name, version - - def _run_tesseract(self, ifilename: str): - r""" - Run tesseract CLI - """ - cmd = [self.options.tesseract_cmd] - - if "auto" in self.options.lang: - lang = self._detect_language(ifilename) - if lang is not None: - cmd.append("-l") - cmd.append(lang) - elif self.options.lang is not None and len(self.options.lang) > 0: - cmd.append("-l") - cmd.append("+".join(self.options.lang)) - - if self.options.path is not None: - cmd.append("--tessdata-dir") - cmd.append(self.options.path) - - cmd += [ifilename, "stdout", "tsv"] - _log.info("command: {}".format(" ".join(cmd))) - - proc = Popen(cmd, stdout=PIPE, stderr=DEVNULL) - output, _ = proc.communicate() - - # _log.info(output) - - # Decode the byte string to a regular string - decoded_data = output.decode("utf-8") - # _log.info(decoded_data) - - # Read the TSV file generated by Tesseract - df = pd.read_csv(io.StringIO(decoded_data), quoting=csv.QUOTE_NONE, sep="\t") - - # Display the dataframe (optional) - # _log.info("df: ", df.head()) - - # Filter rows that contain actual 
text (ignore header or empty rows) - df_filtered = df[df["text"].notnull() & (df["text"].str.strip() != "")] - - return df_filtered - - def _detect_language(self, ifilename: str): - r""" - Run tesseract in PSM 0 mode to detect the language - """ - assert self._tesseract_languages is not None - - cmd = [self.options.tesseract_cmd] - cmd.extend(["--psm", "0", "-l", "osd", ifilename, "stdout"]) - _log.info("command: {}".format(" ".join(cmd))) - proc = Popen(cmd, stdout=PIPE, stderr=DEVNULL) - output, _ = proc.communicate() - decoded_data = output.decode("utf-8") - df = pd.read_csv( - io.StringIO(decoded_data), sep=":", header=None, names=["key", "value"] - ) - scripts = df.loc[df["key"] == "Script"].value.tolist() - if len(scripts) == 0: - _log.warning("Tesseract cannot detect the script of the page") - return None - - script = map_tesseract_script(scripts[0].strip()) - lang = f"{self._script_prefix}{script}" - - # Check if the detected language has been installed - if lang not in self._tesseract_languages: - msg = f"Tesseract detected the script '{script}' and language '{lang}'." - msg += " However this language is not installed in your system and will be ignored." 
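When the detected script model is not installed, the code above logs a warning and returns `None` so OCR falls back to the configured languages. The selection logic reduces to a small pure function; `_SCRIPT_ALIASES` below is a hypothetical, simplified stand-in for `map_tesseract_script` (the real mapping lives in `docling.utils.ocr_utils`):

```python
# Hypothetical stand-in for docling.utils.ocr_utils.map_tesseract_script().
_SCRIPT_ALIASES = {"Katakana": "Japanese", "Hiragana": "Japanese"}

def pick_script_model(detected_script, installed_langs, script_prefix="script/"):
    """Return the installed `script/...` model for an OSD-detected script,
    or None when that model is not installed on the system."""
    script = _SCRIPT_ALIASES.get(detected_script, detected_script)
    lang = f"{script_prefix}{script}"
    return lang if lang in installed_langs else None
```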
- _log.warning(msg) - return None - - _log.debug( - f"Using tesseract model for the detected script '{script}' and language '{lang}'" - ) - return lang - - def _set_languages_and_prefix(self): - r""" - Read and set the languages installed in tesseract and decide the script prefix - """ - # Get all languages - cmd = [self.options.tesseract_cmd] - cmd.append("--list-langs") - _log.info("command: {}".format(" ".join(cmd))) - proc = Popen(cmd, stdout=PIPE, stderr=DEVNULL) - output, _ = proc.communicate() - decoded_data = output.decode("utf-8") - df = pd.read_csv(io.StringIO(decoded_data), header=None) - self._tesseract_languages = df[0].tolist()[1:] - - # Decide the script prefix - if any([l.startswith("script/") for l in self._tesseract_languages]): - script_prefix = "script/" - else: - script_prefix = "" - - self._script_prefix = script_prefix - - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - - if not self.enabled: - yield from page_batch - return - - for page in page_batch: - assert page._backend is not None - if not page._backend.is_valid(): - yield page - else: - with TimeRecorder(conv_res, "ocr"): - ocr_rects = self.get_ocr_rects(page) - - all_ocr_cells = [] - for ocr_rect in ocr_rects: - # Skip zero area boxes - if ocr_rect.area() == 0: - continue - high_res_image = page._backend.get_page_image( - scale=self.scale, cropbox=ocr_rect - ) - try: - with tempfile.NamedTemporaryFile( - suffix=".png", mode="w+b", delete=False - ) as image_file: - fname = image_file.name - high_res_image.save(image_file) - - df = self._run_tesseract(fname) - finally: - if os.path.exists(fname): - os.remove(fname) - - # _log.info(df) - - # Print relevant columns (bounding box and text) - for ix, row in df.iterrows(): - text = row["text"] - conf = row["conf"] - - l = float(row["left"]) - b = float(row["top"]) - w = float(row["width"]) - h = float(row["height"]) - - t = b + h - r = l + w - - cell = OcrCell( - id=ix, - text=text, - 
confidence=conf / 100.0, - bbox=BoundingBox.from_tuple( - coord=( - (l / self.scale) + ocr_rect.l, - (b / self.scale) + ocr_rect.t, - (r / self.scale) + ocr_rect.l, - (t / self.scale) + ocr_rect.t, - ), - origin=CoordOrigin.TOPLEFT, - ), - ) - all_ocr_cells.append(cell) - - # Post-process the cells - page.cells = self.post_process_cells(all_ocr_cells, page.cells) - - # DEBUG code: - if settings.debug.visualize_ocr: - self.draw_ocr_rects_and_cells(conv_res, page, ocr_rects) - - yield page diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/models/tesseract_ocr_model.py b/Paper2Video/src/evaluation/PresentQuiz/docling/models/tesseract_ocr_model.py deleted file mode 100644 index 5b70155e963fde67af07f3887d3baf2b940e3b5f..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/models/tesseract_ocr_model.py +++ /dev/null @@ -1,198 +0,0 @@ -import logging -from typing import Iterable - -from docling_core.types.doc import BoundingBox, CoordOrigin - -from docling.datamodel.base_models import Cell, OcrCell, Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import TesseractOcrOptions -from docling.datamodel.settings import settings -from docling.models.base_ocr_model import BaseOcrModel -from docling.utils.ocr_utils import map_tesseract_script -from docling.utils.profiling import TimeRecorder - -_log = logging.getLogger(__name__) - - -class TesseractOcrModel(BaseOcrModel): - def __init__(self, enabled: bool, options: TesseractOcrOptions): - super().__init__(enabled=enabled, options=options) - self.options: TesseractOcrOptions - - self.scale = 3 # multiplier for 72 dpi == 216 dpi. - self.reader = None - self.osd_reader = None - - if self.enabled: - install_errmsg = ( - "tesserocr is not correctly installed. " - "Please install it via `pip install tesserocr` to use this OCR engine. 
" - "Note that tesserocr might have to be manually compiled for working with " - "your Tesseract installation. The Docling documentation provides examples for it. " - "Alternatively, Docling has support for other OCR engines. See the documentation: " - "https://ds4sd.github.io/docling/installation/" - ) - missing_langs_errmsg = ( - "tesserocr is not correctly configured. No language models have been detected. " - "Please ensure that the TESSDATA_PREFIX envvar points to tesseract languages dir. " - "You can find more information how to setup other OCR engines in Docling " - "documentation: " - "https://ds4sd.github.io/docling/installation/" - ) - - try: - import tesserocr - except ImportError: - raise ImportError(install_errmsg) - try: - tesseract_version = tesserocr.tesseract_version() - except: - raise ImportError(install_errmsg) - - _, self._tesserocr_languages = tesserocr.get_languages() - if not self._tesserocr_languages: - raise ImportError(missing_langs_errmsg) - - # Initialize the tesseractAPI - _log.debug("Initializing TesserOCR: %s", tesseract_version) - lang = "+".join(self.options.lang) - - self.script_readers: dict[str, tesserocr.PyTessBaseAPI] = {} - - if any([l.startswith("script/") for l in self._tesserocr_languages]): - self.script_prefix = "script/" - else: - self.script_prefix = "" - - tesserocr_kwargs = { - "psm": tesserocr.PSM.AUTO, - "init": True, - "oem": tesserocr.OEM.DEFAULT, - } - - if self.options.path is not None: - tesserocr_kwargs["path"] = self.options.path - - if lang == "auto": - self.reader = tesserocr.PyTessBaseAPI(**tesserocr_kwargs) - self.osd_reader = tesserocr.PyTessBaseAPI( - **{"lang": "osd", "psm": tesserocr.PSM.OSD_ONLY} | tesserocr_kwargs - ) - else: - self.reader = tesserocr.PyTessBaseAPI( - **{"lang": lang} | tesserocr_kwargs, - ) - self.reader_RIL = tesserocr.RIL - - def __del__(self): - if self.reader is not None: - # Finalize the tesseractAPI - self.reader.End() - for script in self.script_readers: - 
self.script_readers[script].End() - - def __call__( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - if not self.enabled: - yield from page_batch - return - - for page in page_batch: - assert page._backend is not None - if not page._backend.is_valid(): - yield page - else: - with TimeRecorder(conv_res, "ocr"): - assert self.reader is not None - assert self._tesserocr_languages is not None - - ocr_rects = self.get_ocr_rects(page) - - all_ocr_cells = [] - for ocr_rect in ocr_rects: - # Skip zero area boxes - if ocr_rect.area() == 0: - continue - high_res_image = page._backend.get_page_image( - scale=self.scale, cropbox=ocr_rect - ) - - local_reader = self.reader - if "auto" in self.options.lang: - assert self.osd_reader is not None - - self.osd_reader.SetImage(high_res_image) - osd = self.osd_reader.DetectOrientationScript() - - # No text, probably - if osd is None: - continue - - script = osd["script_name"] - script = map_tesseract_script(script) - lang = f"{self.script_prefix}{script}" - - # Check if the detected languge is present in the system - if lang not in self._tesserocr_languages: - msg = f"Tesseract detected the script '{script}' and language '{lang}'." - msg += " However this language is not installed in your system and will be ignored." - _log.warning(msg) - else: - if script not in self.script_readers: - import tesserocr - - self.script_readers[script] = ( - tesserocr.PyTessBaseAPI( - path=self.reader.GetDatapath(), - lang=lang, - psm=tesserocr.PSM.AUTO, - init=True, - oem=tesserocr.OEM.DEFAULT, - ) - ) - local_reader = self.script_readers[script] - - local_reader.SetImage(high_res_image) - boxes = local_reader.GetComponentImages( - self.reader_RIL.TEXTLINE, True - ) - - cells = [] - for ix, (im, box, _, _) in enumerate(boxes): - # Set the area of interest. 
Tesseract uses Bottom-Left for the origin - local_reader.SetRectangle( - box["x"], box["y"], box["w"], box["h"] - ) - - # Extract text within the bounding box - text = local_reader.GetUTF8Text().strip() - confidence = local_reader.MeanTextConf() - left = box["x"] / self.scale - bottom = box["y"] / self.scale - right = (box["x"] + box["w"]) / self.scale - top = (box["y"] + box["h"]) / self.scale - - cells.append( - OcrCell( - id=ix, - text=text, - confidence=confidence, - bbox=BoundingBox.from_tuple( - coord=(left, top, right, bottom), - origin=CoordOrigin.TOPLEFT, - ), - ) - ) - - # del high_res_image - all_ocr_cells.extend(cells) - - # Post-process the cells - page.cells = self.post_process_cells(all_ocr_cells, page.cells) - - # DEBUG code: - if settings.debug.visualize_ocr: - self.draw_ocr_rects_and_cells(conv_res, page, ocr_rects) - - yield page diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/base_pipeline.py b/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/base_pipeline.py deleted file mode 100644 index 1bf48ef0b9de55485d0b949cb3a5505007c97ee3..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/base_pipeline.py +++ /dev/null @@ -1,230 +0,0 @@ -import functools -import logging -import time -import traceback -from abc import ABC, abstractmethod -from typing import Any, Callable, Iterable, List - -from docling_core.types.doc import DoclingDocument, NodeItem - -from docling.backend.abstract_backend import AbstractDocumentBackend -from docling.backend.pdf_backend import PdfDocumentBackend -from docling.datamodel.base_models import ( - ConversionStatus, - DoclingComponentType, - ErrorItem, - Page, -) -from 
docling.datamodel.document import ConversionResult, InputDocument -from docling.datamodel.pipeline_options import PipelineOptions -from docling.datamodel.settings import settings -from docling.models.base_model import GenericEnrichmentModel -from docling.utils.profiling import ProfilingScope, TimeRecorder -from docling.utils.utils import chunkify - -_log = logging.getLogger(__name__) - - -class BasePipeline(ABC): - def __init__(self, pipeline_options: PipelineOptions): - self.pipeline_options = pipeline_options - self.keep_images = False - self.build_pipe: List[Callable] = [] - self.enrichment_pipe: List[GenericEnrichmentModel[Any]] = [] - - def execute(self, in_doc: InputDocument, raises_on_error: bool) -> ConversionResult: - conv_res = ConversionResult(input=in_doc) - - _log.info(f"Processing document {in_doc.file.name}") - try: - with TimeRecorder( - conv_res, "pipeline_total", scope=ProfilingScope.DOCUMENT - ): - # These steps are building and assembling the structure of the - # output DoclingDocument. 
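Stripped of profiling, error handling, and docling's data model, `execute` above is a template method: the build, assemble, and enrich stage order is fixed in the base class while subclasses supply the stages. A toy reduction of that control flow (the `ToyPipeline` class is made up for illustration, not part of docling):

```python
class ToyPipeline:
    """Hypothetical reduction of BasePipeline.execute's stage ordering."""

    def execute(self, doc):
        res = {"input": doc, "stages": []}
        # Fixed stage order; subclasses override the individual steps.
        res = self._build_document(res)
        res = self._assemble_document(res)
        res = self._enrich_document(res)
        res["status"] = "SUCCESS"
        return res

    def _build_document(self, res):
        res["stages"].append("build")
        return res

    def _assemble_document(self, res):
        res["stages"].append("assemble")
        return res

    def _enrich_document(self, res):
        res["stages"].append("enrich")
        return res


print(ToyPipeline().execute("paper.pdf")["stages"])  # ['build', 'assemble', 'enrich']
```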
- conv_res = self._build_document(conv_res) - conv_res = self._assemble_document(conv_res) - # From this stage, all operations should rely only on conv_res.output - conv_res = self._enrich_document(conv_res) - conv_res.status = self._determine_status(conv_res) - except Exception as e: - conv_res.status = ConversionStatus.FAILURE - if raises_on_error: - raise e - finally: - self._unload(conv_res) - - return conv_res - - @abstractmethod - def _build_document(self, conv_res: ConversionResult) -> ConversionResult: - pass - - def _assemble_document(self, conv_res: ConversionResult) -> ConversionResult: - return conv_res - - def _enrich_document(self, conv_res: ConversionResult) -> ConversionResult: - - def _prepare_elements( - conv_res: ConversionResult, model: GenericEnrichmentModel[Any] - ) -> Iterable[NodeItem]: - for doc_element, _level in conv_res.document.iterate_items(): - prepared_element = model.prepare_element( - conv_res=conv_res, element=doc_element - ) - if prepared_element is not None: - yield prepared_element - - with TimeRecorder(conv_res, "doc_enrich", scope=ProfilingScope.DOCUMENT): - for model in self.enrichment_pipe: - for element_batch in chunkify( - _prepare_elements(conv_res, model), - model.elements_batch_size, - ): - for element in model( - doc=conv_res.document, element_batch=element_batch - ): # Must exhaust! 
- pass - - return conv_res - - @abstractmethod - def _determine_status(self, conv_res: ConversionResult) -> ConversionStatus: - pass - - def _unload(self, conv_res: ConversionResult): - pass - - @classmethod - @abstractmethod - def get_default_options(cls) -> PipelineOptions: - pass - - @classmethod - @abstractmethod - def is_backend_supported(cls, backend: AbstractDocumentBackend): - pass - - # def _apply_on_elements(self, element_batch: Iterable[NodeItem]) -> Iterable[Any]: - # for model in self.build_pipe: - # element_batch = model(element_batch) - # - # yield from element_batch - - -class PaginatedPipeline(BasePipeline): # TODO this is a bad name. - - def __init__(self, pipeline_options: PipelineOptions): - super().__init__(pipeline_options) - self.keep_backend = False - - def _apply_on_pages( - self, conv_res: ConversionResult, page_batch: Iterable[Page] - ) -> Iterable[Page]: - for model in self.build_pipe: - page_batch = model(conv_res, page_batch) - - yield from page_batch - - def _build_document(self, conv_res: ConversionResult) -> ConversionResult: - - if not isinstance(conv_res.input._backend, PdfDocumentBackend): - raise RuntimeError( - f"The selected backend {type(conv_res.input._backend).__name__} for {conv_res.input.file} is not a PDF backend. " - f"Can not convert this with a PDF pipeline. " - f"Please check your format configuration on DocumentConverter." - ) - # conv_res.status = ConversionStatus.FAILURE - # return conv_res - - total_elapsed_time = 0.0 - with TimeRecorder(conv_res, "doc_build", scope=ProfilingScope.DOCUMENT): - - for i in range(0, conv_res.input.page_count): - start_page, end_page = conv_res.input.limits.page_range - if (start_page - 1) <= i <= (end_page - 1): - conv_res.pages.append(Page(page_no=i)) - - try: - # Iterate batches of pages (page_batch_size) in the doc - for page_batch in chunkify( - conv_res.pages, settings.perf.page_batch_size - ): - start_batch_time = time.monotonic() - - # 1. 
Initialise the page resources - init_pages = map( - functools.partial(self.initialize_page, conv_res), page_batch - ) - - # 2. Run pipeline stages - pipeline_pages = self._apply_on_pages(conv_res, init_pages) - - for p in pipeline_pages: # Must exhaust! - - # Cleanup cached images - if not self.keep_images: - p._image_cache = {} - - # Cleanup page backends - if not self.keep_backend and p._backend is not None: - p._backend.unload() - - end_batch_time = time.monotonic() - total_elapsed_time += end_batch_time - start_batch_time - if ( - self.pipeline_options.document_timeout is not None - and total_elapsed_time > self.pipeline_options.document_timeout - ): - _log.warning( - f"Document processing time ({total_elapsed_time:.3f} seconds) exceeded the specified timeout of {self.pipeline_options.document_timeout:.3f} seconds" - ) - conv_res.status = ConversionStatus.PARTIAL_SUCCESS - break - - _log.debug( - f"Finished converting page batch time={end_batch_time:.3f}" - ) - - except Exception as e: - conv_res.status = ConversionStatus.FAILURE - trace = "\n".join( - traceback.format_exception(type(e), e, e.__traceback__) - ) - _log.warning( - f"Encountered an error during conversion of document {conv_res.input.document_hash}:\n" - f"{trace}" - ) - raise e - - return conv_res - - def _unload(self, conv_res: ConversionResult) -> ConversionResult: - for page in conv_res.pages: - if page._backend is not None: - page._backend.unload() - - if conv_res.input._backend: - conv_res.input._backend.unload() - - return conv_res - - def _determine_status(self, conv_res: ConversionResult) -> ConversionStatus: - status = ConversionStatus.SUCCESS - for page in conv_res.pages: - if page._backend is None or not page._backend.is_valid(): - conv_res.errors.append( - ErrorItem( - component_type=DoclingComponentType.DOCUMENT_BACKEND, - module_name=type(page._backend).__name__, - error_message=f"Page {page.page_no} failed to parse.", - ) - ) - status = ConversionStatus.PARTIAL_SUCCESS - - return 
status - - # Initialise and load resources for a page - @abstractmethod - def initialize_page(self, conv_res: ConversionResult, page: Page) -> Page: - pass diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/simple_pipeline.py b/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/simple_pipeline.py deleted file mode 100644 index fb9852312f24f2389698b9cf4ab9a13390151846..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/simple_pipeline.py +++ /dev/null @@ -1,56 +0,0 @@ -import logging - -from docling.backend.abstract_backend import ( - AbstractDocumentBackend, - DeclarativeDocumentBackend, -) -from docling.datamodel.base_models import ConversionStatus -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import PipelineOptions -from docling.pipeline.base_pipeline import BasePipeline -from docling.utils.profiling import ProfilingScope, TimeRecorder - -_log = logging.getLogger(__name__) - - -class SimplePipeline(BasePipeline): - """SimpleModelPipeline. - - This class is used at the moment for formats / backends - which produce straight DoclingDocument output. - """ - - def __init__(self, pipeline_options: PipelineOptions): - super().__init__(pipeline_options) - - def _build_document(self, conv_res: ConversionResult) -> ConversionResult: - - if not isinstance(conv_res.input._backend, DeclarativeDocumentBackend): - raise RuntimeError( - f"The selected backend {type(conv_res.input._backend).__name__} for {conv_res.input.file} is not a declarative backend. " - f"Can not convert this with simple pipeline. " - f"Please check your format configuration on DocumentConverter." - ) - # conv_res.status = ConversionStatus.FAILURE - # return conv_res - - # Instead of running a page-level pipeline to build up the document structure, - # the backend is expected to be of type DeclarativeDocumentBackend, which can output - # a DoclingDocument straight. 
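Both pipeline variants lean on `chunkify` to batch pages (and enrichment elements). The helper itself lives in `docling.utils.utils` and is outside this diff; assuming the usual generator-batching semantics, its behavior can be sketched as:

```python
from itertools import islice


def chunkify(iterable, chunk_size):
    """Yield successive lists of at most chunk_size items (sketch of
    docling.utils.utils.chunkify; the real implementation may differ)."""
    iterator = iter(iterable)
    while True:
        batch = list(islice(iterator, chunk_size))
        if not batch:
            return
        yield batch


pages = [f"page_{i}" for i in range(5)]
print([len(batch) for batch in chunkify(pages, 2)])  # [2, 2, 1]
```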
- with TimeRecorder(conv_res, "doc_build", scope=ProfilingScope.DOCUMENT): - conv_res.document = conv_res.input._backend.convert() - return conv_res - - def _determine_status(self, conv_res: ConversionResult) -> ConversionStatus: - # This is called only if the previous steps didn't raise. - # Since we don't have anything else to evaluate, we can - # safely return SUCCESS. - return ConversionStatus.SUCCESS - - @classmethod - def get_default_options(cls) -> PipelineOptions: - return PipelineOptions() - - @classmethod - def is_backend_supported(cls, backend: AbstractDocumentBackend): - return isinstance(backend, DeclarativeDocumentBackend) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/standard_pdf_pipeline.py b/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/standard_pdf_pipeline.py deleted file mode 100644 index 13e435f9a1dc4e983537c3a00d75182ea7c5de98..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/pipeline/standard_pdf_pipeline.py +++ /dev/null @@ -1,296 +0,0 @@ -import logging -import sys -import warnings -from pathlib import Path -from typing import Optional - -from docling_core.types.doc import DocItem, ImageRef, PictureItem, TableItem - -from docling.backend.abstract_backend import AbstractDocumentBackend -from docling.backend.pdf_backend import PdfDocumentBackend -from docling.datamodel.base_models import AssembledUnit, Page -from docling.datamodel.document import ConversionResult -from docling.datamodel.pipeline_options import ( - EasyOcrOptions, - OcrMacOptions, - PdfPipelineOptions, - PictureDescriptionApiOptions, - PictureDescriptionVlmOptions, - RapidOcrOptions, - TesseractCliOcrOptions, - TesseractOcrOptions, -) -from docling.datamodel.settings import settings -from docling.models.base_ocr_model import BaseOcrModel -from docling.models.code_formula_model import CodeFormulaModel, CodeFormulaModelOptions -from docling.models.document_picture_classifier import ( - 
DocumentPictureClassifier, - DocumentPictureClassifierOptions, -) -from docling.models.ds_glm_model import GlmModel, GlmOptions -from docling.models.easyocr_model import EasyOcrModel -from docling.models.layout_model import LayoutModel -from docling.models.ocr_mac_model import OcrMacModel -from docling.models.page_assemble_model import PageAssembleModel, PageAssembleOptions -from docling.models.page_preprocessing_model import ( - PagePreprocessingModel, - PagePreprocessingOptions, -) -from docling.models.picture_description_api_model import PictureDescriptionApiModel -from docling.models.picture_description_base_model import PictureDescriptionBaseModel -from docling.models.picture_description_vlm_model import PictureDescriptionVlmModel -from docling.models.rapid_ocr_model import RapidOcrModel -from docling.models.table_structure_model import TableStructureModel -from docling.models.tesseract_ocr_cli_model import TesseractOcrCliModel -from docling.models.tesseract_ocr_model import TesseractOcrModel -from docling.pipeline.base_pipeline import PaginatedPipeline -from docling.utils.model_downloader import download_models -from docling.utils.profiling import ProfilingScope, TimeRecorder - -_log = logging.getLogger(__name__) - - -class StandardPdfPipeline(PaginatedPipeline): - _layout_model_path = LayoutModel._model_path - _table_model_path = TableStructureModel._model_path - - def __init__(self, pipeline_options: PdfPipelineOptions): - super().__init__(pipeline_options) - self.pipeline_options: PdfPipelineOptions - - artifacts_path: Optional[Path] = None - if pipeline_options.artifacts_path is not None: - artifacts_path = Path(pipeline_options.artifacts_path).expanduser() - - self.keep_images = ( - self.pipeline_options.generate_page_images - or self.pipeline_options.generate_picture_images - or self.pipeline_options.generate_table_images - ) - - self.glm_model = GlmModel(options=GlmOptions()) - - if (ocr_model := self.get_ocr_model(artifacts_path=artifacts_path)) is 
None: - raise RuntimeError( - f"The specified OCR kind is not supported: {pipeline_options.ocr_options.kind}." - ) - - self.build_pipe = [ - # Pre-processing - PagePreprocessingModel( - options=PagePreprocessingOptions( - images_scale=pipeline_options.images_scale - ) - ), - # OCR - ocr_model, - # Layout model - LayoutModel( - artifacts_path=artifacts_path, - accelerator_options=pipeline_options.accelerator_options, - ), - # Table structure model - TableStructureModel( - enabled=pipeline_options.do_table_structure, - artifacts_path=artifacts_path, - options=pipeline_options.table_structure_options, - accelerator_options=pipeline_options.accelerator_options, - ), - # Page assemble - PageAssembleModel(options=PageAssembleOptions()), - ] - - # Picture description model - if ( - picture_description_model := self.get_picture_description_model( - artifacts_path=artifacts_path - ) - ) is None: - raise RuntimeError( - f"The specified picture description kind is not supported: {pipeline_options.picture_description_options.kind}." 
- ) - - self.enrichment_pipe = [ - # Code Formula Enrichment Model - CodeFormulaModel( - enabled=pipeline_options.do_code_enrichment - or pipeline_options.do_formula_enrichment, - artifacts_path=artifacts_path, - options=CodeFormulaModelOptions( - do_code_enrichment=pipeline_options.do_code_enrichment, - do_formula_enrichment=pipeline_options.do_formula_enrichment, - ), - accelerator_options=pipeline_options.accelerator_options, - ), - # Document Picture Classifier - DocumentPictureClassifier( - enabled=pipeline_options.do_picture_classification, - artifacts_path=artifacts_path, - options=DocumentPictureClassifierOptions(), - accelerator_options=pipeline_options.accelerator_options, - ), - # Document Picture description - picture_description_model, - ] - - if ( - self.pipeline_options.do_formula_enrichment - or self.pipeline_options.do_code_enrichment - or self.pipeline_options.do_picture_description - ): - self.keep_backend = True - - @staticmethod - def download_models_hf( - local_dir: Optional[Path] = None, force: bool = False - ) -> Path: - warnings.warn( - "The usage of StandardPdfPipeline.download_models_hf() is deprecated " - "use instead the utility `docling-tools models download`, or " - "the upstream method docling.utils.models_downloader.download_all()", - DeprecationWarning, - stacklevel=3, - ) - - output_dir = download_models(output_dir=local_dir, force=force, progress=False) - return output_dir - - def get_ocr_model( - self, artifacts_path: Optional[Path] = None - ) -> Optional[BaseOcrModel]: - if isinstance(self.pipeline_options.ocr_options, EasyOcrOptions): - return EasyOcrModel( - enabled=self.pipeline_options.do_ocr, - artifacts_path=artifacts_path, - options=self.pipeline_options.ocr_options, - accelerator_options=self.pipeline_options.accelerator_options, - ) - elif isinstance(self.pipeline_options.ocr_options, TesseractCliOcrOptions): - return TesseractOcrCliModel( - enabled=self.pipeline_options.do_ocr, - 
options=self.pipeline_options.ocr_options, - ) - elif isinstance(self.pipeline_options.ocr_options, TesseractOcrOptions): - return TesseractOcrModel( - enabled=self.pipeline_options.do_ocr, - options=self.pipeline_options.ocr_options, - ) - elif isinstance(self.pipeline_options.ocr_options, RapidOcrOptions): - return RapidOcrModel( - enabled=self.pipeline_options.do_ocr, - options=self.pipeline_options.ocr_options, - accelerator_options=self.pipeline_options.accelerator_options, - ) - elif isinstance(self.pipeline_options.ocr_options, OcrMacOptions): - if "darwin" != sys.platform: - raise RuntimeError( - f"The specified OCR type is only supported on Mac: {self.pipeline_options.ocr_options.kind}." - ) - return OcrMacModel( - enabled=self.pipeline_options.do_ocr, - options=self.pipeline_options.ocr_options, - ) - return None - - def get_picture_description_model( - self, artifacts_path: Optional[Path] = None - ) -> Optional[PictureDescriptionBaseModel]: - if isinstance( - self.pipeline_options.picture_description_options, - PictureDescriptionApiOptions, - ): - return PictureDescriptionApiModel( - enabled=self.pipeline_options.do_picture_description, - options=self.pipeline_options.picture_description_options, - ) - elif isinstance( - self.pipeline_options.picture_description_options, - PictureDescriptionVlmOptions, - ): - return PictureDescriptionVlmModel( - enabled=self.pipeline_options.do_picture_description, - artifacts_path=artifacts_path, - options=self.pipeline_options.picture_description_options, - accelerator_options=self.pipeline_options.accelerator_options, - ) - return None - - def initialize_page(self, conv_res: ConversionResult, page: Page) -> Page: - with TimeRecorder(conv_res, "page_init"): - page._backend = conv_res.input._backend.load_page(page.page_no) # type: ignore - if page._backend is not None and page._backend.is_valid(): - page.size = page._backend.get_size() - - return page - - def _assemble_document(self, conv_res: ConversionResult) -> 
ConversionResult: - all_elements = [] - all_headers = [] - all_body = [] - - with TimeRecorder(conv_res, "doc_assemble", scope=ProfilingScope.DOCUMENT): - for p in conv_res.pages: - if p.assembled is not None: - for el in p.assembled.body: - all_body.append(el) - for el in p.assembled.headers: - all_headers.append(el) - for el in p.assembled.elements: - all_elements.append(el) - - conv_res.assembled = AssembledUnit( - elements=all_elements, headers=all_headers, body=all_body - ) - - conv_res.document = self.glm_model(conv_res) - - # Generate page images in the output - if self.pipeline_options.generate_page_images: - for page in conv_res.pages: - assert page.image is not None - page_no = page.page_no + 1 - conv_res.document.pages[page_no].image = ImageRef.from_pil( - page.image, dpi=int(72 * self.pipeline_options.images_scale) - ) - - # Generate images of the requested element types - if ( - self.pipeline_options.generate_picture_images - or self.pipeline_options.generate_table_images - ): - scale = self.pipeline_options.images_scale - for element, _level in conv_res.document.iterate_items(): - if not isinstance(element, DocItem) or len(element.prov) == 0: - continue - if ( - isinstance(element, PictureItem) - and self.pipeline_options.generate_picture_images - ) or ( - isinstance(element, TableItem) - and self.pipeline_options.generate_table_images - ): - page_ix = element.prov[0].page_no - 1 - page = conv_res.pages[page_ix] - assert page.size is not None - assert page.image is not None - - crop_bbox = ( - element.prov[0] - .bbox.scaled(scale=scale) - .to_top_left_origin(page_height=page.size.height * scale) - ) - - cropped_im = page.image.crop(crop_bbox.as_tuple()) - element.image = ImageRef.from_pil( - cropped_im, dpi=int(72 * scale) - ) - - return conv_res - - @classmethod - def get_default_options(cls) -> PdfPipelineOptions: - return PdfPipelineOptions() - - @classmethod - def is_backend_supported(cls, backend: AbstractDocumentBackend): - return 
isinstance(backend, PdfDocumentBackend) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/py.typed b/Paper2Video/src/evaluation/PresentQuiz/docling/py.typed deleted file mode 100644 index 8b137891791fe96927ad78e64b0aad7bded08bdc..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/py.typed +++ /dev/null @@ -1 +0,0 @@ - diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/__init__.py deleted file mode 100644 index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..0000000000000000000000000000000000000000 diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/accelerator_utils.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/accelerator_utils.py deleted file mode 100644 index 59b04796fb822794b745a59e671d5934f81d0ff4..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/accelerator_utils.py +++ /dev/null @@ -1,42 +0,0 @@ -import logging - -import torch - -from docling.datamodel.pipeline_options import AcceleratorDevice - -_log = logging.getLogger(__name__) - - -def decide_device(accelerator_device: AcceleratorDevice) -> str: - r""" - Resolve the device based on the acceleration options and the available devices in the system - Rules: - 1. AUTO: Check for the best available device on the system. - 2. User-defined: Check if the device actually exists, otherwise fall-back to CPU - """ - cuda_index = 0 - device = "cpu" - - has_cuda = torch.backends.cuda.is_built() and torch.cuda.is_available() - has_mps = torch.backends.mps.is_built() and torch.backends.mps.is_available() - - if accelerator_device == AcceleratorDevice.AUTO: - if has_cuda: - device = f"cuda:{cuda_index}" - elif has_mps: - device = "mps" - - else: - if accelerator_device == AcceleratorDevice.CUDA: - if has_cuda: - device = f"cuda:{cuda_index}" - else: - _log.warning("CUDA is not available in the system. 
Fall back to 'CPU'") - elif accelerator_device == AcceleratorDevice.MPS: - if has_mps: - device = "mps" - else: - _log.warning("MPS is not available in the system. Fall back to 'CPU'") - - _log.info("Accelerator device: '%s'", device) - return device diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/export.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/export.py deleted file mode 100644 index 5b022f4aac6ee51016bbe35c82204e7f1d914b74..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/export.py +++ /dev/null @@ -1,146 +0,0 @@ -import logging -from typing import Any, Dict, Iterable, List, Tuple, Union - -from docling_core.types.doc import BoundingBox, CoordOrigin -from docling_core.types.legacy_doc.base import BaseCell, BaseText, Ref, Table - -from docling.datamodel.base_models import OcrCell -from docling.datamodel.document import ConversionResult, Page - -_log = logging.getLogger(__name__) - - -def generate_multimodal_pages( - doc_result: ConversionResult, -) -> Iterable[Tuple[str, str, List[Dict[str, Any]], List[Dict[str, Any]], Page]]: - - label_to_doclaynet = { - "title": "title", - "table-of-contents": "document_index", - "subtitle-level-1": "section_header", - "checkbox-selected": "checkbox_selected", - "checkbox-unselected": "checkbox_unselected", - "caption": "caption", - "page-header": "page_header", - "page-footer": "page_footer", - "footnote": "footnote", - "table": "table", - "formula": "formula", - "list-item": "list_item", - "code": "code", - "figure": "picture", - "picture": "picture", - "reference": "text", - "paragraph": "text", - "text": "text", - } - - content_text = "" - page_no = 0 - start_ix = 0 - end_ix = 0 - doc_items: List[Tuple[int, Union[BaseCell, BaseText]]] = [] - - doc = doc_result.legacy_document - - def _process_page_segments(doc_items: list[Tuple[int, BaseCell]], page: Page): - segments = [] - - for ix, item in doc_items: - item_type = item.obj_type - label = 
label_to_doclaynet.get(item_type, None) - - if label is None or item.prov is None or page.size is None: - continue - - bbox = BoundingBox.from_tuple( - tuple(item.prov[0].bbox), origin=CoordOrigin.BOTTOMLEFT - ) - new_bbox = bbox.to_top_left_origin(page_height=page.size.height).normalized( - page_size=page.size - ) - - new_segment = { - "index_in_doc": ix, - "label": label, - "text": item.text if item.text is not None else "", - "bbox": new_bbox.as_tuple(), - "data": [], - } - - if isinstance(item, Table): - table_html = item.export_to_html() - new_segment["data"].append( - { - "html_seq": table_html, - "otsl_seq": "", - } - ) - - segments.append(new_segment) - - return segments - - def _process_page_cells(page: Page): - cells: List[dict] = [] - if page.size is None: - return cells - for cell in page.cells: - new_bbox = cell.bbox.to_top_left_origin( - page_height=page.size.height - ).normalized(page_size=page.size) - is_ocr = isinstance(cell, OcrCell) - ocr_confidence = cell.confidence if isinstance(cell, OcrCell) else 1.0 - cells.append( - { - "text": cell.text, - "bbox": new_bbox.as_tuple(), - "ocr": is_ocr, - "ocr_confidence": ocr_confidence, - } - ) - return cells - - def _process_page(): - page_ix = page_no - 1 - page = doc_result.pages[page_ix] - - page_cells = _process_page_cells(page=page) - page_segments = _process_page_segments(doc_items=doc_items, page=page) - content_md = doc.export_to_markdown( - main_text_start=start_ix, main_text_stop=end_ix - ) - # No page-tagging since we only do 1 page at the time - content_dt = doc.export_to_document_tokens( - main_text_start=start_ix, main_text_stop=end_ix, add_page_index=False - ) - - return content_text, content_md, content_dt, page_cells, page_segments, page - - if doc.main_text is None: - return - for ix, orig_item in enumerate(doc.main_text): - - item = doc._resolve_ref(orig_item) if isinstance(orig_item, Ref) else orig_item - if item is None or item.prov is None or len(item.prov) == 0: - 
_log.debug(f"Skipping item {orig_item}") - continue - - item_page = item.prov[0].page - - # Page is complete - if page_no > 0 and item_page > page_no: - yield _process_page() - - start_ix = ix - doc_items = [] - content_text = "" - - page_no = item_page - end_ix = ix - doc_items.append((ix, item)) - if item.text is not None and item.text != "": - content_text += item.text + " " - - if len(doc_items) > 0: - yield _process_page() diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/glm_utils.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/glm_utils.py deleted file mode 100644 index c3c43536c427207ec5e550ece2260172ab2c9c90..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/glm_utils.py +++ /dev/null @@ -1,361 +0,0 @@ -import re -from pathlib import Path -from typing import List - -import pandas as pd -from docling_core.types.doc import ( - BoundingBox, - CoordOrigin, - DocItemLabel, - DoclingDocument, - DocumentOrigin, - GroupLabel, - ProvenanceItem, - Size, - TableCell, - TableData, -) -from docling_core.types.doc.document import ContentLayer - - -def resolve_item(paths, obj): - """Find item in document from a reference path""" - - if len(paths) == 0: - return obj - - if paths[0] == "#": - return resolve_item(paths[1:], obj) - - try: - key = int(paths[0]) - except ValueError:  # path component is not a numeric index - key = paths[0] - - if len(paths) == 1: - if isinstance(key, str) and key in obj: - return obj[key] - elif isinstance(key, int) and key < len(obj): - return obj[key] - else: - return None - - elif len(paths) > 1: - if isinstance(key, str) and key in obj: - return resolve_item(paths[1:], obj[key]) - elif isinstance(key, int) and key < len(obj): - return resolve_item(paths[1:], obj[key]) - else: - return None - - else: - return None - - -def _flatten_table_grid(grid: List[List[dict]]) -> List[dict]: - unique_objects = [] - seen_spans = set() - - for sublist in grid: - for obj in sublist: - # Convert the spans list to a tuple of tuples 
for hashing - spans_tuple = tuple(tuple(span) for span in obj["spans"]) - if spans_tuple not in seen_spans: - seen_spans.add(spans_tuple) - unique_objects.append(obj) - - return unique_objects - - -def to_docling_document(doc_glm, update_name_label=False) -> DoclingDocument: - origin = DocumentOrigin( - mimetype="application/pdf", - filename=doc_glm["file-info"]["filename"], - binary_hash=doc_glm["file-info"]["document-hash"], - ) - doc_name = Path(origin.filename).stem - - doc: DoclingDocument = DoclingDocument(name=doc_name, origin=origin) - - for page_dim in doc_glm["page-dimensions"]: - page_no = int(page_dim["page"]) - size = Size(width=page_dim["width"], height=page_dim["height"]) - - doc.add_page(page_no=page_no, size=size) - - if "properties" in doc_glm: - props = pd.DataFrame( - doc_glm["properties"]["data"], columns=doc_glm["properties"]["headers"] - ) - else: - props = pd.DataFrame() - - current_list = None - - for ix, pelem in enumerate(doc_glm["page-elements"]): - ptype = pelem["type"] - span_i = pelem["span"][0] - span_j = pelem["span"][1] - - if "iref" not in pelem: - # print(json.dumps(pelem, indent=2)) - continue - - iref = pelem["iref"] - - if re.match("#/figures/(\\d+)/captions/(.+)", iref): - # print(f"skip {iref}") - continue - - if re.match("#/tables/(\\d+)/captions/(.+)", iref): - # print(f"skip {iref}") - continue - - path = iref.split("/") - obj = resolve_item(path, doc_glm) - - if obj is None: - current_list = None - print(f"warning: undefined {path}") - continue - - if ptype == "figure": - current_list = None - text = "" - caption_refs = [] - for caption in obj["captions"]: - text += caption["text"] - - for nprov in caption["prov"]: - npaths = nprov["$ref"].split("/") - nelem = resolve_item(npaths, doc_glm) - - if nelem is None: - # print(f"warning: undefined caption {npaths}") - continue - - span_i = nelem["span"][0] - span_j = nelem["span"][1] - - cap_text = caption["text"][span_i:span_j] - - # doc_glm["page-elements"].remove(nelem) - - 
prov = ProvenanceItem( - page_no=nelem["page"], - charspan=tuple(nelem["span"]), - bbox=BoundingBox.from_tuple( - nelem["bbox"], origin=CoordOrigin.BOTTOMLEFT - ), - ) - - caption_obj = doc.add_text( - label=DocItemLabel.CAPTION, text=cap_text, prov=prov - ) - caption_refs.append(caption_obj.get_ref()) - - prov = ProvenanceItem( - page_no=pelem["page"], - charspan=(0, len(text)), - bbox=BoundingBox.from_tuple( - pelem["bbox"], origin=CoordOrigin.BOTTOMLEFT - ), - ) - - pic = doc.add_picture(prov=prov) - pic.captions.extend(caption_refs) - _add_child_elements(pic, doc, obj, pelem) - - elif ptype == "table": - current_list = None - text = "" - caption_refs = [] - item_label = DocItemLabel(pelem["name"]) - - for caption in obj["captions"]: - text += caption["text"] - - for nprov in caption["prov"]: - npaths = nprov["$ref"].split("/") - nelem = resolve_item(npaths, doc_glm) - - if nelem is None: - # print(f"warning: undefined caption {npaths}") - continue - - span_i = nelem["span"][0] - span_j = nelem["span"][1] - - cap_text = caption["text"][span_i:span_j] - - # doc_glm["page-elements"].remove(nelem) - - prov = ProvenanceItem( - page_no=nelem["page"], - charspan=tuple(nelem["span"]), - bbox=BoundingBox.from_tuple( - nelem["bbox"], origin=CoordOrigin.BOTTOMLEFT - ), - ) - - caption_obj = doc.add_text( - label=DocItemLabel.CAPTION, text=cap_text, prov=prov - ) - caption_refs.append(caption_obj.get_ref()) - - table_cells_glm = _flatten_table_grid(obj["data"]) - - table_cells = [] - for tbl_cell_glm in table_cells_glm: - if tbl_cell_glm["bbox"] is not None: - bbox = BoundingBox.from_tuple( - tbl_cell_glm["bbox"], origin=CoordOrigin.BOTTOMLEFT - ) - else: - bbox = None - - is_col_header = False - is_row_header = False - is_row_section = False - - if tbl_cell_glm["type"] == "col_header": - is_col_header = True - elif tbl_cell_glm["type"] == "row_header": - is_row_header = True - elif tbl_cell_glm["type"] == "row_section": - is_row_section = True - - table_cells.append( - 
TableCell( - row_span=tbl_cell_glm["row-span"][1] - - tbl_cell_glm["row-span"][0], - col_span=tbl_cell_glm["col-span"][1] - - tbl_cell_glm["col-span"][0], - start_row_offset_idx=tbl_cell_glm["row-span"][0], - end_row_offset_idx=tbl_cell_glm["row-span"][1], - start_col_offset_idx=tbl_cell_glm["col-span"][0], - end_col_offset_idx=tbl_cell_glm["col-span"][1], - text=tbl_cell_glm["text"], - bbox=bbox, - column_header=is_col_header, - row_header=is_row_header, - row_section=is_row_section, - ) - ) - - tbl_data = TableData( - num_rows=obj.get("#-rows", 0), - num_cols=obj.get("#-cols", 0), - table_cells=table_cells, - ) - - prov = ProvenanceItem( - page_no=pelem["page"], - charspan=(0, 0), - bbox=BoundingBox.from_tuple( - pelem["bbox"], origin=CoordOrigin.BOTTOMLEFT - ), - ) - - tbl = doc.add_table(data=tbl_data, prov=prov, label=item_label) - tbl.captions.extend(caption_refs) - - elif ptype in [DocItemLabel.FORM.value, DocItemLabel.KEY_VALUE_REGION.value]: - label = DocItemLabel(ptype) - group_label = GroupLabel.UNSPECIFIED - if label == DocItemLabel.FORM: - group_label = GroupLabel.FORM_AREA - elif label == DocItemLabel.KEY_VALUE_REGION: - group_label = GroupLabel.KEY_VALUE_AREA - - container_el = doc.add_group(label=group_label) - - _add_child_elements(container_el, doc, obj, pelem) - elif "text" in obj: - text = obj["text"][span_i:span_j] - - type_label = pelem["type"] - name_label = pelem["name"] - if update_name_label and len(props) > 0 and type_label == "paragraph": - prop = props[ - (props["type"] == "semantic") & (props["subj_path"] == iref) - ] - if len(prop) == 1 and prop.iloc[0]["confidence"] > 0.85: - name_label = prop.iloc[0]["label"] - - prov = ProvenanceItem( - page_no=pelem["page"], - charspan=(0, len(text)), - bbox=BoundingBox.from_tuple( - pelem["bbox"], origin=CoordOrigin.BOTTOMLEFT - ), - ) - label = DocItemLabel(name_label) - - if label == DocItemLabel.LIST_ITEM: - if current_list is None: - current_list = doc.add_group(label=GroupLabel.LIST, 
name="list") - - # TODO: Infer if this is a numbered or a bullet list item - doc.add_list_item( - text=text, enumerated=False, prov=prov, parent=current_list - ) - elif label == DocItemLabel.SECTION_HEADER: - current_list = None - - doc.add_heading(text=text, prov=prov) - elif label == DocItemLabel.CODE: - current_list = None - - doc.add_code(text=text, prov=prov) - elif label == DocItemLabel.FORMULA: - current_list = None - - doc.add_text(label=DocItemLabel.FORMULA, text="", orig=text, prov=prov) - elif label in [DocItemLabel.PAGE_HEADER, DocItemLabel.PAGE_FOOTER]: - current_list = None - - doc.add_text( - label=DocItemLabel(name_label), - text=text, - prov=prov, - content_layer=ContentLayer.FURNITURE, - ) - else: - current_list = None - - doc.add_text(label=DocItemLabel(name_label), text=text, prov=prov) - - return doc - - -def _add_child_elements(container_el, doc, obj, pelem): - payload = obj.get("payload") - if payload is not None: - children = payload.get("children", []) - - for child in children: - c_label = DocItemLabel(child["label"]) - c_bbox = BoundingBox.model_validate(child["bbox"]).to_bottom_left_origin( - doc.pages[pelem["page"]].size.height - ) - c_text = " ".join( - [ - cell["text"].replace("\x02", "-").strip() - for cell in child["cells"] - if len(cell["text"].strip()) > 0 - ] - ) - - c_prov = ProvenanceItem( - page_no=pelem["page"], charspan=(0, len(c_text)), bbox=c_bbox - ) - if c_label == DocItemLabel.LIST_ITEM: - # TODO: Infer if this is a numbered or a bullet list item - doc.add_list_item(parent=container_el, text=c_text, prov=c_prov) - elif c_label == DocItemLabel.SECTION_HEADER: - doc.add_heading(parent=container_el, text=c_text, prov=c_prov) - else: - doc.add_text( - parent=container_el, label=c_label, text=c_text, prov=c_prov - ) diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/layout_postprocessor.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/layout_postprocessor.py deleted file mode 100644 index 
8cb6bc550d744a9402b4ed03a270f81e5a2919da..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/layout_postprocessor.py +++ /dev/null @@ -1,666 +0,0 @@ -import bisect -import logging -import sys -from collections import defaultdict -from typing import Dict, List, Set, Tuple - -from docling_core.types.doc import DocItemLabel, Size -from rtree import index - -from docling.datamodel.base_models import BoundingBox, Cell, Cluster, OcrCell - -_log = logging.getLogger(__name__) - - -class UnionFind: - """Efficient Union-Find data structure for grouping elements.""" - - def __init__(self, elements): - self.parent = {elem: elem for elem in elements} - self.rank = {elem: 0 for elem in elements} - - def find(self, x): - if self.parent[x] != x: - self.parent[x] = self.find(self.parent[x]) # Path compression - return self.parent[x] - - def union(self, x, y): - root_x, root_y = self.find(x), self.find(y) - if root_x == root_y: - return - - if self.rank[root_x] > self.rank[root_y]: - self.parent[root_y] = root_x - elif self.rank[root_x] < self.rank[root_y]: - self.parent[root_x] = root_y - else: - self.parent[root_y] = root_x - self.rank[root_x] += 1 - - def get_groups(self) -> Dict[int, List[int]]: - """Returns groups as {root: [elements]}.""" - groups = defaultdict(list) - for elem in self.parent: - groups[self.find(elem)].append(elem) - return groups - - -class SpatialClusterIndex: - """Efficient spatial indexing for clusters using R-tree and interval trees.""" - - def __init__(self, clusters: List[Cluster]): - p = index.Property() - p.dimension = 2 - self.spatial_index = index.Index(properties=p) - self.x_intervals = IntervalTree() - self.y_intervals = IntervalTree() - self.clusters_by_id: Dict[int, Cluster] = {} - - for cluster in clusters: - self.add_cluster(cluster) - - def add_cluster(self, cluster: Cluster): - bbox = cluster.bbox - self.spatial_index.insert(cluster.id, bbox.as_tuple()) - self.x_intervals.insert(bbox.l, 
bbox.r, cluster.id) - self.y_intervals.insert(bbox.t, bbox.b, cluster.id) - self.clusters_by_id[cluster.id] = cluster - - def remove_cluster(self, cluster: Cluster): - self.spatial_index.delete(cluster.id, cluster.bbox.as_tuple()) - del self.clusters_by_id[cluster.id] - - def find_candidates(self, bbox: BoundingBox) -> Set[int]: - """Find potential overlapping cluster IDs using all indexes.""" - spatial = set(self.spatial_index.intersection(bbox.as_tuple())) - x_candidates = self.x_intervals.find_containing( - bbox.l - ) | self.x_intervals.find_containing(bbox.r) - y_candidates = self.y_intervals.find_containing( - bbox.t - ) | self.y_intervals.find_containing(bbox.b) - return spatial.union(x_candidates).union(y_candidates) - - def check_overlap( - self, - bbox1: BoundingBox, - bbox2: BoundingBox, - overlap_threshold: float, - containment_threshold: float, - ) -> bool: - """Check if two bboxes overlap sufficiently.""" - area1, area2 = bbox1.area(), bbox2.area() - if area1 <= 0 or area2 <= 0: - return False - - overlap_area = bbox1.intersection_area_with(bbox2) - if overlap_area <= 0: - return False - - iou = overlap_area / (area1 + area2 - overlap_area) - containment1 = overlap_area / area1 - containment2 = overlap_area / area2 - - return ( - iou > overlap_threshold - or containment1 > containment_threshold - or containment2 > containment_threshold - ) - - -class Interval: - """Helper class for sortable intervals.""" - - def __init__(self, min_val: float, max_val: float, id: int): - self.min_val = min_val - self.max_val = max_val - self.id = id - - def __lt__(self, other): - if isinstance(other, Interval): - return self.min_val < other.min_val - return self.min_val < other - - -class IntervalTree: - """Memory-efficient interval tree for 1D overlap queries.""" - - def __init__(self): - self.intervals: List[Interval] = [] # Sorted by min_val - - def insert(self, min_val: float, max_val: float, id: int): - interval = Interval(min_val, max_val, id) - 
bisect.insort(self.intervals, interval) - - def find_containing(self, point: float) -> Set[int]: - """Find all intervals containing the point.""" - pos = bisect.bisect_left(self.intervals, point) - result = set() - - # Check intervals starting before point - for interval in reversed(self.intervals[:pos]): - if interval.min_val <= point <= interval.max_val: - result.add(interval.id) - else: - break - - # Check intervals starting at/after point - for interval in self.intervals[pos:]: - if point <= interval.max_val: - if interval.min_val <= point: - result.add(interval.id) - else: - break - - return result - - -class LayoutPostprocessor: - """Postprocesses layout predictions by cleaning up clusters and mapping cells.""" - - # Cluster type-specific parameters for overlap resolution - OVERLAP_PARAMS = { - "regular": {"area_threshold": 1.3, "conf_threshold": 0.05}, - "picture": {"area_threshold": 2.0, "conf_threshold": 0.3}, - "wrapper": {"area_threshold": 2.0, "conf_threshold": 0.2}, - } - - WRAPPER_TYPES = { - DocItemLabel.FORM, - DocItemLabel.KEY_VALUE_REGION, - DocItemLabel.TABLE, - DocItemLabel.DOCUMENT_INDEX, - } - SPECIAL_TYPES = WRAPPER_TYPES.union({DocItemLabel.PICTURE}) - - CONFIDENCE_THRESHOLDS = { - DocItemLabel.CAPTION: 0.5, - DocItemLabel.FOOTNOTE: 0.5, - DocItemLabel.FORMULA: 0.5, - DocItemLabel.LIST_ITEM: 0.5, - DocItemLabel.PAGE_FOOTER: 0.5, - DocItemLabel.PAGE_HEADER: 0.5, - DocItemLabel.PICTURE: 0.5, - DocItemLabel.SECTION_HEADER: 0.45, - DocItemLabel.TABLE: 0.5, - DocItemLabel.TEXT: 0.5, # 0.45, - DocItemLabel.TITLE: 0.45, - DocItemLabel.CODE: 0.45, - DocItemLabel.CHECKBOX_SELECTED: 0.45, - DocItemLabel.CHECKBOX_UNSELECTED: 0.45, - DocItemLabel.FORM: 0.45, - DocItemLabel.KEY_VALUE_REGION: 0.45, - DocItemLabel.DOCUMENT_INDEX: 0.45, - } - - LABEL_REMAPPING = { - # DocItemLabel.DOCUMENT_INDEX: DocItemLabel.TABLE, - DocItemLabel.TITLE: DocItemLabel.SECTION_HEADER, - } - - def __init__(self, cells: List[Cell], clusters: List[Cluster], page_size: Size): - 
"""Initialize processor with cells, clusters, and spatial indices.""" - self.cells = cells - self.page_size = page_size - self.regular_clusters = [ - c for c in clusters if c.label not in self.SPECIAL_TYPES - ] - self.special_clusters = [c for c in clusters if c.label in self.SPECIAL_TYPES] - - # Build spatial indices once - self.regular_index = SpatialClusterIndex(self.regular_clusters) - self.picture_index = SpatialClusterIndex( - [c for c in self.special_clusters if c.label == DocItemLabel.PICTURE] - ) - self.wrapper_index = SpatialClusterIndex( - [c for c in self.special_clusters if c.label in self.WRAPPER_TYPES] - ) - - def postprocess(self) -> Tuple[List[Cluster], List[Cell]]: - """Main processing pipeline.""" - self.regular_clusters = self._process_regular_clusters() - self.special_clusters = self._process_special_clusters() - - # Remove regular clusters that are included in wrappers - contained_ids = { - child.id - for wrapper in self.special_clusters - if wrapper.label in self.SPECIAL_TYPES - for child in wrapper.children - } - self.regular_clusters = [ - c for c in self.regular_clusters if c.id not in contained_ids - ] - - # Combine and sort final clusters - final_clusters = self._sort_clusters( - self.regular_clusters + self.special_clusters, mode="id" - ) - for cluster in final_clusters: - cluster.cells = self._sort_cells(cluster.cells) - # Also sort cells in children if any - for child in cluster.children: - child.cells = self._sort_cells(child.cells) - - return final_clusters, self.cells - - def _process_regular_clusters(self) -> List[Cluster]: - """Process regular clusters with iterative refinement.""" - clusters = [ - c - for c in self.regular_clusters - if c.confidence >= self.CONFIDENCE_THRESHOLDS[c.label] - ] - - # Apply label remapping - for cluster in clusters: - if cluster.label in self.LABEL_REMAPPING: - cluster.label = self.LABEL_REMAPPING[cluster.label] - - # Initial cell assignment - clusters = 
self._assign_cells_to_clusters(clusters) - - # Remove clusters with no cells - clusters = [cluster for cluster in clusters if cluster.cells] - - # Handle orphaned cells - unassigned = self._find_unassigned_cells(clusters) - if unassigned: - next_id = max((c.id for c in clusters), default=0) + 1 - orphan_clusters = [] - for i, cell in enumerate(unassigned): - conf = 1.0 - if isinstance(cell, OcrCell): - conf = cell.confidence - - orphan_clusters.append( - Cluster( - id=next_id + i, - label=DocItemLabel.TEXT, - bbox=cell.bbox, - confidence=conf, - cells=[cell], - ) - ) - clusters.extend(orphan_clusters) - - # Iterative refinement - prev_count = len(clusters) + 1 - for _ in range(3): # Maximum 3 iterations - if prev_count == len(clusters): - break - prev_count = len(clusters) - clusters = self._adjust_cluster_bboxes(clusters) - clusters = self._remove_overlapping_clusters(clusters, "regular") - - return clusters - - def _process_special_clusters(self) -> List[Cluster]: - special_clusters = [ - c - for c in self.special_clusters - if c.confidence >= self.CONFIDENCE_THRESHOLDS[c.label] - ] - - special_clusters = self._handle_cross_type_overlaps(special_clusters) - - # Calculate page area from known page size - page_area = self.page_size.width * self.page_size.height - if page_area > 0: - # Filter out full-page pictures - special_clusters = [ - cluster - for cluster in special_clusters - if not ( - cluster.label == DocItemLabel.PICTURE - and cluster.bbox.area() / page_area > 0.90 - ) - ] - - for special in special_clusters: - contained = [] - for cluster in self.regular_clusters: - overlap = cluster.bbox.intersection_area_with(special.bbox) - if overlap > 0: - containment = overlap / cluster.bbox.area() - if containment > 0.8: - contained.append(cluster) - - if contained: - # Sort contained clusters by minimum cell ID: - contained = self._sort_clusters(contained, mode="id") - special.children = contained - - # Adjust bbox only for Form and Key-Value-Region, not Table or 
Picture - if special.label in [DocItemLabel.FORM, DocItemLabel.KEY_VALUE_REGION]: - special.bbox = BoundingBox( - l=min(c.bbox.l for c in contained), - t=min(c.bbox.t for c in contained), - r=max(c.bbox.r for c in contained), - b=max(c.bbox.b for c in contained), - ) - - # Collect all cells from children - all_cells = [] - for child in contained: - all_cells.extend(child.cells) - special.cells = self._deduplicate_cells(all_cells) - special.cells = self._sort_cells(special.cells) - - picture_clusters = [ - c for c in special_clusters if c.label == DocItemLabel.PICTURE - ] - picture_clusters = self._remove_overlapping_clusters( - picture_clusters, "picture" - ) - - wrapper_clusters = [ - c for c in special_clusters if c.label in self.WRAPPER_TYPES - ] - wrapper_clusters = self._remove_overlapping_clusters( - wrapper_clusters, "wrapper" - ) - - return picture_clusters + wrapper_clusters - - def _handle_cross_type_overlaps(self, special_clusters) -> List[Cluster]: - """Handle overlaps between regular and wrapper clusters before child assignment. - - In particular, KEY_VALUE_REGION proposals that are almost identical to a TABLE - should be removed. - """ - wrappers_to_remove = set() - - for wrapper in special_clusters: - if wrapper.label not in self.WRAPPER_TYPES: - continue # only treat KEY_VALUE_REGION for now. 
- - for regular in self.regular_clusters: - if regular.label == DocItemLabel.TABLE: - # Calculate overlap - overlap = regular.bbox.intersection_area_with(wrapper.bbox) - wrapper_area = wrapper.bbox.area() - overlap_ratio = overlap / wrapper_area - - conf_diff = wrapper.confidence - regular.confidence - - # If wrapper is mostly overlapping with a TABLE, remove the wrapper - if ( - overlap_ratio > 0.9 and conf_diff < 0.1 - ): # self.OVERLAP_PARAMS["wrapper"]["conf_threshold"]): # 80% overlap threshold - wrappers_to_remove.add(wrapper.id) - break - - # Filter out the identified wrappers - special_clusters = [ - cluster - for cluster in special_clusters - if cluster.id not in wrappers_to_remove - ] - - return special_clusters - - def _should_prefer_cluster( - self, candidate: Cluster, other: Cluster, params: dict - ) -> bool: - """Determine if candidate cluster should be preferred over other cluster based on rules. - Returns True if candidate should be preferred, False if not.""" - - # Rule 1: LIST_ITEM vs TEXT - if ( - candidate.label == DocItemLabel.LIST_ITEM - and other.label == DocItemLabel.TEXT - ): - # Check if areas are similar (within 20% of each other) - area_ratio = candidate.bbox.area() / other.bbox.area() - area_similarity = abs(1 - area_ratio) < 0.2 - if area_similarity: - return True - - # Rule 2: CODE vs others - if candidate.label == DocItemLabel.CODE: - # Calculate how much of the other cluster is contained within the CODE cluster - overlap = other.bbox.intersection_area_with(candidate.bbox) - containment = overlap / other.bbox.area() - if containment > 0.8: # other is 80% contained within CODE - return True - - # If no label-based rules matched, fall back to area/confidence thresholds - area_ratio = candidate.bbox.area() / other.bbox.area() - conf_diff = other.confidence - candidate.confidence - - if ( - area_ratio <= params["area_threshold"] - and conf_diff > params["conf_threshold"] - ): - return False - - return True # Default to keeping candidate 
if no rules triggered rejection - - def _select_best_cluster_from_group( - self, - group_clusters: List[Cluster], - params: dict, - ) -> Cluster: - """Select best cluster from a group of overlapping clusters based on all rules.""" - current_best = None - - for candidate in group_clusters: - should_select = True - - for other in group_clusters: - if other == candidate: - continue - - if not self._should_prefer_cluster(candidate, other, params): - should_select = False - break - - if should_select: - if current_best is None: - current_best = candidate - else: - # If both clusters pass rules, prefer the larger one unless confidence differs significantly - if ( - candidate.bbox.area() > current_best.bbox.area() - and current_best.confidence - candidate.confidence - <= params["conf_threshold"] - ): - current_best = candidate - - return current_best if current_best else group_clusters[0] - - def _remove_overlapping_clusters( - self, - clusters: List[Cluster], - cluster_type: str, - overlap_threshold: float = 0.8, - containment_threshold: float = 0.8, - ) -> List[Cluster]: - if not clusters: - return [] - - spatial_index = ( - self.regular_index - if cluster_type == "regular" - else self.picture_index if cluster_type == "picture" else self.wrapper_index - ) - - # Map of currently valid clusters - valid_clusters = {c.id: c for c in clusters} - uf = UnionFind(valid_clusters.keys()) - params = self.OVERLAP_PARAMS[cluster_type] - - for cluster in clusters: - candidates = spatial_index.find_candidates(cluster.bbox) - candidates &= valid_clusters.keys() # Only keep existing candidates - candidates.discard(cluster.id) - - for other_id in candidates: - if spatial_index.check_overlap( - cluster.bbox, - valid_clusters[other_id].bbox, - overlap_threshold, - containment_threshold, - ): - uf.union(cluster.id, other_id) - - result = [] - for group in uf.get_groups().values(): - if len(group) == 1: - result.append(valid_clusters[group[0]]) - continue - - group_clusters = 
[valid_clusters[cid] for cid in group] - best = self._select_best_cluster_from_group(group_clusters, params) - - # Simple cell merging - no special cases - for cluster in group_clusters: - if cluster != best: - best.cells.extend(cluster.cells) - - best.cells = self._deduplicate_cells(best.cells) - best.cells = self._sort_cells(best.cells) - result.append(best) - - return result - - def _select_best_cluster( - self, - clusters: List[Cluster], - area_threshold: float, - conf_threshold: float, - ) -> Cluster: - """Iteratively select best cluster based on area and confidence thresholds.""" - current_best = None - for candidate in clusters: - should_select = True - for other in clusters: - if other == candidate: - continue - - area_ratio = candidate.bbox.area() / other.bbox.area() - conf_diff = other.confidence - candidate.confidence - - if area_ratio <= area_threshold and conf_diff > conf_threshold: - should_select = False - break - - if should_select: - if current_best is None or ( - candidate.bbox.area() > current_best.bbox.area() - and current_best.confidence - candidate.confidence <= conf_threshold - ): - current_best = candidate - - return current_best if current_best else clusters[0] - - def _deduplicate_cells(self, cells: List[Cell]) -> List[Cell]: - """Ensure each cell appears only once, maintaining order of first appearance.""" - seen_ids = set() - unique_cells = [] - for cell in cells: - if cell.id not in seen_ids: - seen_ids.add(cell.id) - unique_cells.append(cell) - return unique_cells - - def _assign_cells_to_clusters( - self, clusters: List[Cluster], min_overlap: float = 0.2 - ) -> List[Cluster]: - """Assign cells to best overlapping cluster.""" - for cluster in clusters: - cluster.cells = [] - - for cell in self.cells: - if not cell.text.strip(): - continue - - best_overlap = min_overlap - best_cluster = None - - for cluster in clusters: - if cell.bbox.area() <= 0: - continue - - overlap = cell.bbox.intersection_area_with(cluster.bbox) - overlap_ratio = 
overlap / cell.bbox.area() - - if overlap_ratio > best_overlap: - best_overlap = overlap_ratio - best_cluster = cluster - - if best_cluster is not None: - best_cluster.cells.append(cell) - - # Deduplicate cells in each cluster after assignment - for cluster in clusters: - cluster.cells = self._deduplicate_cells(cluster.cells) - - return clusters - - def _find_unassigned_cells(self, clusters: List[Cluster]) -> List[Cell]: - """Find cells not assigned to any cluster.""" - assigned = {cell.id for cluster in clusters for cell in cluster.cells} - return [ - cell for cell in self.cells if cell.id not in assigned and cell.text.strip() - ] - - def _adjust_cluster_bboxes(self, clusters: List[Cluster]) -> List[Cluster]: - """Adjust cluster bounding boxes to contain their cells.""" - for cluster in clusters: - if not cluster.cells: - continue - - cells_bbox = BoundingBox( - l=min(cell.bbox.l for cell in cluster.cells), - t=min(cell.bbox.t for cell in cluster.cells), - r=max(cell.bbox.r for cell in cluster.cells), - b=max(cell.bbox.b for cell in cluster.cells), - ) - - if cluster.label == DocItemLabel.TABLE: - # For tables, take union of current bbox and cells bbox - cluster.bbox = BoundingBox( - l=min(cluster.bbox.l, cells_bbox.l), - t=min(cluster.bbox.t, cells_bbox.t), - r=max(cluster.bbox.r, cells_bbox.r), - b=max(cluster.bbox.b, cells_bbox.b), - ) - else: - cluster.bbox = cells_bbox - - return clusters - - def _sort_cells(self, cells: List[Cell]) -> List[Cell]: - """Sort cells in native reading order.""" - return sorted(cells, key=lambda c: (c.id)) - - def _sort_clusters( - self, clusters: List[Cluster], mode: str = "id" - ) -> List[Cluster]: - """Sort clusters in reading order (top-to-bottom, left-to-right).""" - if mode == "id": # sort in the order the cells are printed in the PDF. 
- return sorted( - clusters, - key=lambda cluster: ( - ( - min(cell.id for cell in cluster.cells) - if cluster.cells - else sys.maxsize - ), - cluster.bbox.t, - cluster.bbox.l, - ), - ) - elif mode == "tblr": # Sort top-to-bottom, then left-to-right ("row first") - return sorted( - clusters, key=lambda cluster: (cluster.bbox.t, cluster.bbox.l) - ) - elif mode == "lrtb": # Sort left-to-right, then top-to-bottom ("column first") - return sorted( - clusters, key=lambda cluster: (cluster.bbox.l, cluster.bbox.t) - ) - else: - return clusters diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/model_downloader.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/model_downloader.py deleted file mode 100644 index 7d22b77b8a6c4642bb0284fbaf73f6b8c996934e..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/model_downloader.py +++ /dev/null @@ -1,84 +0,0 @@ -import logging -from pathlib import Path -from typing import Optional - -from docling.datamodel.pipeline_options import smolvlm_picture_description -from docling.datamodel.settings import settings -from docling.models.code_formula_model import CodeFormulaModel -from docling.models.document_picture_classifier import DocumentPictureClassifier -from docling.models.easyocr_model import EasyOcrModel -from docling.models.layout_model import LayoutModel -from docling.models.picture_description_vlm_model import PictureDescriptionVlmModel -from docling.models.table_structure_model import TableStructureModel - -_log = logging.getLogger(__name__) - - -def download_models( - output_dir: Optional[Path] = None, - *, - force: bool = False, - progress: bool = False, - with_layout: bool = True, - with_tableformer: bool = True, - with_code_formula: bool = True, - with_picture_classifier: bool = True, - with_smolvlm: bool = True, - with_easyocr: bool = True, -): - if output_dir is None: - output_dir = settings.cache_dir / "models" - - # Make sure the folder exists - 
output_dir.mkdir(exist_ok=True, parents=True) - - if with_layout: - _log.info("Downloading layout model...") - LayoutModel.download_models( - local_dir=output_dir / LayoutModel._model_repo_folder, - force=force, - progress=progress, - ) - - if with_tableformer: - _log.info("Downloading tableformer model...") - TableStructureModel.download_models( - local_dir=output_dir / TableStructureModel._model_repo_folder, - force=force, - progress=progress, - ) - - if with_picture_classifier: - _log.info("Downloading picture classifier model...") - DocumentPictureClassifier.download_models( - local_dir=output_dir / DocumentPictureClassifier._model_repo_folder, - force=force, - progress=progress, - ) - - if with_code_formula: - _log.info("Downloading code formula model...") - CodeFormulaModel.download_models( - local_dir=output_dir / CodeFormulaModel._model_repo_folder, - force=force, - progress=progress, - ) - - if with_smolvlm: - _log.info("Downloading SmolVlm model...") - PictureDescriptionVlmModel.download_models( - repo_id=smolvlm_picture_description.repo_id, - local_dir=output_dir / smolvlm_picture_description.repo_cache_folder, - force=force, - progress=progress, - ) - - if with_easyocr: - _log.info("Downloading easyocr models...") - EasyOcrModel.download_models( - local_dir=output_dir / EasyOcrModel._model_repo_folder, - force=force, - progress=progress, - ) - - return output_dir diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/ocr_utils.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/ocr_utils.py deleted file mode 100644 index 59503f1f80d2f8ce47d14f42ee7a4297d4dfd12c..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/ocr_utils.py +++ /dev/null @@ -1,9 +0,0 @@ -def map_tesseract_script(script: str) -> str: - r"""Map a Tesseract script name to its language pack name.""" - if script == "Katakana" or script == "Hiragana": - script = "Japanese" - elif script == "Han": - script = "HanS" - elif script == "Korean": - script = "Hangul" - return 
script diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/profiling.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/profiling.py deleted file mode 100644 index 0d09f17d32600a8a6a7a3c830a0067b90864d38f..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/profiling.py +++ /dev/null @@ -1,62 +0,0 @@ -import time -from datetime import datetime -from enum import Enum -from typing import TYPE_CHECKING, List - -import numpy as np -from pydantic import BaseModel - -from docling.datamodel.settings import settings - -if TYPE_CHECKING: - from docling.datamodel.document import ConversionResult - - -class ProfilingScope(str, Enum): - PAGE = "page" - DOCUMENT = "document" - - -class ProfilingItem(BaseModel): - scope: ProfilingScope - count: int = 0 - times: List[float] = [] - start_timestamps: List[datetime] = [] - - def avg(self) -> float: - return np.average(self.times) # type: ignore - - def std(self) -> float: - return np.std(self.times) # type: ignore - - def mean(self) -> float: - return np.mean(self.times) # type: ignore - - def percentile(self, perc: float) -> float: - return np.percentile(self.times, perc) # type: ignore - - -class TimeRecorder: - def __init__( - self, - conv_res: "ConversionResult", - key: str, - scope: ProfilingScope = ProfilingScope.PAGE, - ): - if settings.debug.profile_pipeline_timings: - if key not in conv_res.timings.keys(): - conv_res.timings[key] = ProfilingItem(scope=scope) - self.conv_res = conv_res - self.key = key - - def __enter__(self): - if settings.debug.profile_pipeline_timings: - self.start = time.monotonic() - self.conv_res.timings[self.key].start_timestamps.append(datetime.utcnow()) - return self - - def __exit__(self, *args): - if settings.debug.profile_pipeline_timings: - elapsed = time.monotonic() - self.start - self.conv_res.timings[self.key].times.append(elapsed) - self.conv_res.timings[self.key].count += 1 diff --git 
a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/utils.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/utils.py deleted file mode 100644 index 1261f8608fee4abe2d087e15132dd9e2c14d3832..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/utils.py +++ /dev/null @@ -1,65 +0,0 @@ -import hashlib -from io import BytesIO -from itertools import islice -from pathlib import Path -from typing import List, Union - -import requests -from tqdm import tqdm - - -def chunkify(iterator, chunk_size): - """Yield successive chunks of chunk_size from the iterable.""" - if isinstance(iterator, List): - iterator = iter(iterator) - for first in iterator: # Take the first element from the iterator - yield [first] + list(islice(iterator, chunk_size - 1)) - - -def create_file_hash(path_or_stream: Union[BytesIO, Path]) -> str: - """Create a stable page_hash of the path_or_stream of a file""" - - block_size = 65536 - hasher = hashlib.sha256() - - def _hash_buf(binary_stream): - buf = binary_stream.read(block_size) # read and page_hash in chunks - while len(buf) > 0: - hasher.update(buf) - buf = binary_stream.read(block_size) - - if isinstance(path_or_stream, Path): - with path_or_stream.open("rb") as afile: - _hash_buf(afile) - elif isinstance(path_or_stream, BytesIO): - _hash_buf(path_or_stream) - - return hasher.hexdigest() - - -def create_hash(string: str): - hasher = hashlib.sha256() - hasher.update(string.encode("utf-8")) - - return hasher.hexdigest() - - -def download_url_with_progress(url: str, progress: bool = False) -> BytesIO: - buf = BytesIO() - with requests.get(url, stream=True, allow_redirects=True) as response: - total_size = int(response.headers.get("content-length", 0)) - progress_bar = tqdm( - total=total_size, - unit="B", - unit_scale=True, - unit_divisor=1024, - disable=(not progress), - ) - - for chunk in response.iter_content(10 * 1024): - buf.write(chunk) - progress_bar.update(len(chunk)) - 
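The `chunkify` helper above relies on a shared-iterator trick: the `for` loop consumes one element, and `islice` drains the next `chunk_size - 1` from the *same* iterator. A self-contained restatement:

```python
from itertools import islice

def chunkify(iterator, chunk_size):
    """Yield successive chunks of chunk_size from the iterable."""
    if isinstance(iterator, list):
        iterator = iter(iterator)
    for first in iterator:
        # `islice` advances the same iterator the for-loop is consuming,
        # so each pass yields `first` plus up to chunk_size - 1 more items.
        yield [first] + list(islice(iterator, chunk_size - 1))

chunks = list(chunkify([1, 2, 3, 4, 5, 6, 7], 3))
```

The final chunk may be shorter than `chunk_size` when the input length is not a multiple of it.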
progress_bar.close()
-
-    buf.seek(0)
-    return buf
diff --git a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/visualization.py b/Paper2Video/src/evaluation/PresentQuiz/docling/utils/visualization.py
deleted file mode 100644
index 465b7749fba06987ef3b6e0ab802d9773ca30046..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/docling/utils/visualization.py
+++ /dev/null
@@ -1,80 +0,0 @@
-from docling_core.types.doc import DocItemLabel
-from PIL import Image, ImageDraw, ImageFont
-from PIL.ImageFont import FreeTypeFont
-
-from docling.datamodel.base_models import Cluster
-
-
-def draw_clusters(
-    image: Image.Image, clusters: list[Cluster], scale_x: float, scale_y: float
-) -> None:
-    """
-    Draw clusters on an image
-    """
-    draw = ImageDraw.Draw(image, "RGBA")
-    # Create a smaller font for the labels
-    font: ImageFont.ImageFont | FreeTypeFont
-    try:
-        font = ImageFont.truetype("arial.ttf", 12)
-    except OSError:
-        # Fallback to default font if arial is not available
-        font = ImageFont.load_default()
-    for c_tl in clusters:
-        all_clusters = [c_tl, *c_tl.children]
-        for c in all_clusters:
-            # Draw cells first (underneath)
-            cell_color = (0, 0, 0, 40)  # Transparent black for cells
-            for tc in c.cells:
-                cx0, cy0, cx1, cy1 = tc.bbox.as_tuple()
-                # x coordinates scale by scale_x, y coordinates by scale_y
-                cx0 *= scale_x
-                cx1 *= scale_x
-                cy0 *= scale_y
-                cy1 *= scale_y
-
-                draw.rectangle(
-                    [(cx0, cy0), (cx1, cy1)],
-                    outline=None,
-                    fill=cell_color,
-                )
-            # Draw cluster rectangle
-            x0, y0, x1, y1 = c.bbox.as_tuple()
-            x0 *= scale_x
-            x1 *= scale_x
-            y0 *= scale_y
-            y1 *= scale_y
-
-            cluster_fill_color = (*list(DocItemLabel.get_color(c.label)), 70)
-            cluster_outline_color = (
-                *list(DocItemLabel.get_color(c.label)),
-                255,
-            )
-            draw.rectangle(
-                [(x0, y0), (x1, y1)],
-                outline=cluster_outline_color,
-                fill=cluster_fill_color,
-            )
-            # Add label name and confidence
-            label_text = f"{c.label.name} ({c.confidence:.2f})"
-            # Create semi-transparent background for text
-            text_bbox =
draw.textbbox((x0, y0), label_text, font=font) - text_bg_padding = 2 - draw.rectangle( - [ - ( - text_bbox[0] - text_bg_padding, - text_bbox[1] - text_bg_padding, - ), - ( - text_bbox[2] + text_bg_padding, - text_bbox[3] + text_bg_padding, - ), - ], - fill=(255, 255, 255, 180), # Semi-transparent white - ) - # Draw text - draw.text( - (x0, y0), - label_text, - fill=(0, 0, 0, 255), # Solid black - font=font, - ) diff --git a/Paper2Video/src/evaluation/PresentQuiz/utils/__init__.py b/Paper2Video/src/evaluation/PresentQuiz/utils/__init__.py deleted file mode 100644 index 559b5ac416c503192cd37ff582cdb0013a5de125..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/utils/__init__.py +++ /dev/null @@ -1 +0,0 @@ -from . import poster_eval_utils, pptx_utils, wei_utils, critic_utils, ablation_utils, src \ No newline at end of file diff --git a/Paper2Video/src/evaluation/PresentQuiz/utils/ablation_utils.py b/Paper2Video/src/evaluation/PresentQuiz/utils/ablation_utils.py deleted file mode 100644 index 5632cfe09212db5b6f219c5c5eb8613971688dd5..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/utils/ablation_utils.py +++ /dev/null @@ -1,67 +0,0 @@ -import yaml -import json -from utils.wei_utils import account_token -from jinja2 import Environment, StrictUndefined -from camel.models import ModelFactory -from camel.agents import ChatAgent -from camel.messages import BaseMessage -from utils.src.utils import get_json_from_response - -def no_tree_get_layout(poster_width, poster_height, panels, figures, agent_config): - total_input_token, total_output_token = 0, 0 - agent_name = 'ablation_no_tree_layout' - with open(f"prompt_templates/{agent_name}.yaml", "r") as f: - planner_config = yaml.safe_load(f) - - jinja_env = Environment(undefined=StrictUndefined) - template = jinja_env.from_string(planner_config["template"]) - planner_jinja_args = { - 'poster_width': poster_width, - 'poster_height': poster_height, - 
'panels': json.dumps(panels, indent=4),
-        'figures': json.dumps(figures, indent=4),
-    }
-
-    planner_model = ModelFactory.create(
-        model_platform=agent_config['model_platform'],
-        model_type=agent_config['model_type'],
-        model_config_dict=agent_config['model_config'],
-    )
-
-    planner_agent = ChatAgent(
-        system_message=planner_config['system_prompt'],
-        model=planner_model,
-        message_window_size=None,
-    )
-
-    planner_prompt = template.render(**planner_jinja_args)
-
-    num_trials = 0
-
-    while True:
-        num_trials += 1
-        print(f"Trial {num_trials}: Generating layout...")
-        planner_agent.reset()
-        response = planner_agent.step(planner_prompt)
-        input_token, output_token = account_token(response)
-        total_input_token += input_token
-        total_output_token += output_token
-
-        arrangements = get_json_from_response(response.msgs[0].content)
-
-        if len(arrangements) == 0:
-            print('Error: Empty response, retrying...')
-            continue
-
-        if 'panel_arrangement' not in arrangements or \
-           'figure_arrangement' not in arrangements or \
-           'text_arrangement' not in arrangements:
-            print('Error: Invalid response, retrying...')
-            continue
-
-        if len(arrangements['panel_arrangement']) != len(panels) or \
-           len(arrangements['figure_arrangement']) != len(figures):
-            print('Error: Invalid response, retrying...')
-            continue
-        break
-    # Return the accumulated totals, not just the last trial's counts
-    return arrangements['panel_arrangement'], arrangements['figure_arrangement'], arrangements['text_arrangement'], total_input_token, total_output_token
\ No newline at end of file
diff --git a/Paper2Video/src/evaluation/PresentQuiz/utils/critic_utils.py b/Paper2Video/src/evaluation/PresentQuiz/utils/critic_utils.py
deleted file mode 100644
index 4142e33e61756e0ee7be68b11e802ae6e6f58213..0000000000000000000000000000000000000000
--- a/Paper2Video/src/evaluation/PresentQuiz/utils/critic_utils.py
+++ /dev/null
@@ -1,157 +0,0 @@
-from PIL import Image
-import io
-import json
-
-def crop_image(image, x:float, y:float, width:float, height:float):
-    """Crop the image based on the normalized
coordinates.
-    Return the cropped image.
-    This has the effect of zooming in on the image crop.
-
-    Args:
-        image (PIL.Image.Image): the input image
-        x (float): the horizontal coordinate of the upper-left corner of the box
-        y (float): the vertical coordinate of that corner
-        width (float): the box width
-        height (float): the box height
-
-    Returns:
-        cropped_img (PIL.Image.Image): the cropped image
-
-    Example:
-        image = Image.open("sample_img.jpg")
-        cropped_img = crop_image(image, 0.2, 0.3, 0.5, 0.4)
-        display(cropped_img)
-    """
-
-    # get width and height of image
-    w, h = image.size
-
-    # limit the range of x and y
-    x = min(max(0, x), 1)
-    y = min(max(0, y), 1)
-    x2 = min(max(0, x+width), 1)
-    y2 = min(max(0, y+height), 1)
-
-    cropped_img = image.crop((x*w, y*h, x2*w, y2*h))
-
-    buffer = io.BytesIO()
-    cropped_img.save(buffer, format="JPEG")
-    buffer.seek(0)  # Reset buffer position
-
-    # Load as a JpegImageFile
-    jpeg_image = Image.open(buffer)
-    return jpeg_image
-
-
-def zoom_in_image_by_bbox(image, box, padding=0.01):
-    """A simple wrapper function to crop the image based on the bounding box.
-    The padding cannot be too small; the minimum is 0.01.
-
-    Args:
-        image (PIL.Image.Image): the input image
-        box (List[float]): the bounding box in the format of [x, y, w, h]
-        padding (float, optional): The padding for the image crop, outside of the bounding box. Defaults to 0.01.
-
-    Returns:
-        cropped_img (PIL.Image.Image): the cropped image
-
-    Example:
-        image = Image.open("sample_img.jpg")
-        annotated_img, boxes = detection(image, "bus")
-        cropped_img = zoom_in_image_by_bbox(image, boxes[0], padding=0.1)
-        display(cropped_img)
-    """
-    assert padding >= 0.01, "The padding should be at least 0.01"
-    x, y, w, h = box
-    x, y, w, h = x-padding, y-padding, w+2*padding, h+2*padding
-    return crop_image(image, x, y, w, h)
-
-
-def parse_inch_string(inch_str: str) -> float:
-    """
-    Convert a string like '12.0 Inches' into a float (12.0).
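The coordinate math inside `crop_image` can be checked without PIL; this sketch (the name `crop_box` is illustrative) reproduces the clamping and the conversion from normalized coordinates to a pixel-space `(left, top, right, bottom)` box:

```python
def crop_box(img_w, img_h, x, y, width, height):
    # Clamp normalized coordinates to [0, 1], as crop_image does,
    # then scale to a pixel-space (left, top, right, bottom) tuple.
    x = min(max(0, x), 1)
    y = min(max(0, y), 1)
    x2 = min(max(0, x + width), 1)
    y2 = min(max(0, y + height), 1)
    return (x * img_w, y * img_h, x2 * img_w, y2 * img_h)

box = crop_box(1000, 800, 0.2, 0.3, 0.5, 0.4)
```

A box that extends past the right or bottom edge is silently clipped to the image boundary by the clamping step.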
- """ - return float(inch_str.replace(" Inches", "").strip()) - -def convert_pptx_bboxes_to_image_space(bbox_dict, slide_width_in, slide_height_in): - """ - Convert each PPTX bounding box (in inches) to normalized image coords. - - bbox_dict format example: - { - 'TitleAndAuthor': { - 'left': '12.0 Inches', 'top': '1.0 Inches', - 'width': '24.0 Inches', 'height': '2.0 Inches' - }, - ... - } - - Returns a dictionary with the same keys, but values as [x_norm, y_norm, w_norm, h_norm]. - """ - result = {} - for label, box in bbox_dict.items(): - left_in = parse_inch_string(box['left']) - top_in = parse_inch_string(box['top']) - width_in = parse_inch_string(box['width']) - height_in = parse_inch_string(box['height']) - - x_norm = left_in / slide_width_in - y_norm = top_in / slide_height_in - w_norm = width_in / slide_width_in - h_norm = height_in / slide_height_in - - result[label] = [x_norm, y_norm, w_norm, h_norm] - return result - -def convert_pptx_bboxes_json_to_image_json(bbox_json_str, slide_width_in, slide_height_in): - """ - Convert bounding boxes (in inches) from a JSON string to normalized image coords [0..1]. - - Args: - bbox_json_str (str): JSON text of the bounding box dictionary you provided. - Example of the structure (in JSON): - { - "TitleAndAuthor": { - "left": "12.0 Inches", - "top": "1.0 Inches", - "width": "24.0 Inches", - "height": "2.0 Inches" - }, - "Abstract-Section Title": { ... }, - ... - } - slide_width_in (float): The total slide width in inches. - slide_height_in (float): The total slide height in inches. - - Returns: - str: A JSON string, where each key maps to [x_norm, y_norm, w_norm, h_norm]. 
- """ - - def parse_inch_string(inch_str: str) -> float: - """Helper to parse '12.0 Inches' -> 12.0 (float).""" - return float(inch_str.replace(" Inches", "").strip()) - - # 1) Parse the incoming JSON string to a Python dict - if type(bbox_json_str) == str: - bbox_dict = json.loads(bbox_json_str) - else: - bbox_dict = bbox_json_str - - # 2) Convert each bounding box to normalized coordinates [x, y, w, h] - normalized_bboxes = {} - for label, box in bbox_dict.items(): - left_in = parse_inch_string(box['left']) - top_in = parse_inch_string(box['top']) - width_in = parse_inch_string(box['width']) - height_in = parse_inch_string(box['height']) - - x_norm = left_in / slide_width_in - y_norm = top_in / slide_height_in - w_norm = width_in / slide_width_in - h_norm = height_in / slide_height_in - - normalized_bboxes[label] = [x_norm, y_norm, w_norm, h_norm] - - # 3) Return as a JSON string - return normalized_bboxes - diff --git a/Paper2Video/src/evaluation/PresentQuiz/utils/poster_eval_utils.py b/Paper2Video/src/evaluation/PresentQuiz/utils/poster_eval_utils.py deleted file mode 100644 index 953edd2b45581d7c74f4d9f8e366457c1b7c9b9b..0000000000000000000000000000000000000000 --- a/Paper2Video/src/evaluation/PresentQuiz/utils/poster_eval_utils.py +++ /dev/null @@ -1,1273 +0,0 @@ -import random -import string -import yaml -import PIL -import tempfile -import io -from camel.models import ModelFactory -from math import ceil -from openai import OpenAI -from camel.messages import BaseMessage -from utils.src.model_utils import parse_pdf -from urllib.parse import unquote -from copy import deepcopy -from transformers import AutoTokenizer, AutoModelForCausalLM -from pytorch_fid.fid_score import compute_statistics_of_path -import pytorch_fid.fid_score as fid -from PIL import Image -from httpx import Timeout -from docling.document_converter import DocumentConverter, PdfFormatOption -import re -import shutil -import pytesseract -from utils.wei_utils import account_token -from 
camel.types import ModelPlatformType, ModelType -from marker.models import create_model_dict -from camel.configs import ChatGPTConfig -from camel.agents import ChatAgent -from jinja2 import Environment, StrictUndefined -from utils.src.utils import get_json_from_response -from pathlib import Path -from docling_core.types.doc import ImageRefMode, PictureItem, TableItem -from collections import defaultdict - -from docling.datamodel.base_models import InputFormat -from docling.datamodel.pipeline_options import PdfPipelineOptions -from docling.document_converter import DocumentConverter, PdfFormatOption - -import math -import base64 -import requests -from io import BytesIO -from PIL import Image - -import torch -import json -import os -import pickle as pkl -import numpy as np -from transformers import AltCLIPProcessor, AltCLIPModel - -def pil_to_data_uri(img: Image.Image, fmt: str = "PNG") -> str: - """ - Convert a PIL.Image to a base-64 data URI suitable for - the OpenAI/vLLM 'image_url' block. - fmt = 'PNG' (lossless) or 'JPEG' (smaller, 0-100 quality). 
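The data-URI shape that `pil_to_data_uri` produces can be illustrated on raw bytes without PIL (`to_data_uri` is a hypothetical stand-in that skips the image-encoding step):

```python
import base64

def to_data_uri(raw: bytes, mime: str = "image/png") -> str:
    # Same "data:<mime>;base64,<payload>" shape pil_to_data_uri returns.
    b64 = base64.b64encode(raw).decode()
    return f"data:{mime};base64,{b64}"

uri = to_data_uri(b"\x89PNG")
```

Decoding the payload after the comma recovers the original bytes, which is what an OpenAI-compatible `image_url` block expects.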
- """ - buf = io.BytesIO() - if fmt.upper() == "JPEG": - img.save(buf, format="JPEG", quality=90) - mime = "image/jpeg" - else: - img.save(buf, format="PNG") - mime = "image/png" - b64 = base64.b64encode(buf.getvalue()).decode() - return f"data:{mime};base64,{b64}" - -def md_to_blocks( - md: str, - base_dir='' -): - blocks, pos = [], 0 - pat = re.compile(r'!\[.*?\]\((.*?)\)', re.DOTALL) - - for m in pat.finditer(md): - # --- text before this image --------------------------------------- - txt = md[pos : m.start()].strip() - if txt: - blocks.append({"type": "text", "text": txt}) - - # --- the image itself --------------------------------------------- - img_path = unquote(m.group(1)) - img_path = os.path.join(base_dir, img_path) - - blocks.append({"type": "image_url", "image_url": {"url": pil_to_data_uri(Image.open(img_path), fmt="PNG")}}) - pos = m.end() - - # --- any trailing text ------------------------------------------------- - tail = md[pos:].strip() - if tail: - blocks.append({"type": "text", "text": tail}) - - return blocks - -def compute_vlm_ppl(content): - VLLM_BASE_URL = "http://localhost:7000/v1" - MODEL_ID = "Qwen/Qwen2.5-VL-7B-Instruct" - - client = OpenAI( - api_key="EMPTY", # vLLM ignores auth - base_url=VLLM_BASE_URL, - timeout=Timeout(5000) - ) - - resp = client.chat.completions.create( - model=MODEL_ID, - messages=[{ - "role": "user", - "content": content, - }], - temperature=0.0, - max_tokens=1, - logprobs=0, - extra_body={ - "prompt_logprobs": 1, - "echo": True - } - ) - - lp_list = resp.to_dict()["prompt_logprobs"] # list[dict] - total_lp = 0.0 - n_text = 0 - - for token_entry in lp_list: - if not token_entry: - continue - # find the sub-entry with rank==1 (the real token) - token_info = next(v for v in token_entry.values() if v["rank"] == 1) - tok, lp = token_info["decoded_token"], token_info["logprob"] - - # skip image sentinels / padding - if re.fullmatch(r"<\|?image[^>]*\|?>", tok): - continue - - total_lp += lp - n_text += 1 - - return 
math.exp(-total_lp / n_text)
-
-def compute_interleaved_ppl(paper_name, poster_method):
-    base_dir = f'eval_poster_markdown/{paper_name}/{poster_method}'
-    with open(os.path.join(base_dir, f'{paper_name}-with-image-refs.md'), 'r') as f:
-        md = f.read()
-    parts = md_to_blocks(md, base_dir)
-    while parts:
-        try:
-            return compute_vlm_ppl(parts)
-        except Exception:
-            # Drop the trailing block and retry; stop once nothing is left
-            # so a persistent failure cannot loop forever.
-            parts = parts[:-1]
-    raise RuntimeError('compute_vlm_ppl failed for every prefix of the document')
-
-
-def get_visual_ppl(image, text):
-
-    img_uri = pil_to_data_uri(image, fmt="PNG")
-    content = [
-        {"type": "text", "text": text},
-        {"type": "image_url", "image_url": {"url": img_uri}},
-    ]
-
-    return compute_vlm_ppl(content)
-
-def estimate_visual_tokens(
-    images,
-    *,
-    resized_height: int | None = None,
-    resized_width: int | None = None,
-    min_pixels: int | None = None,
-    max_pixels: int | None = None,
-):
-    """Return per-image *visual-token* counts for **Qwen-2.5-VL**.
-
-    Token count = ⌈H/28⌉ × ⌈W/28⌉ after the model's resizing rules. The helper
-    mirrors those rules so your offline estimate aligns with server billing.
-    """
-    counts = []
-
-    for img in images:
-        h, w = img.height, img.width
-        # manual resize overrides (rarely used)
-        if resized_height and resized_width:
-            h, w = resized_height, resized_width
-        # area-based resize to respect min/max tokens
-        if min_pixels and h * w < min_pixels:
-            scale = (min_pixels / (h * w)) ** 0.5
-            h, w = int(h * scale), int(w * scale)
-        if max_pixels and h * w > max_pixels:
-            scale = (max_pixels / (h * w)) ** 0.5
-            h, w = int(h * scale), int(w * scale)
-        # round each side to multiple of 28
-        h = ceil(h / 28) * 28
-        w = ceil(w / 28) * 28
-        counts.append((h // 28) * (w // 28))
-
-    return counts
-
-def image_memory_size(img: Image.Image, fmt="JPEG"):
-    buf = BytesIO()
-    img.save(buf, format=fmt)
-    return buf.tell()
-
-def truncate_images_to_fit(
-    images,
-    *,
-    max_ctx: int,
-    **resize_kwargs,
-):
-    """Drop **later** images until total visual tokens ≤ *max_ctx*.
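The per-image token rule described above (round each side up to a multiple of 28, then count 28×28 patches) reduces to a two-line function once the optional resizing is set aside:

```python
from math import ceil

def qwen_visual_tokens(h: int, w: int) -> int:
    # Round each side up to a multiple of 28, then count 28x28 patches:
    # tokens = ceil(H/28) * ceil(W/28), per the docstring above.
    h = ceil(h / 28) * 28
    w = ceil(w / 28) * 28
    return (h // 28) * (w // 28)

tokens = qwen_visual_tokens(448, 448)  # 16 * 16 patches
```

Note that even a 1-pixel overhang on one side adds a whole extra row or column of patches.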
- - Chronology‑preserving version: keeps the earliest images intact and - trims the tail when necessary. - """ - - tokens = estimate_visual_tokens(images, **resize_kwargs) - max_size = 45 * 1024 * 1024 # 45 MB - total_size = 0 - keep = [] - total = 0 - for img, n_tok in zip(images, tokens): # iterate in original order - if total + n_tok > max_ctx: - break # stop adding once budget exceeded – we drop the rest - img_size = image_memory_size(img) - if total_size + img_size > max_size: - break - keep.append(img) - total += n_tok - return keep - - -def compute_poster_image_ppl(images): - max_ctx = 128_000 # max visual tokens for Qwen2.5-VL - truncated_images = truncate_images_to_fit(images, max_ctx=max_ctx) - img_uris = [pil_to_data_uri(image, fmt="PNG") for image in truncated_images] - content = [ - {"type": "image_url", "image_url": {"url": img_uri}} for img_uri in img_uris - ] - - return compute_vlm_ppl(content) - - -def compute_clip_embeddings(folder, model, processor, device): - """ - Loads each image in `folder`, encodes it with the CLIP model, - and returns a list (or array) of embeddings, shape (N, D). 
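The budget logic in `truncate_images_to_fit` is a greedy prefix scan: earliest items survive, and everything after the first over-budget item is dropped. A token-count-only sketch (`keep_within_budget` is an illustrative name; the parallel size cap is omitted):

```python
def keep_within_budget(token_counts, max_ctx):
    # Greedy prefix keep: stop at the first item that would exceed the budget
    # and drop the rest, preserving chronological order.
    keep, total = [], 0
    for i, n in enumerate(token_counts):
        if total + n > max_ctx:
            break
        keep.append(i)
        total += n
    return keep

kept = keep_within_budget([100, 200, 300], 450)  # third image would overflow
```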
- """ - model.eval() - embeddings = [] - - # Gather all image files - image_files = [ - f for f in os.listdir(folder) - if f.lower().endswith(('.png', '.jpg', '.jpeg')) - ] - - if not image_files: - print(f"No valid images found in {folder}") - return np.array([]) - - for filename in image_files: - img_path = os.path.join(folder, filename) - image = Image.open(img_path).convert('RGB') - - # Preprocess for CLIP - inputs = processor(images=image, return_tensors="pt").to(device) - - # Encode and get the image embeddings - with torch.no_grad(): - clip_emb = model.get_image_features(**inputs) - # Move to CPU and convert to NumPy - clip_emb = clip_emb[0].cpu().numpy() - embeddings.append(clip_emb) - - return np.array(embeddings) # shape: (N, D) - -def compute_clip_embedding(input_data, model, processor, device='cuda', input_type=None): - """ - Compute a CLIP embedding for either an image or text. - - Parameters - ---------- - input_data : str or PIL.Image.Image - - If a string: treated as a file path to an image (if file exists) or as a text prompt. - - If a PIL.Image.Image: treated as an image. - model : CLIPModel - The loaded CLIP model (e.g., from Hugging Face). - processor : CLIPProcessor - The corresponding CLIP processor for tokenization/preprocessing. - device : torch.device - The device to run inference on. - input_type : {'image', 'text', None}, optional - Force the mode; if `None` (default) the function will try to infer from `input_data`. - - Returns - ------- - np.ndarray - A 1D NumPy array of length D (the CLIP embedding dimension). 
- """ - model.eval() - - # Decide mode - if input_type == "image": - mode = "image" - elif input_type == "text": - mode = "text" - else: - # auto-detect - if isinstance(input_data, Image.Image): - mode = "image" - elif isinstance(input_data, str) and os.path.isfile(input_data): - mode = "image" - else: - mode = "text" - - # Preprocess + encode - with torch.no_grad(): - if mode == "image": - if isinstance(input_data, str): - image = Image.open(input_data).convert("RGB") - else: - image = input_data.convert("RGB") - inputs = processor(images=image, return_tensors="pt").to(device) - features = model.get_image_features(**inputs) - - else: # text mode - # CLIP expects a list of strings - texts = [input_data] if isinstance(input_data, str) else list(input_data) - inputs = processor( - text=texts, - return_tensors="pt", - padding=True, - truncation=True, - max_length=processor.tokenizer.model_max_length, - ).to(device) - features = model.get_text_features(**inputs) - - # extract, move to CPU, convert to numpy - emb = features[0].cpu().numpy() - - return emb - -def compute_average_l2_distance(emb1, emb2): - """ - Computes the average L2 distance across all pairs in emb1 x emb2. - - emb1 shape: (N1, D) - - emb2 shape: (N2, D) - Returns a single float: mean of all pairwise distances. - """ - distances = [] - for e1 in emb1: - for e2 in emb2: - dist = np.linalg.norm(e1 - e2) # L2 distance - distances.append(dist) - return np.mean(distances) if distances else float('nan') - -def compute_cosine_similarity(e1, e2): - """ - Computes the cosine similarity between two vectors. - - e1 shape: (D,) - - e2 shape: (D,) - Returns a single float: cosine similarity. - """ - dot = np.dot(e1, e2) - norm_e1 = np.linalg.norm(e1) - norm_e2 = np.linalg.norm(e2) - return dot / (norm_e1 * norm_e2 + 1e-8) # avoid division by zero - -def compute_average_cosine_similarity(emb1, emb2): - """ - Computes the average cosine similarity across all pairs in emb1 x emb2. 
- - emb1 shape: (N1, D)
-    - emb2 shape: (N2, D)
-    Returns a single float: mean of all pairwise similarities.
-    """
-    similarities = []
-    for e1 in emb1:
-        for e2 in emb2:
-            # Cosine similarity = (e1 · e2) / (||e1|| * ||e2||)
-            dot = np.dot(e1, e2)
-            norm_e1 = np.linalg.norm(e1)
-            norm_e2 = np.linalg.norm(e2)
-            cos_sim = dot / (norm_e1 * norm_e2 + 1e-8)
-            similarities.append(cos_sim)
-    return np.mean(similarities) if similarities else float('nan')
-
-def compare_folders_with_clip(folder1, folder2):
-    """
-    Loads a CLIP model from Hugging Face,
-    gets embeddings for each folder,
-    and computes both average L2 distance and average cosine similarity.
-    """
-    device = "cuda" if torch.cuda.is_available() else "cpu"
-
-    model_name = "BAAI/AltCLIP"
-    model = AltCLIPModel.from_pretrained(model_name).to(device)
-    processor = AltCLIPProcessor.from_pretrained(model_name)
-
-    # Compute embeddings
-    emb1 = compute_clip_embeddings(folder1, model, processor, device)
-    emb2 = compute_clip_embeddings(folder2, model, processor, device)
-
-    if emb1.size == 0 or emb2.size == 0:
-        print("One of the folders had no valid images.
Comparison not possible.") - return None, None - - # Average L2 Distance - avg_l2 = compute_average_l2_distance(emb1, emb2) - - # Average Cosine Similarity - avg_cos_sim = compute_average_cosine_similarity(emb1, emb2) - - return avg_l2, avg_cos_sim - -def convert_folder_to_grayscale(input_folder, output_folder): - os.makedirs(output_folder, exist_ok=True) - for filename in os.listdir(input_folder): - if filename.lower().endswith(('.jpg', '.jpeg', '.png')): - input_path = os.path.join(input_folder, filename) - output_path = os.path.join(output_folder, filename) - - img = Image.open(input_path).convert('L').convert('RGB') # grayscale + 3 channels - img.save(output_path) - -def compute_fid_with_grayscale(reference_poster_folder, generated_poster_img_folder, clip=False): - # Step 1: Create grayscale versions in tmp/ - tmp_ref = 'tmp/ref_gray' - tmp_gen = 'tmp/gen_gray' - - if os.path.exists('tmp/ref_gray'): - shutil.rmtree('tmp/ref_gray') - - if os.path.exists('tmp/gen_gray'): - shutil.rmtree('tmp/gen_gray') - os.makedirs(tmp_ref) - os.makedirs(tmp_gen) - - convert_folder_to_grayscale(reference_poster_folder, tmp_ref) - convert_folder_to_grayscale(generated_poster_img_folder, tmp_gen) - - if clip: - return compare_folders_with_clip(tmp_ref, tmp_gen) - - # Step 2: Compute FID - model = fid.InceptionV3([fid.InceptionV3.BLOCK_INDEX_BY_DIM[2048]]).to('cuda') - m1, s1 = compute_statistics_of_path(tmp_ref, model, 1, 2048, 'cuda') - m2, s2 = compute_statistics_of_path(tmp_gen, model, 1, 2048, 'cuda') - fid_score = fid.calculate_frechet_distance(m1, s1, m2, s2) - - return fid_score - -def compute_fid(reference_poster_folder, generated_poster_img_folder, clip=False): - if clip: - return compare_folders_with_clip(reference_poster_folder, generated_poster_img_folder) - model = fid.InceptionV3([fid.InceptionV3.BLOCK_INDEX_BY_DIM[2048]]).to('cuda') - - m1, s1 = compute_statistics_of_path(reference_poster_folder, model, 1, 2048, 'cuda') - m2, s2 = 
compute_statistics_of_path(generated_poster_img_folder, model, 1, 2048, 'cuda')
-
-    fid_score = fid.calculate_frechet_distance(
-        m1, s1, m2, s2
-    )
-
-    return fid_score
-
-
-def get_poster_text(poster_path):
-    # Strip HTML comments (e.g. docling's "<!-- image -->" markers) from the export
-    markdown_clean_pattern = re.compile(r"<!--[\s\S]*?-->")
-    converter = DocumentConverter()
-    raw_result = converter.convert(poster_path)
-
-    raw_markdown = raw_result.document.export_to_markdown()
-    text_content = markdown_clean_pattern.sub("", raw_markdown)
-    if len(text_content) < 500:
-        print('\nParsing with docling failed, using marker instead\n')
-        parser_model = create_model_dict(device='cuda', dtype=torch.float16)
-        text_content, rendered = parse_pdf(poster_path, model_lst=parser_model, save_file=False)
-    return text_content
-
-def qwen2_vl_ppl(
-    image: Image.Image,
-    text: str,
-    *,
-    vllm_url: str = "http://localhost:8000/v1/chat/completions",
-    model: str = "Qwen/Qwen2-VL-7B",  # whatever name you passed to vLLM
-) -> float:
-    """
-    Compute PPL(text | image) with a Qwen2-VL-7B model served by vLLM.
-
-    Parameters
-    ----------
-    image : PIL.Image.Image
-        Input image.
-    text : str
-        Prompt text that follows the image.
-    vllm_url : str, default "http://localhost:8000/v1/chat/completions"
-        The full URL of the vLLM chat endpoint.
-    model : str, default "Qwen2-VL-7B"
-        Model name as registered when you launched vLLM.
-
-    Returns
-    -------
-    float
-        Per-token perplexity of `text` conditioned on `image`.
- """ - - # 1) Encode the image as base64‑PNG - buf = BytesIO() - image.save(buf, format="PNG") - img_b64 = base64.b64encode(buf.getvalue()).decode() - - # 2) Build a multimodal chat message: image first, then text - messages = [ - { - "role": "user", - "content": [ - { - "type": "image_url", - "image_url": {"url": f"data:image/png;base64,{img_b64}"} - }, - { - "type": "text", - "text": text - } - ], - } - ] - - # 3) Ask vLLM to echo the prompt and give log‑probs - payload = { - "model": model, - "messages": messages, - "temperature": 0.0, - "max_tokens": 0, # no generation – just evaluate prompt - "echo": True, - "logprobs": 1 - } - - resp = requests.post(vllm_url, json=payload, timeout=60) - resp.raise_for_status() - data = resp.json() - - # 4) Extract prompt‑token log‑probs - token_logps = data["choices"][0]["logprobs"]["token_logprobs"] - - # Ignore special tokens & image placeholders (returned as None) - valid = [lp for lp in token_logps if lp is not None] - if not valid: - raise ValueError("No valid text tokens found in logprobs") - - # 5) Perplexity = exp( − average logp ) - return math.exp(-sum(valid) / len(valid)) - -def get_ppl( - text: str, - model_name: str = "meta-llama/Llama-2-7b-hf", - stride: int = 512, -) -> float: - """Compute perplexity for arbitrarily long *text* using a sliding‑window approach. - - Parameters - ---------- - text : str - The input string (any length). - model_name : str, optional - HF Hub id of the model to use, by default "meta-llama/Llama-2-7b-hf". - stride : int, optional - Overlap between successive windows. 512 tends to work well for most - Transformer LMs with a 2 k context. Increase it for higher accuracy at - the cost of more compute. - - Returns - ------- - float - Per‑token perplexity under the given model. 
- """ - # Load tokenizer / model once per call (cache makes subsequent calls cheap) - tokenizer = AutoTokenizer.from_pretrained(model_name) - model = AutoModelForCausalLM.from_pretrained( - model_name, - torch_dtype=torch.float16, - device_map="auto", # place on GPU if available - ) - model.eval() - - # Encode the whole string in one shot - encodings = tokenizer(text, return_tensors="pt") - input_ids = encodings.input_ids[0] - - # Model context length (e.g. 2048 for Llama‑2) - max_len = model.config.max_position_embeddings - - # --- Short input: fits in a single window -------------------------------- - if input_ids.size(0) <= max_len: - with torch.no_grad(): - out = model(input_ids.unsqueeze(0).to(model.device), labels=input_ids.unsqueeze(0).to(model.device)) - return torch.exp(out.loss).item() - - # --- Long input: sliding window with overlap ----------------------------- - nlls = [] # negative‑log‑likelihoods (already multiplied by #tokens scored) - for i in range(0, input_ids.size(0), stride): - begin_loc = max(i + stride - max_len, 0) - end_loc = min(i + stride, input_ids.size(0)) - trg_len = end_loc - i # tokens we actually score in this window - - ids_chunk = input_ids[begin_loc:end_loc] - labels = ids_chunk.clone() - labels[:-trg_len] = -100 # mask out purely‑context tokens - - with torch.no_grad(): - out = model(ids_chunk.unsqueeze(0).to(model.device), labels=labels.unsqueeze(0).to(model.device)) - nll = out.loss * trg_len # make additive so we can sum across windows - nlls.append(nll) - - if end_loc == input_ids.size(0): - break - - ppl = torch.exp(torch.stack(nlls).sum() / input_ids.size(0)) - return ppl.item() - -def extract_text_from_image(image_path): - """ - Open an image file and use Tesseract OCR to extract text. 
- :param image_path: Path to the image file - :return: Extracted text as a string - """ - image = Image.open(image_path) - text = pytesseract.image_to_string(image) - return text - -import tiktoken - -def count_tokens(text: str, model: str = "gpt-4o") -> int: - """ - Count the number of tokens in `text` according to OpenAI's tokenizer. - - :param text: The input string you want to measure. - :param model: Which model’s encoding to mimic (defaults to “gpt-4o”). - Common choices: "gpt-3.5-turbo", "gpt-4o", "gpt-4o-mini". - :return: The number of tokens. - """ - # Grab the right encoder for the model; falls back to the nearest base if needed - try: - enc = tiktoken.encoding_for_model(model) - except KeyError: - # All chat models use the cl100k_base encoding - enc = tiktoken.get_encoding("cl100k_base") - - return len(enc.encode(text)) - -def count_words(text): - """ - Count the number of words in a given text string. - :param text: Input text - :return: Number of words found - """ - # Use a regex to find word-like sequences - words = re.findall(r"\w+", text) - return len(words) - - -def count_words_in_image(image_path): - """ - Extract text from an image and count its words. - :param image_path: Path to the image file - :return: Word count (int) - """ - text = extract_text_from_image(image_path) - return count_words(text) - -def count_tokens_in_image(image_path, model="gpt-4o"): - """ - Extract text from an image and count its tokens. - :param image_path: Path to the image file - :param model: Which model’s encoding to mimic (defaults to “gpt-4o”). - Common choices: "gpt-3.5-turbo", "gpt-4o", "gpt-4o-mini". 
-    :return: Token count (int)
-    """
-    text = extract_text_from_image(image_path)
-    return count_tokens(text, model=model)
-
-def png_to_optimized_jpeg(img: Image.Image,
-                          max_size=(2048, 2048),
-                          quality=80) -> BytesIO:
-    """
-    Take a PNG PIL Image, downsample it to fit within max_size (preserving aspect
-    ratio), then JPEG-compress it at the given quality into a BytesIO buffer.
-
-    Args:
-        img: PIL.Image opened from your .png
-        max_size: (width, height) ceiling for downsampling
-        quality: JPEG quality 1–95 (higher = better quality / larger file)
-
-    Returns:
-        BytesIO containing the JPEG bytes.
-    """
-    # 1) Downsample in place (preserves aspect ratio)
-    img_copy = img.copy()
-    img_copy.thumbnail(max_size, resample=Image.LANCZOS)
-
-    # 2) Convert to RGB (drop alpha) and save with compression
-    rgb = img_copy.convert("RGB")
-    buf = BytesIO()
-    rgb.save(
-        buf,
-        format="JPEG",
-        quality=quality,    # try 80–90 for minimal artifacts
-        optimize=True,      # runs an extra pass to squeeze out redundant data
-        progressive=True    # allows incremental render in browsers/viewers
-    )
-    buf.seek(0)
-    return buf
-
-def get_answers_and_remove_answers(questions):
-    question_only, answers, aspects = {}, {}, {}
-    for key, val in questions.items():
-        question_only[key] = {
-            'question': val['question'],
-            'options': val['options']
-        }
-        answers[key] = val['answer']
-        aspects[key] = val['aspect']
-    return question_only, answers, aspects
-
-def open_folder_images(
-    folder_path,
-    paper_name,
-    return_path=False,
-    format='png',
-    max_size=(700, 700),
-    quality=80
-):
-    """
-    Opens all PNG images in folder_path named '{paper_name}-{index}.png',
-    starting from index=1 up to the first missing, and returns them
-    either as file-paths (if return_path=True) or as PIL.Image objects.
-
-    If format != 'png', each PNG is downsampled to fit within max_size
-    (preserving aspect ratio), converted to RGB, and saved into an
-    in-memory JPEG with the given quality, optimize and progressive flags.
- """ - images = [] - index = 1 - - while True: - png_name = f"{paper_name}-{index}.png" - path = os.path.join(folder_path, png_name) - if not os.path.isfile(path): - break - - if format == 'png': - if return_path: - images.append(path) - else: - images.append(Image.open(path)) - else: - # 1) Load and downsample - with Image.open(path) as im: - thumb = im.copy() - thumb.thumbnail(max_size, resample=Image.LANCZOS) - - # 2) Convert & compress to JPEG in-memory - rgb = thumb.convert("RGB") - buf = BytesIO() - rgb.save( - buf, - format="JPEG", - quality=quality, # e.g. 80–90 - optimize=True, # extra pass to strip redundant data - progressive=True # for incremental rendering - ) - buf.seek(0) - - if return_path: - # we return a tuple of (fake-jpg-filename, buffer) - jpg_name = png_name.rsplit('.', 1)[0] + '.jpg' - images.append((jpg_name, buf)) - else: - images.append(Image.open(buf)) - - index += 1 - - return images - -def ensure_under_limit_pil(img, max_bytes: int = 10 * 1024 * 1024) -> Image.Image: - # Ensure RGB mode for JPEG compatibility - if img.mode in ("RGBA", "P"): - img = img.convert("RGB") - - # Try saving at decreasing qualities until under the limit - for quality in (90, 80, 70, 60, 50): - buf = io.BytesIO() - img.save(buf, format="JPEG", quality=quality) - new_raw = buf.getvalue() - if len(new_raw) <= max_bytes: - return Image.open(io.BytesIO(new_raw)) - - # Fallback: resize by half and save at low quality - w, h = img.size - img_resized = img.resize((w // 2, h // 2), Image.LANCZOS) - buf = io.BytesIO() - img_resized.save(buf, format="JPEG", quality=50) - new_raw = buf.getvalue() - if len(new_raw) > max_bytes: - raise RuntimeError("Could not reduce image under size limit") - - return Image.open(io.BytesIO(new_raw)) - -def eval_qa_get_answer(poster_input, questions, answers, aspects, input_type, agent_config): - agent_name = f'answer_question_from_{input_type}' - with open(f"utils/prompt_templates/{agent_name}.yaml", "r") as f: - config = yaml.safe_load(f) 
-
-    if agent_config['model_platform'].is_vllm:
-        actor_model = ModelFactory.create(
-            model_platform=agent_config['model_platform'],
-            model_type=agent_config['model_type'],
-            model_config_dict=agent_config['model_config'],
-            url=agent_config['url'],
-        )
-    else:
-        actor_model = ModelFactory.create(
-            model_platform=agent_config['model_platform'],
-            model_type=agent_config['model_type'],
-            model_config_dict=agent_config['model_config'],
-        )
-
-    actor_sys_msg = config['system_prompt']
-
-    actor_agent = ChatAgent(
-        system_message=actor_sys_msg,
-        model=actor_model,
-        message_window_size=None,
-    )
-
-    actor_agent.reset()
-
-    jinja_env = Environment(undefined=StrictUndefined)
-
-    template = jinja_env.from_string(config["template"])
-
-    if input_type == 'text':
-        prompt = template.render(**{
-            'questions': questions,
-            'poster_text': poster_input,
-        })
-        response = actor_agent.step(prompt)
-        agent_answers = get_json_from_response(response.msgs[0].content)
-    elif input_type == 'image':
-        if 'max_images' in agent_config:
-            max_images = agent_config['max_images']
-        else:
-            max_images = len(poster_input)
-        prompt = template.render(**{
-            'questions': questions,
-        })
-        msg = BaseMessage.make_user_message(
-            role_name="User",
-            content=prompt,
-            image_list=poster_input[:max_images],
-        )
-        response = actor_agent.step(msg)
-        agent_answers = get_json_from_response(response.msgs[0].content)
-
-    input_token, output_token = account_token(response)
-
-    accuracy, aspect_accuracy = compute_accuracy(agent_answers, answers, aspects)
-
-    return accuracy, aspect_accuracy, agent_answers, input_token, output_token
-
-
-def compute_accuracy(predicted, ground_truth, aspects):
-    """
-    Parameters
-    ----------
-    predicted : dict
-        {question: {'answer': <letter>, 'reference': ...}, ...}
-    ground_truth : dict
-        {question: '<letter>. 
full answer', ...}
-    aspects : dict
-        {question: '<aspect>', ...}
-
-    Returns
-    -------
-    overall_accuracy : float
-    aspect_summary : dict
-        {
-            '<aspect>': {
-                'total': <int>,      # questions in this aspect
-                'correct': <int>,    # correctly answered questions
-                'accuracy': <float>  # correct / total (0–1)
-            },
-            ...
-        }
-    """
-    correct_global = 0
-    total_global = len(ground_truth)
-
-    total_by_aspect = defaultdict(int)
-    correct_by_aspect = defaultdict(int)
-
-    for q, pred_info in predicted.items():
-        letter_pred = pred_info['answer']
-        ref = pred_info.get('reference', 'NA')
-
-        # Count this question toward its aspect, even if NA or missing gt
-        aspect = aspects.get(q, 'Unknown')
-        total_by_aspect[aspect] += 1
-
-        if letter_pred == 'NA' or ref == 'NA':
-            continue  # automatically wrong
-
-        if q in ground_truth:
-            letter_gt = ground_truth[q].split('.')[0].strip()
-
-            if len(letter_pred) > 0:
-                letter_pred = letter_pred[0].upper()
-            if letter_pred == letter_gt:
-                correct_global += 1
-                correct_by_aspect[aspect] += 1
-
-    overall_accuracy = correct_global / total_global if total_global else 0.0
-
-    # Build the per-aspect dictionary
-    aspect_summary = {}
-    for aspect, total in total_by_aspect.items():
-        correct = correct_by_aspect[aspect]
-        acc = correct / total if total else 0.0
-        aspect_summary[aspect] = {
-            'total': total,
-            'correct': correct,
-            'accuracy': acc
-        }
-
-    return overall_accuracy, aspect_summary
-
-def shuffle_question_options(question_data):
-    """
-    Shuffle the order of the options for each question in the question_data.
-    Also updates the "answer" field so that it uses the new letter corresponding
-    to the correct option.
-
-    Parameters:
-        question_data (dict): A dictionary where keys are question identifiers (e.g., "Question 1")
-            and values are dictionaries containing at least the keys "options" (a list
-            of option strings) and "answer" (a string matching one of the options).
- - Returns: - dict: A new dictionary with the same structure as question_data but with options shuffled - and answers updated. - """ - # Make a deep copy so we do not modify the original data - new_data = deepcopy(question_data) - - # Loop over each question - for q_key, q_content in new_data.items(): - original_options = q_content.get("options", []) - original_answer = q_content.get("answer", "") - - # Extract the text portion of the original answer. - # We assume that each option (and the answer) has the format "X.