PaperShow / posterbuilder /latex_proj /poster_output.tex
ZaynZhu
Clean version without large assets
7c08dc3
raw
history blame
7.08 kB
% Unofficial University of Cambridge Poster Template
% https://github.com/andiac/gemini-cam
% a fork of https://github.com/anishathalye/gemini
% also refer to https://github.com/k4rtik/uchicago-poster
\documentclass[final]{beamer}
% ====================
% Packages
% ====================
\usepackage[T1]{fontenc}
\usepackage{lmodern}
\usepackage[size=custom,width=120,height=72,scale=1.05]{beamerposter}
\usetheme{gemini}
\usecolortheme{cam}
\usepackage{graphicx}
\usepackage{booktabs}
\usepackage[numbers]{natbib}
\usepackage{tikz}
\usepackage{pgfplots}
\pgfplotsset{compat=1.14}
\usepackage{anyfontsize}
% ====================
% Lengths
% ====================
% If you have N columns, choose \sepwidth and \colwidth such that
% (N+1)*\sepwidth + N*\colwidth = \paperwidth
\newlength{\sepwidth}
\newlength{\colwidth}
\setlength{\sepwidth}{0.025\paperwidth}
\setlength{\colwidth}{0.3\paperwidth}
\newcommand{\separatorcolumn}{\begin{column}{\sepwidth}\end{column}}
% ====================
% Title
% ====================
\title{Paper2Poster: \\ Towards Multimodal Poster Automation from Scientific Papers}
\author{Wei Pang, Kevin Qinghong Lin, Xiangru Jian, Xi He, Philip Torr}
\institute[shortinst]{1 University of Waterloo, 2 National University of Singapore, 3 University of Oxford}
% ====================
% Footer (optional)
% ====================
\footercontent{
\href{https://www.example.com}{https://www.example.com} \hfill
ABC Conference 2025, New York --- XYZ-1234 \hfill
\href{mailto:alyssa.p.hacker@example.com}{alyssa.p.hacker@example.com}}
% (can be left out to remove footer)
% ====================
% Logo (optional)
% ====================
% use this to include logos on the left and/or right side of the header:
% \logoright{\includegraphics[height=7cm]{logo1.pdf}}
% \logoleft{\includegraphics[height=7cm]{logo2.pdf}}
% ====================
% Body
% ====================
% --- injected font tweaks ---
\setbeamerfont{title}{size=\huge}
\setbeamerfont{author}{size=\Large}
\setbeamerfont{institute}{size=\large}
\setbeamerfont{block title}{size=\Large}
\setbeamerfont{block body}{size=\large}
\setbeamerfont{caption}{size=\small}
\setlength{\abovecaptionskip}{4pt}
\setlength{\belowcaptionskip}{3pt}
\begin{document}
% Refer to https://github.com/k4rtik/uchicago-poster
% logo: https://www.cam.ac.uk/brand-resources/about-the-logo/logo-downloads
\addtobeamertemplate{headline}{}
{
\begin{tikzpicture}[remember picture,overlay]
\node [anchor=north west, inner sep=3cm] at ([xshift=0.0cm,yshift=1.0cm]current page.north west)
{\includegraphics[height=4.5cm]{logos/cambridge-reversed-color-logo.eps}};
\node[anchor=north east, inner sep=2.0cm] at ([xshift=-2.0cm,yshift=0.0cm]current page.north east)
{\includegraphics[height=6.0cm]{logo.png}};
\end{tikzpicture}
}
\begin{frame}[t]
\begin{columns}[t]
\separatorcolumn
\begin{column}{\colwidth}
\begin{block}{Introduction}
Academic posters are crucial for \textbf\{scientific communication\}, allowing rapid dissemination of key findings. Unlike slide decks, posters must condense entire papers into a single page, requiring \textit\{multi-modal context handling\}, \textcolor\{red\}\{tight text-graphics interleaving\}, and \textcolor\{red\}\{spatial constraint respect\}. Existing VLM- or LLM-only approaches lack explicit visual feedback, making it difficult to maintain logical flow and legibility.
\end{block}
\begin{block}{Benchmark \& Metrics}
We introduce the \textbf\{Paper2Poster Benchmark\}, the first benchmark for poster generation, evaluating outputs on \textcolor\{blue\}\{Visual Quality\}, \textcolor\{blue\}\{Textual Coherence\}, \textcolor\{blue\}\{Holistic Assessment\}, and \textcolor\{blue\}\{PaperQuiz\}. This benchmark pairs recent conference papers with author-designed posters, enabling systematic comparison and evaluation of generated posters.
\begin{figure}
\centering
\includegraphics[width=0.60\linewidth]{figures/paper-picture-1.png}
\caption{Overview of this work. We address two core challenges in scientific poster generation: Left: How to create a poster from a paper -we propose PosterAgent (Sec. 4), a framework that transforms long-context scientific papers (20K+ tokens) into structured visual posters; and Right: How to evaluate poster quality -weintroduce the Paper2Poster benchmark (Sec. 3), which enables systematic comparison between agent-generated and author-designed posters.}
\end{figure}
\end{block}
\end{column}
\separatorcolumn
\begin{column}{\colwidth}
\begin{block}{PosterAgent Framework}
Our proposed \textbf\{PosterAgent\} framework is a \textit\{multi-agent pipeline\} that transforms scientific papers into structured visual posters. It consists of three components: \textcolor\{blue\}\{Parser\}, \textcolor\{blue\}\{Planner\}, and \textcolor\{blue\}\{Painter-Commenter\}. The Parser distills the paper into a structured asset library, the Planner aligns text-visual pairs into a binary-tree layout, and the Painter-Commenter loop refines each panel using VLM feedback.
\begin{figure}
\centering
\includegraphics[width=0.78\linewidth]{figures/paper-picture-8.png}
\caption{Illustration of the PosterAgent pipeline. Given an input paper, PosterAgent generates a structured academic poster through three modules: 1. Parser: Extracts key textual and visual assets using a combination of tools and LLM-based summarization, resulting in a structured asset library. 2. Planner: Matches assets and arranges them into coherent layouts, iteratively generating panels with a zoom-in operation. 3. Painter-Commenter: The Painter generates panel-level bullet-content along with executable code, and renders the visual output, while the Commenter-a VLM with in-context reference-provides feedback to ensure layout coherence and prevent content overflow.}
\end{figure}
\end{block}
\begin{block}{Evaluation \& Results}
Our comprehensive evaluation reveals that \textbf\{PosterAgent\} outperforms existing systems across nearly all metrics, using \textcolor\{blue\}\{87\\% fewer tokens\}. While GPT-4o outputs are visually appealing, they suffer from \textcolor\{red\}\{noisy text\} and poor PaperQuiz scores. Our open-source variants, based on Qwen-2.5, achieve superior performance, highlighting the effectiveness of our \textit\{visual-semantic-aware asset library\} and \textit\{layout generation\}.
\begin{figure}
\centering
\includegraphics[width=0.79\linewidth]{figures/paper-table-1.png}
\end{figure}
\end{block}
\end{column}
\separatorcolumn
\begin{column}{\colwidth}
\begin{block}{Conclusion}
We present \textbf\{Paper2Poster\}, a new benchmark for poster generation, and the \textbf\{PosterAgent\} framework, which significantly enhances generation quality. Our findings chart clear directions for the next generation of fully automated poster-generation models, emphasizing the importance of \textit\{structured parsing\}, \textit\{hierarchical planning\}, and \textit\{visual feedback\}.
\end{block}
\end{column}
\separatorcolumn
\end{columns}
\end{frame}
\end{document}