pdf.tocgen.split / README.md
adelevett's picture
Upload 76 files
046e3b8 verified

A newer version of the Streamlit SDK is available: 1.55.0

Upgrade
metadata
title: PDF TOC Generator Split
emoji: 📑
colorFrom: blue
colorTo: indigo
sdk: streamlit
sdk_version: 1.41.1
app_file: app.py
pinned: false
license: agpl-3.0
short_description: Generate PDF Table of Contents and Split Chapters

PDF Table of Contents Generator (Split Edition)

Based on pdf.tocgen.

Features

  • Analyze Fonts: Automatically detect chapter headers by font size and style.
  • Search: Find headers by text search (Case Sensitive option available).
  • Generate TOC: Create a clickable PDF bookmark outline.
  • Split Chapters: Export each chapter as a separate PDF in a ZIP file.
  • Front/Back Matter: Automatically handle un-numbered front matter and user-defined back matter (Index, Glossary).

Usage

  1. Upload a PDF.
  2. Use "Scan & Generate" to find headers.
  3. Configure the "Back Matter" start page if needed.
  4. Run Pipeline.
  5. Download the Bookmarked PDF or the Zipped Chapter Splits.