YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

PDF to Image (Python)

License Repo Size Last Commit Python Markdown

Simply convert PDF files into rendered image pages at high resolution.

This project was inspired by and serves as a Python alternative to the PHP package spatie/pdf-to-image.

System Overview

graph TD
    A[PDF Input] --> B{Process Type}
    B -->|Single| C[single_page.py]
    B -->|Batch| D[all_page.py]
    C --> E[First Page Export]
    D --> F[Full Document Export]
    E --> G[Output Folder]
    F --> G

Project Structure

pdf-to-image-python/
β”œβ”€β”€ .gitignore          # Git ignore rules
β”œβ”€β”€ PDF/                # Input PDF directory
β”œβ”€β”€ output/             # Generated images directory (auto-created)
β”œβ”€β”€ LICENSE             # Project license
β”œβ”€β”€ README.md           # Main documentation
β”œβ”€β”€ STACKS.md           # Technical stack audit
β”œβ”€β”€ all_page.py         # Full PDF conversion script
β”œβ”€β”€ requirements.txt    # Dependency list
└── single_page.py      # First page conversion script

Requirements

  • Python 3.14+
  • PyMuPDF library

Setup Instructions

Make sure your environment is ready before running the tool:

  1. Create a virtual environment:
    python -m venv venv
    
  2. Activate the virtual environment:
    # On Windows
    venv\Scripts\activate
    
  3. Install the dependencies:
    pip install -r requirements.txt
    

Usage

Basic usage to convert a PDF in the most efficient way:

# Convert all pages of a PDF (provide folder or file)
python all_page.py "PDF_folder" --dpi 300 --format png

# Convert only the first page (Cover) for quick previews
python single_page.py "PDF_folder" --dpi 300 --format png

Citation

If you use this tool in your research or project, please cite it as follows:

@misc{pdf_to_image_2026,
  author = {Rembrant Oyangoren Albeos},
  title = {PDF to Image Python Utility},
  year = {2026},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/not-algorembrant/*}},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support