pdfFormDetector / README.md
luisabwk's picture
Create README.md
ab5abb1 verified

A newer version of the Gradio SDK is available: 6.14.0

Upgrade
metadata
title: pdfFormDetector
emoji: 📄
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 6.13.0
app_file: app.py
pinned: false
license: mit

CommonForms — Form Field Detector

Gradio Space que usa o modelo FFDNet-L (jbarrow/FFDNet-L-cpu) via o pacote oficial commonforms para converter PDFs em formulários preenchíveis, detectando:

  • text boxes
  • checkboxes (choice buttons)
  • signature fields (opcional)

Paper: CommonForms — arxiv 2509.16506

Arquivos

Arquivo Propósito
app.py UI Gradio + endpoint /detect
requirements.txt gradio==6.13.0 + commonforms
packages.txt poppler-utils (necessário pra pdf2image)
README.md Metadata do Space + docs

O pacote commonforms baixa o ONNX do FFDNet-L do HF Hub no primeiro uso.

Endpoint

​```python from gradio_client import Client, handle_file

client = Client("luisabwk/pdfFormDetector") out_pdf = client.predict( pdf_path=handle_file("/caminho/input.pdf"), image_size=1600, use_signature_fields=False, keep_existing_fields=False, api_name="/detect", ) print(out_pdf) # path para o PDF preenchível ​```

Parâmetros

  • image_size (512–2048, default 1600) — maior = mais preciso, mais lento.
  • use_signature_fields — detecta regiões de assinatura.
  • keep_existing_fields — preserva widgets AcroForm já presentes.