metadata

title: ComicPanelsAndTextDetect
emoji: 📊
colorFrom: indigo
colorTo: red
sdk: gradio
sdk_version: 6.14.0
python_version: '3.13'
app_file: app.py
pinned: false
short_description: Using YOLO to detect comic panels and text

🚀 ebookcc - AI Comic Panel & Text Detector (Demo)

Welcome to the AI vision core demo for ebookcc! This Space hosts a custom-trained YOLO26n-seg instance segmentation model, optimized specifically for comics, manga, manhwa, and standard book layouts.

🌟 Live Demo Features

Panel Detection: Automatically identifies the boundaries of comic frames, preparing them for reading-flow re-layout.
Text Bubble Segmentation: Segments speech bubbles and on-page text areas so the regions can be passed directly to an AI OCR step.
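As an illustration of what "reading-flow re-layout" could involve once panels are detected, here is a minimal sketch that orders panel bounding boxes into a reading sequence. It is not the ebookcc implementation: the function name `reading_order`, the `(x1, y1, x2, y2)` box format, and the `row_tolerance` heuristic are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical box format for this sketch: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


def reading_order(
    panels: List[Box],
    right_to_left: bool = False,
    row_tolerance: float = 0.5,
) -> List[Box]:
    """Group panels into rows by vertical overlap, then sort each row.

    right_to_left=True gives a manga-style order; the default suits
    left-to-right comics. row_tolerance is the fraction of the shorter
    panel's height that must overlap for two panels to share a row.
    """
    rows: List[List[Box]] = []
    # Visit panels from top to bottom so rows are created in page order.
    for box in sorted(panels, key=lambda b: b[1]):
        _, y1, _, y2 = box
        for row in rows:
            ry1, ry2 = row[0][1], row[0][3]
            overlap = min(y2, ry2) - max(y1, ry1)
            if overlap > row_tolerance * min(y2 - y1, ry2 - ry1):
                row.append(box)
                break
        else:
            # No existing row overlaps enough: start a new one.
            rows.append([box])

    ordered: List[Box] = []
    for row in rows:
        row.sort(key=lambda b: b[0], reverse=right_to_left)
        ordered.extend(row)
    return ordered
```

Real pages with overlapping or irregular panels would likely need a more careful heuristic than this row grouping.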

🛠️ Tech Stack & Background

This web demo is part of the ebookcc.com project.

Backend Model: Fine-tuned based on Ultralytics YOLO11m-seg.
Frontend Deployment: The full Web App runs in the user's browser via TF.js and integrates closely with Cloudflare Workers to reduce server costs.

📝 How to Use This Model Locally

If you want to call this model in your own Python project, please check out our Hugging Face Model Card.

from ultralytics import YOLO

# Load the model
model = YOLO("comic-panels-and-text-detect.pt")

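# Run inference with the settings suggested for manga pages
results = model.predict("comic_page.jpg", imgsz=1280, conf=0.25)

# Display the first (and only) result with its predictions drawn on the image
results[0].show()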
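To work with the detections programmatically rather than only viewing them, the boxes can be grouped by class name. The sketch below assumes the standard Ultralytics `Results` attributes (`boxes.xyxy`, `boxes.cls`, `names`); the helper name `detections_by_class` is hypothetical, and the actual class labels depend on how this model was trained, so check `model.names` before relying on any specific label.

```python
from collections import defaultdict
from typing import Any, Dict, List


def detections_by_class(result: Any) -> Dict[str, List[List[float]]]:
    """Group bounding boxes from one Ultralytics result by class name.

    Returns a dict mapping each class name to a list of
    [x1, y1, x2, y2] boxes in pixel coordinates.
    """
    if result.boxes is None:
        return {}

    grouped: Dict[str, List[List[float]]] = defaultdict(list)
    boxes = result.boxes.xyxy.tolist()    # one [x1, y1, x2, y2] per detection
    class_ids = result.boxes.cls.tolist()  # class index per detection (as float)

    for box, class_id in zip(boxes, class_ids):
        grouped[result.names[int(class_id)]].append(box)
    return dict(grouped)
```

Calling `detections_by_class(results[0])` on the output of the snippet above should give one list of boxes per detected class.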

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference