Spaces:

blacksinisterx
/

DetectifAI-Backend

Sleeping

App Files Files Community

DetectifAI-Backend / report_generation /README.md

blacksinisterx

fix: keyframe images, video clips, evidence images, live stream webcam+URL, remove demo mode

fd50325 verified 3 months ago

preview code

raw

history blame contribute delete

8.24 kB

DetectifAI Report Generation Module

Automatically generates professional forensic incident reports from detected surveillance events using a local LLM.

📋 Overview

This module takes already-processed event data (detections, timestamps, captions, keyframes) and uses a local instruction-tuned LLM to generate structured, professional reports exportable as PDF or HTML.

Key Features

Offline Operation: Uses local LLM (Qwen2.5-3B-Instruct or Phi-3-mini)
Deterministic Output: No hallucinations - only uses provided data
Professional Reports: Structured Markdown converted to PDF/HTML
Evidence Integration: Embeds keyframes and face crops
Zero Cloud Dependencies: Everything runs locally

🛠️ Prerequisites

System Requirements

Component	Minimum	Recommended
RAM	8 GB	16 GB
CPU	4 cores	8+ cores
Disk Space	5 GB	10 GB
GPU (optional)	None	NVIDIA with CUDA

Software Requirements

Python 3.9+ (already installed for DetectifAI)
GTK3 Runtime (for WeasyPrint PDF export on Windows)

📦 Installation

Step 1: Install Python Dependencies

# Navigate to backend directory
cd backend

# Install report generation dependencies
pip install llama-cpp-python huggingface_hub jinja2 markdown weasyprint reportlab Pillow

Note for GPU acceleration (optional):

# For NVIDIA CUDA support
CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python --force-reinstall --no-cache-dir

# For Windows with CUDA
set CMAKE_ARGS=-DLLAMA_CUBLAS=on
pip install llama-cpp-python --force-reinstall --no-cache-dir

Step 2: Install GTK3 (Required for PDF Export on Windows)

WeasyPrint requires GTK3 runtime. Choose one method:

Option A: Using MSYS2 (Recommended)

Download MSYS2 from: https://www.msys2.org/
Install and open MSYS2 terminal
Run: pacman -S mingw-w64-x86_64-gtk3
Add to PATH: C:\msys64\mingw64\bin

Option B: Standalone GTK3

Download from: https://github.com/nicothin/MSYS2-GTK-Windows
Extract to C:\GTK3
Add C:\GTK3\bin to system PATH

Option C: Skip PDF (Use HTML only)

If GTK3 is problematic, use HTML export or the SimplePDFExporter (reportlab-based)

Step 3: Download the LLM Model

The module will auto-download on first use, but you can pre-download:

# Run the download script
python -c "from report_generation.llm_engine import LLMEngine; e = LLMEngine(); e.download_model()"

Or manually download:

Model	Size	License	Download
Qwen2.5-3B-Instruct (Primary)	~2 GB	Apache 2.0	HuggingFace
Phi-3-mini-4k-instruct (Alt)	~2.3 GB	MIT	HuggingFace

Download the q4_k_m.gguf quantized version and place in:

backend/report_generation/models/qwen2.5-3b-instruct-q4_k_m.gguf

🚀 Usage

Basic Usage

from report_generation import ReportGenerator

# Initialize generator
generator = ReportGenerator()

# Generate report for a video
report = generator.generate_report(
    video_id="video_20240101_120000_abc123"
)

# Export as HTML
html_path = generator.export_html(report)
print(f"HTML report: {html_path}")

# Export as PDF
pdf_path = generator.export_pdf(report)
print(f"PDF report: {pdf_path}")

With Time Range Filter

from datetime import datetime

report = generator.generate_report(
    video_id="video_20240101_120000_abc123",
    time_range=(
        datetime(2024, 1, 1, 12, 0, 0),
        datetime(2024, 1, 1, 13, 0, 0)
    )
)

Selective Sections

# Generate only specific sections
report = generator.generate_report(
    video_id="video_123",
    include_sections=['header', 'executive_summary', 'timeline', 'conclusion']
)

Without LLM (Fallback Mode)

If the LLM fails to load, the module automatically uses fallback templates:

from report_generation.config import ReportConfig

# Explicitly disable LLM
config = ReportConfig()
config.llm.n_gpu_layers = 0  # CPU only
config.llm.n_threads = 2     # Reduce for slow systems

generator = ReportGenerator(config)

📁 Module Structure

report_generation/
├── __init__.py              # Package initialization
├── config.py                # Configuration (LLM, paths, settings)
├── llm_engine.py            # LLM loading and inference
├── prompt_templates.py      # Prompt engineering templates
├── data_collector.py        # MongoDB data collection
├── report_builder.py        # Main orchestration
├── html_renderer.py         # Jinja2 HTML generation
├── pdf_exporter.py          # WeasyPrint/reportlab PDF export
├── templates/               # HTML/CSS templates
│   ├── report_base.html
│   └── report_styles.css
├── models/                  # LLM model files (.gitignored)
│   └── .gitkeep
└── README.md

📊 Report Sections

Section	Description	LLM Generated
Header	Report ID, metadata, video info	❌
Executive Summary	Overview of findings	✅
Incident Timeline	Chronological event list	✅
Evidence Catalog	Keyframes and face crops	✅
Observations	Pattern analysis	✅
Conclusion	Summary and recommendations	✅

⚙️ Configuration

Edit config.py or pass custom config:

from report_generation.config import ReportConfig, LLMConfig

# Custom LLM settings
llm_config = LLMConfig(
    n_threads=8,           # More CPU threads
    n_gpu_layers=35,       # Offload to GPU
    temperature=0.1,       # Low for determinism
    max_tokens=2048        # Max output length
)

config = ReportConfig(llm=llm_config)
config.organization_name = "My Security Company"
config.report_classification = "INTERNAL"

generator = ReportGenerator(config)

🔧 Troubleshooting

LLM Won't Load

Error: Model not found

Solution: Download the model manually or check path in config.py

PDF Export Fails on Windows

OSError: cannot load library 'gobject-2.0-0'

Solution: Install GTK3 runtime and add to PATH (see Step 2)

Out of Memory

RuntimeError: CUDA out of memory

Solutions:

Set n_gpu_layers=0 for CPU-only
Use smaller context: n_ctx=2048
Close other applications

Slow Generation

Solutions:

Increase n_threads (up to CPU core count)
Enable GPU with n_gpu_layers > 0
Use smaller model (Phi-3 instead of Qwen)

📝 API Reference

ReportGenerator

class ReportGenerator:
    def __init__(config: ReportConfig = None)
    def initialize() -> bool
    def generate_report(
        video_id: str,
        time_range: Tuple[datetime, datetime] = None,
        include_sections: List[str] = None
    ) -> GeneratedReport
    def export_html(report: GeneratedReport, output_path: str = None) -> str
    def export_pdf(report: GeneratedReport, output_path: str = None) -> str

GeneratedReport

@dataclass
class GeneratedReport:
    report_id: str
    video_id: str
    title: str
    generated_at: datetime
    time_range: Tuple[datetime, datetime]
    sections: List[ReportSection]
    metadata: Dict[str, Any]
    statistics: Dict[str, Any]

🔒 Security Notes

Local Processing: All data stays on your machine
No Cloud Calls: LLM runs entirely offline
Fact-Based: Reports only contain provided data
Confidential Marking: Reports are marked CONFIDENTIAL by default

📄 License

This module is part of DetectifAI and follows the project license.

The recommended LLM models have the following licenses:

Qwen2.5: Apache 2.0
Phi-3: MIT

Both are free for commercial use.