Diagram Detector - v5

High-accuracy diagram detection model for academic papers using YOLO11 architecture.

Model Description

This model detects diagrams, figures, and illustrations in academic papers. It is trained on mathematical publications with rigorous augmentation to preserve semantic meaning of diagrams.

Performance

Page-Level Metrics (Binary Classification)

  • Binary F1: 98.49%
  • Precision: 98.58%
  • Recall: 98.40%
  • Accuracy: 98.37%

Box-Level Metrics (Object Detection)

  • mAP50: N/A
  • mAP50-95: N/A
  • Precision: N/A
  • Recall: N/A
  • Best Epoch: N/A

Optimal Inference Parameters

  • Confidence Threshold: 0.20
  • IOU Threshold: 0.30

These thresholds were determined through grid search optimization for best page-level performance.

Usage

from diagram_detector import DiagramDetector

# Initialize detector
detector = DiagramDetector(model='diagram-detector-v5')

# Detect diagrams in a PDF
results = detector.detect_pdf('paper.pdf')

# Or detect in images
results = detector.detect('path/to/images/')

Training Details

Dataset Splits

  • Training: 70.00%
  • Validation: 15.00%
  • Test: 15.00%

Augmentation

  • Rotation: ±0.0°
  • Translation: ±10%
  • Scale: ±20%
  • Horizontal/Vertical Flips: Disabled (preserves semantic meaning)

Hyperparameters

  • Image Size: 640px
  • Batch Size: auto
  • Epochs: 200 (patience: 50)
  • Optimizer: auto
  • Learning Rate: 0.0100 → 0.0100

Limitations

  • Optimized for academic papers and mathematical publications
  • May require fine-tuning for other document types
  • Performance may vary on hand-drawn or low-quality diagrams
  • No horizontal/vertical flips during training (maintains orientation semantics)

Citation

@software{diagram_detector_yolo11n_2026,
  author = {Sørensen, Henrik Kragh},
  title = {Diagram Detector Model v5},
  year = {2026},
  publisher = {Hugging Face},
  url = {https://huggingface.co/hksorensen/diagram-detector-model}
}

Additional Resources

  • Package: pip install diagram-detector
  • Repository: Private (contact author for access)

Model uploaded: 2026-01-12T19:28:05.111806

Downloads last month
10
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support