SLANet-1M: A Lightweight Model for Table Recognition

License Python Framework


๐Ÿงพ Overview

SLANet-1M is a lightweight convolutional model for table recognition designed to extract table structure and cell content from document images efficiently.
It is trained on over one million synthetic and real-world tables and provides competitive performance compared to transformer-based architecturesโ€”while being significantly smaller and faster.

This model was developed as part of a Masterโ€™s thesis at the University of Florence and the Swiss AI Center (iCoSys, Fribourg), and presented at SwissText 2025.

The paper is available here.


๐Ÿš€ Key Features

  • Lightweight architecture (โ‰ˆ9.2M parameters)
  • Transformer-free design for CPU-friendly deployment
  • Trained on large-scale datasets (PubTabNet + SynthTabNet)
  • Compatible with deployment pipelines such as the Core Engine
  • Outputs table structure in HTML format

๐Ÿ“ฆ Model Details

Property Description
Model Name SLANet-1M
Architecture CNN-based (SLANet variant with depthwise separable convolutions)
Parameters ~9.2 million
Input Size 480ร—480 (RGB)
Output Format HTML table structure
Training Data PubTabNet + SynthTabNet (all subsets)
Metrics S-TEDS: 99.36 on SynthTabNet and 97.36 on PubTabNet

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for dimtri009/SLANet-1M

Quantizations
1 model