SLANet-1M / README.md
dimtri009's picture
Update README.md
a8ea21b verified
---
license: mit
datasets:
- dimtri009/SLANet-1M_dataset
language:
- en
base_model:
- PaddlePaddle/SLANet
tags:
- table-recognition
- table-structure-recognition
- slanet-1m
---
# SLANet-1M: A Lightweight Model for Table Recognition
![License](https://img.shields.io/badge/License-MIT-blue.svg)
![Python](https://img.shields.io/badge/python-3.9%2B-blue)
![Framework](https://img.shields.io/badge/framework-PyTorch-orange)
---
## 🧾 Overview
**SLANet-1M** is a lightweight convolutional model for **table recognition** designed to extract table structure and cell content from document images efficiently.
It is trained on over **one million synthetic and real-world tables** and provides competitive performance compared to transformer-based architectures—while being significantly smaller and faster.
This model was developed as part of a **Master’s thesis** at the University of Florence and the Swiss AI Center (iCoSys, Fribourg), and presented at **SwissText 2025**.
The paper is available [here](https://aclanthology.org/2025.swisstext-1.9/).
---
## 🚀 Key Features
- **Lightweight architecture** (≈9.2M parameters)
- **Transformer-free design** for CPU-friendly deployment
- **Trained on large-scale datasets** (PubTabNet + SynthTabNet)
- **Compatible with deployment pipelines** such as the *Core Engine*
- **Outputs** table structure in **HTML** format
---
## 📦 Model Details
| Property | Description |
|-----------|-------------|
| **Model Name** | SLANet-1M |
| **Architecture** | CNN-based (SLANet variant with depthwise separable convolutions) |
| **Parameters** | ~9.2 million |
| **Input Size** | 480×480 (RGB) |
| **Output Format** | HTML table structure |
| **Training Data** | PubTabNet + SynthTabNet (all subsets) |
| **Metrics** | S-TEDS: 99.36 on SynthTabNet and 97.36 on PubTabNet |
---
Please cite us:
```
@inproceedings{romaric-etal-2025-slanet,
title = "{SLAN}et-1{M}: A Lightweight and Efficient Model for Table Recognition with Minimal Computational Cost",
author = "Romaric, Nguinwa Mbakop Dimitri and
Petrucci, Andrea and
Marinai, Simone and
Hennebert, Jean",
editor = {Gerber, Jonathan and
Cieliebak, Mark and
Tuggener, Don and
H{\"u}rlimann, Manuela},
booktitle = "Proceedings of the 10th edition of the Swiss Text Analytics Conference",
month = may,
year = "2025",
address = "Winterthur, Switzerland",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.swisstext-1.9/",
pages = "89--102"
}
```