ilMassy
/

advertising-panel-segmentation

semantic-segmentation

advertising-boards

Model card Files Files and versions

advertising-panel-segmentation / README.md

ilMassy's picture

Upload README.md with huggingface_hub

227728d verified 11 days ago

|

history blame contribute delete

3 kB

	---
	license: mit
	tags:
	- segmentation
	- semantic-segmentation
	- segformer
	- sports
	- advertising-boards
	- mmsegmentation
	---

	# Advertising Panel Segmentation — SegFormer

	Semantic segmentation models for detecting advertising LED boards in sports broadcasting footage.

	## Dataset
	- 967 images at 1920x1080 from sports highlights
	- Manually annotated via CVAT.ai with polygon annotations
	- Binary segmentation: background (0) vs advertising board (1)
	- Split: Train 672 / Val 149 / Test 146

	## Models

	\| Experiment \| Model \| Augmentation \| mIoU \| Board IoU \| Dice \| Precision \| Recall \|
	\|---\|---\|---\|---\|---\|---\|---\|---\|
	\| Exp0 \| SegFormer-B0 \| Standard \| 87.15% \| 76.28% \| 86.55% \| 84.13% \| 89.10% \|
	\| Exp1 \| SegFormer-B1 \| Standard \| 84.29% \| 71.12% \| 83.12% \| 79.26% \| 87.39% \|
	\| Exp2 \| SegFormer-B1 \| Sport-specific \| 87.26% \| 76.45% \| 86.66% \| 85.76% \| 87.57% \|
	\| Exp3-opt1 \| SegFormer-B1 \| Sport-specific reduced + LR 3e-5 + channels=512 \| 84.58% \| 71.68% \| 83.51% \| 78.66% \| 88.99% \|
	\| Exp3-opt2 \| SegFormer-B1 \| Sport-specific reduced + LR 3e-5 + channels=256 \| 84.92% \| 72.26% \| 83.90% \| 80.40% \| 87.71% \|
	\| Exp3-opt3 \| SegFormer-B1 \| Sport-specific + drop_path=0.15 \| 86.42% \| 74.99% \| 85.71% \| 82.25% \| 89.47% \|

	Best model: Exp2 - SegFormer-B1 Augmented (`models/exp2_segformer_b1_augmented/best_mIoU_iter_14000.pth`)

	## Detailed Results

	### Exp0 - SegFormer-B0 Baseline
	- Best checkpoint: best_mIoU_iter_18000.pth
	- mIoU: 87.15% \| Board IoU: 76.28% \| Dice: 86.55% \| Precision: 84.13% \| Recall: 89.10%

	### Exp1 - SegFormer-B1 Standard
	- Best checkpoint: best_mIoU_iter_10000.pth
	- mIoU: 84.29% \| Board IoU: 71.12% \| Dice: 83.12% \| Precision: 79.26% \| Recall: 87.39%

	### Exp2 - SegFormer-B1 Augmented ⭐ Best
	- Best checkpoint: best_mIoU_iter_14000.pth
	- mIoU: 87.26% \| Board IoU: 76.45% \| Dice: 86.66% \| Precision: 85.76% \| Recall: 87.57%

	### Exp3 - SegFormer-B1 Optimized1
	- Best checkpoint: best_mIoU_iter_14000.pth
	- mIoU: 84.58% \| Board IoU: 71.68% \| Dice: 83.51% \| Precision: 78.66% \| Recall: 88.99%

	### Exp3 - SegFormer-B1 Optimized2
	- Best checkpoint: best_mIoU_iter_12000.pth
	- mIoU: 84.92% \| Board IoU: 72.26% \| Dice: 83.90% \| Precision: 80.40% \| Recall: 87.71%

	### Exp3 - SegFormer-B1 Optimized3
	- Best checkpoint: best_mIoU_iter_18000.pth
	- mIoU: 86.42% \| Board IoU: 74.99% \| Dice: 85.71% \| Precision: 82.25% \| Recall: 89.47%

	## Framework
	- PyTorch 2.4.1
	- MMSegmentation 1.2.2
	- mmcv 2.2.0

	## Repository Structure
	```
	models/
	exp0_segformer_b0_baseline/best_mIoU_iter_18000.pth
	exp1_segformer_b1_standard/best_mIoU_iter_10000.pth
	exp2_segformer_b1_augmented/best_mIoU_iter_14000.pth
	exp3_segformer_b1_optimized/best_mIoU_iter_14000.pth
	exp3_segformer_b1_optimized2/best_mIoU_iter_12000.pth
	exp3_segformer_b1_optimized3/best_mIoU_iter_18000.pth
	dataset/
	processed.zip (967 images + masks, train/val/test split)
	```

	## GitHub
	https://github.com/ilMassy/advertising-panel-segmentation