Upload folder using huggingface_hub

8254e28 verified 2 days ago

11 kB

	---
	language: en
	license: mit
	tags:
	- court-detection
	- keypoint-detection
	- sports-analysis
	- badminton
	- tennis
	- pytorch
	- computer-vision
	- octave-convolution
	- geometric-constraints
	- polar-transform
	datasets:
	- adithyanraj03/CourtKeyNet-Dataset
	metrics:
	- pck
	- iou
	pipeline_tag: keypoint-detection
	library_name: pytorch
	model-index:
	- name: CourtKeyNet
	results:
	- task:
	type: keypoint-detection
	name: Court Keypoint Detection
	dataset:
	name: CourtKeyNet Badminton Court Dataset
	type: custom
	metrics:
	- name: PCK
	type: pck
	value: 99.99
	- name: IoU
	type: iou
	value: 99.67
	---

	<div align="center">
	<img src="assets/teasser.png" alt="CourtKeyNet Banner" width="100%">

	# CourtKeyNet: A Novel Octave-Based Architecture for Precision Court Detection

	Adithya N Raj

	[![Paper](https://img.shields.io/static/v1?label=Paper&message=ScienceDirect&color=red&logo=elsevier&logoColor=white)](https://www.sciencedirect.com/science/article/pii/S2666827026000496)
	[![GitHub](https://img.shields.io/static/v1?label=Code&message=GitHub&color=black&logo=github)](https://github.com/adithyanraj03/CourtKeyNet)
	[![Dataset](https://img.shields.io/badge/%F0%9F%93%81%20Dataset-GitHub-blue)](https://github.com/adithyanraj03/Paper_09_Data-Set_CourtKeyNet)
	[![License](https://img.shields.io/badge/License-MIT-green)](LICENSE)
	[![SafeTensors](https://img.shields.io/badge/Format-SafeTensors-8A2BE2)](https://huggingface.co/docs/safetensors)

	</div>

	---

	## Model Summary

	CourtKeyNet is a lightweight deep learning architecture for precision court keypoint detection in sports videos. It detects 4 corner keypoints of badminton and tennis courts from a single image, enabling downstream applications like court homography estimation, match statistics generation, and automated broadcasting systems.

	Published in [Machine Learning with Applications (Elsevier)](https://www.sciencedirect.com/science/article/pii/S2666827026000496), 2026.

	### Key Features

	- 🎯 99.99% PCK and 99.67% IoU on the fine-tuned model
	- ⚡ Lightweight and fast — efficient architecture for real-time use
	- 🔬 Octave Feature Extractor — Multi-frequency feature decomposition for capturing fine court details and global structural context
	- 🎱 Polar Transform Attention — Boundary detection in polar coordinates for precise court line localization
	- 📐 Geometric Consistency Loss — Ensures structurally valid quadrilateral outputs
	- 🏸 Supports both badminton and tennis court detection

	---

	## Model Variants

	\| Variant \| File \| Format \| Val Loss \| PCK (%) \| IoU (%) \| Description \|
	\|---------\|------\|--------\|----------\|---------\|---------\|-------------\|
	\| CourtKeyNet-Base \| `pretrained/courtkeynet_base.safetensors` \| SafeTensors \| 0.0977 \| 99.09 \| 94.33 \| Pre-trained on 140k+ images from scratch \|
	\| CourtKeyNet-Finetuned ⭐ \| `finetuned/courtkeynet_finetuned.safetensors` \| SafeTensors \| 0.0013 \| 99.99 \| 99.67 \| Fine-tuned on 7k+ clean annotated images \|

	> Recommendation: Use the fine-tuned variant for best results. The pre-trained base model is provided for researchers who wish to fine-tune on their own custom court datasets.

	---

	## Quick Start

	### 1. Download Weights

	```bash
	# Install huggingface CLI
	pip install "huggingface_hub[cli]"

	# Download the full repository
	huggingface-cli download Cracked-ANJ/CourtKeyNet --local-dir ./courtkeynet-weights

	# Or download only the fine-tuned model (recommended)
	huggingface-cli download Cracked-ANJ/CourtKeyNet finetuned/courtkeynet_finetuned.safetensors --local-dir ./courtkeynet-weights
	```

	### 2. Run Inference

	For inference, training, and fine-tuning scripts, use the full source code from the [GitHub Repository](https://github.com/adithyanraj03/CourtKeyNet):

	```bash
	# Clone the source code
	git clone https://github.com/adithyanraj03/CourtKeyNet.git
	cd CourtKeyNet

	# Install dependencies
	pip install -r requirements.txt

	# Run Inference Studio (GUI)
	cd courtkeynet
	python inference.py
	```

	> The Inference Studio provides a complete GUI with image/video/webcam support, confidence scoring, and real-time visualization.

	---

	## Architecture

	<div align="center">
	<img src="assets/architecture.png" alt="CourtKeyNet Architecture" width="80%">
	</div>



	<details>
	<summary style="font-size: 1.2em; font-weight: bold; cursor: pointer;">🔍 How Confidence Detection Works (Visual Explanation)</summary>

	### How Confidence Detection Works

	The model (`CourtKeyNet`) works like this:

	![Architecture](assets/data_1.png)

	Problem: It has no "court detector" — it assumes every image IS a court!

	---

	#### What the Model Actually Outputs Internally

	When you run `model(image)`, it returns a dictionary with these components:

	```python
	outputs = {
	'heatmaps': Tensor[B, 4, 160, 160], # 4 gaussian peaks (one per corner)
	'kpts_init': Tensor[B, 4, 2], # Initial keypoints from heatmaps
	'kpts_refined': Tensor[B, 4, 2], # Final refined keypoints
	'features': Tensor[B, 256, 20, 20] # Feature maps (optional)
	}
	```

	#### Visualization of Heatmap Output

	For a real court image:
	```text
	Heatmap for Corner 0 (Top-Left):
	```
	![Heatmap](assets/data_2.png)

	For a non-court image (e.g., random person):
	```text
	Heatmap for Corner 0:
	```
	![Heatmap](assets/data_3.png)

	---

	#### 3 Confidence Metrics

	##### 1️⃣ Heatmap Peak Confidence (Primary Signal)

	What it measures: How "peaky" the heatmap is

	```python
	max_values = heatmaps.max(dim=(2,3)) # Find highest value in each heatmap
	conf_heatmap = max_values.mean() # Average across 4 corners
	```

	Visual comparison:

	![Comparison](assets/data_4.png)

	---

	##### 2️⃣ Heatmap Entropy (Uncertainty)

	What it measures: How "spread out" the probability is

	```python
	# Entropy = -Σ(p * log(p))
	# Low entropy = focused (good)
	# High entropy = random noise (bad)
	```

	Visual comparison:

	![Entropy_Comparison](assets/data_5.png)

	---

	##### 3️⃣ Geometric Validity (Shape Check)

	What it checks: Does the quad look like a real court?

	```text
	Checklist:
	✓ Are corners in correct positions? (TL upper-left, BR lower-right)
	✓ Is the quad convex? (no crossed lines)
	✓ Is the area reasonable? (not too tiny, not entire image)
	✓ Is aspect ratio court-like? (not a thin line)
	```

	Visual examples:

	![Visual_Example](assets/data_6.png)

	</details>

	---

	## Training Details

	### Pre-training (from scratch)

	\| Parameter \| Value \|
	\|-----------\|-------\|
	\| Dataset \| 140,000+ badminton & tennis court images \|
	\| Epochs \| 158 (early stopping, patience=20) \|
	\| Best Epoch \| 138 \|
	\| Optimizer \| AdamW \|
	\| Learning Rate \| 5e-5 (cosine schedule) \|
	\| Batch Size \| 47 \|
	\| Image Size \| 640×640 \|
	\| Mixed Precision \| ✓ FP16 \|
	\| Hardware \| NVIDIA RTX 5090 (32GB) \|

	### Fine-tuning

	\| Parameter \| Value \|
	\|-----------\|-------\|
	\| Base Model \| Pre-trained CourtKeyNet-Base (epoch 138) \|
	\| Dataset \| 7,000+ precisely annotated clean images \|
	\| Epochs \| 79 (early stopping, patience=20) \|
	\| Best Epoch \| 59 \|
	\| Learning Rate \| 1e-4 \|
	\| Batch Size \| 48 \|
	\| Geometric Loss \| Enabled (edge + diagonal + angle constraints) \|

	---

	## Evaluation Results

	### Fine-tuned Model (Recommended)

	\| Metric \| Score \|
	\|--------\|-------\|
	\| PCK (Percentage of Correct Keypoints) \| 99.99% \|
	\| IoU (Intersection over Union) \| 99.67% \|
	\| Validation Loss \| 0.0013 \|

	### Pre-trained Base Model

	\| Metric \| Score \|
	\|--------\|-------\|
	\| PCK \| 99.09% \|
	\| IoU \| 94.33% \|
	\| Validation Loss \| 0.0977 \|

	### Training Metrics

	<div align="center">
	<img src="assets/training_metrics.png" alt="Training Metrics" width="100%">
	<p><em>Pre-training metrics across 158 epochs showing loss convergence and PCK/IoU progression</em></p>
	</div>

	<div align="center">
	<img src="assets/finetune_detail.png" alt="Fine-tuning Detail" width="100%">
	<p><em>Fine-tuning convergence: rapid loss decrease with PCK reaching 99.99% and IoU reaching 99.67%</em></p>
	</div>

	### Validation Samples

	<div align="center">
	<table>
	<tr>
	<td align="center"><strong>Epoch 1 (Early Fine-tuning)</strong></td>
	<td align="center"><strong>Epoch 79 (Final)</strong></td>
	</tr>
	<tr>
	<td><img src="assets/val_sample_early.jpg" alt="Early Validation" width="100%"></td>
	<td><img src="assets/val_sample_final.jpg" alt="Final Validation" width="100%"></td>
	</tr>
	</table>
	<p><em>Red = Predicted court boundary \| Green = Ground truth \| Yellow dots = Keypoints</em></p>
	</div>

	---

	## Fine-tuning on Custom Data

	To fine-tune CourtKeyNet on your own court dataset (any sport):

	```bash
	# 1. Clone the source code
	git clone https://github.com/adithyanraj03/CourtKeyNet.git
	cd CourtKeyNet

	# 2. Download weights
	huggingface-cli download Cracked-ANJ/CourtKeyNet pretrained/courtkeynet_base.pt --local-dir ./weights

	# 3. Fine-tune
	cd courtkeynet
	python finetune.py
	```

	> Note: Fine-tuning requires only 5,000–7,000 clean annotated images. Training from scratch requires 140,000+ images.

	---

	## Intended Use

	### ✅ Appropriate Uses
	- Badminton and tennis court detection in sports videos
	- Court homography estimation for tactical analysis
	- Generating match statistics from broadcast footage
	- Automated camera calibration for sports broadcasting
	- Academic research in sports video analysis

	### ⚠️ Limitations
	- Trained specifically on badminton and tennis courts; other court types may require fine-tuning
	- Expects single-court images; multi-court scenes are not supported
	- Performance may degrade on heavily occluded courts or extreme camera angles
	- The model detects 4 outer corners only (not inner court lines)

	---

	## Citation

	If you use CourtKeyNet in your research, please cite:

	Paper: [CourtKeyNet: A novel octave-based architecture for precision badminton court detection with geometric constraints](https://www.sciencedirect.com/science/article/pii/S2666827026000496)
	DOI: [10.1016/j.mlwa.2026.100884](https://doi.org/10.1016/j.mlwa.2026.100884)

	```bibtex
	@article{NRAJ2026100884,
	title = {CourtKeyNet: A novel octave-based architecture for precision badminton court detection with geometric constraints},
	journal = {Machine Learning with Applications},
	volume = {24},
	pages = {100884},
	year = {2026},
	issn = {2666-8270},
	doi = {https://doi.org/10.1016/j.mlwa.2026.100884},
	url = {https://www.sciencedirect.com/science/article/pii/S2666827026000496},
	author = {Adithya N Raj and Prethija G.}
	}
	```

	---

	## License

	This project is released under the [MIT License](LICENSE), suitable for both academic and commercial use.

	## Contact

	For questions or collaboration opportunities: adithyanraj03@gmail.com