Improve model card
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,10 +1,48 @@
|
|
| 1 |
---
|
|
|
|
|
|
|
| 2 |
tags:
|
| 3 |
- model_hub_mixin
|
| 4 |
- pytorch_model_hub_mixin
|
| 5 |
---
|
| 6 |
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
-
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
pipeline_tag: object-detection
|
| 4 |
tags:
|
| 5 |
- model_hub_mixin
|
| 6 |
- pytorch_model_hub_mixin
|
| 7 |
---
|
| 8 |
|
| 9 |
+
# EdgeCrafter: Compact ViTs for Edge Dense Prediction
|
| 10 |
+
|
| 11 |
+
EdgeCrafter is a unified framework for compact Vision Transformers (ViTs) optimized for edge dense prediction tasks like object detection, instance segmentation, and pose estimation. It bridges the performance gap between compact ViTs and CNN-based architectures (like YOLO) through task-specialized distillation and edge-friendly architectural designs.
|
| 12 |
+
|
| 13 |
+
- **Paper:** [EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation](https://huggingface.co/papers/2603.18739)
|
| 14 |
+
- **Project Page:** [https://intellindust-ai-lab.github.io/projects/EdgeCrafter/](https://intellindust-ai-lab.github.io/projects/EdgeCrafter/)
|
| 15 |
+
- **Repository:** [https://github.com/Intellindust-AI-Lab/EdgeCrafter](https://github.com/Intellindust-AI-Lab/EdgeCrafter)
|
| 16 |
+
|
| 17 |
+
## Model Description
|
| 18 |
+
|
| 19 |
+
Compact ViTs often struggle with task-specific representation learning on edge devices. EdgeCrafter addresses this by introducing **ECDet**, a detection model built from a distilled compact backbone. On the COCO dataset, ECDet-S achieves 51.7 AP with fewer than 10M parameters. This framework demonstrates that when paired with task-specialized distillation, compact ViTs are highly competitive for real-time edge applications.
|
| 20 |
+
|
| 21 |
+
## Quick Start (Inference)
|
| 22 |
+
|
| 23 |
+
To run inference on a sample image using the pre-trained model, follow these steps from the official repository:
|
| 24 |
+
|
| 25 |
+
```bash
|
| 26 |
+
# 1. Clone the repository and install dependencies
|
| 27 |
+
git clone https://github.com/Intellindust-AI-Lab/EdgeCrafter
|
| 28 |
+
cd EdgeCrafter
|
| 29 |
+
pip install -r requirements.txt
|
| 30 |
+
|
| 31 |
+
# 2. Run PyTorch inference
|
| 32 |
+
# Make sure to replace `path/to/your/image.jpg` with an actual image path
|
| 33 |
+
cd ecdetseg
|
| 34 |
+
python tools/inference/torch_inf.py -c configs/ecdet/ecdet_l.yml -r ecdet_l.pth -i path/to/your/image.jpg
|
| 35 |
+
```
|
| 36 |
+
|
| 37 |
+
## Citation
|
| 38 |
+
|
| 39 |
+
If you find this project useful in your research, please consider citing:
|
| 40 |
+
|
| 41 |
+
```bibtex
|
| 42 |
+
@article{liu2026edgecrafter,
|
| 43 |
+
title={EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation},
|
| 44 |
+
author={Liu, Longfei and Hou, Yongjie and Li, Yang and Wang, Qirui and Sha, Youyang and Yu, Yongjun and Wang, Yinzhi and Ru, Peizhe and Yu, Xuanlong and Shen, Xi},
|
| 45 |
+
journal={arXiv},
|
| 46 |
+
year={2026}
|
| 47 |
+
}
|
| 48 |
+
```
|