Spaces:
Sleeping
Sleeping
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -1,12 +1,91 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
sdk: gradio
|
| 7 |
-
sdk_version:
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
---
|
| 11 |
|
| 12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: IPAD VAD Training
|
| 3 |
+
emoji: π
|
| 4 |
+
colorFrom: blue
|
| 5 |
+
colorTo: purple
|
| 6 |
sdk: gradio
|
| 7 |
+
sdk_version: 4.44.0
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
+
license: mit
|
| 11 |
+
tags:
|
| 12 |
+
- video-anomaly-detection
|
| 13 |
+
- industrial-ai
|
| 14 |
+
- computer-vision
|
| 15 |
+
- pytorch
|
| 16 |
+
- swin-transformer
|
| 17 |
+
duplicated_from:
|
| 18 |
+
short_description: Train IPAD video anomaly detection models on ZeroGPU
|
| 19 |
+
hardware: zero-a100
|
| 20 |
---
|
| 21 |
|
| 22 |
+
# π IPAD: Industrial Process Anomaly Detection Training
|
| 23 |
+
|
| 24 |
+
Train state-of-the-art video anomaly detection models on industrial datasets using ZeroGPU.
|
| 25 |
+
|
| 26 |
+
## Features
|
| 27 |
+
|
| 28 |
+
- π **ZeroGPU Training**: Free access to NVIDIA H200 GPUs
|
| 29 |
+
- π **12 Industrial Devices**: Synthetic training data for various equipment
|
| 30 |
+
- π― **Baseline Model**: Video Swin Transformer + Periodic Memory Module
|
| 31 |
+
- πΎ **Automatic Checkpointing**: Save and upload trained models to HF Hub
|
| 32 |
+
- π **Real-time Monitoring**: Track training progress with Gradio interface
|
| 33 |
+
|
| 34 |
+
## Quick Start
|
| 35 |
+
|
| 36 |
+
1. **Download Dataset**: Click "Download Dataset" to fetch the 8.3GB IPAD dataset from HF Hub
|
| 37 |
+
2. **Quick Test**: Verify GPU access with the quick test (no dataset required)
|
| 38 |
+
3. **Train Baseline**: Select a device and start training with default hyperparameters
|
| 39 |
+
|
| 40 |
+
## Dataset
|
| 41 |
+
|
| 42 |
+
The IPAD dataset contains:
|
| 43 |
+
- **16 Industrial Devices** (12 synthetic + 4 real)
|
| 44 |
+
- **597,979 Video Frames**
|
| 45 |
+
- **39 Anomaly Classes**
|
| 46 |
+
- **Frame-level Annotations**
|
| 47 |
+
|
| 48 |
+
[View Dataset β](https://huggingface.co/datasets/MSherbinii/ipad-industrial-anomaly)
|
| 49 |
+
|
| 50 |
+
## Model Architecture
|
| 51 |
+
|
| 52 |
+
```
|
| 53 |
+
Input (16 frames, 256Γ256)
|
| 54 |
+
β
|
| 55 |
+
Video Swin Transformer Encoder
|
| 56 |
+
β
|
| 57 |
+
Periodic Memory Module (2000-dim)
|
| 58 |
+
β
|
| 59 |
+
I3D Decoder
|
| 60 |
+
β
|
| 61 |
+
Reconstructed Output
|
| 62 |
+
```
|
| 63 |
+
|
| 64 |
+
## Baseline Results
|
| 65 |
+
|
| 66 |
+
| Device | AUC (%) | Device | AUC (%) |
|
| 67 |
+
|--------|---------|--------|---------|
|
| 68 |
+
| S01 | 69.5 | S07 | 60.6 |
|
| 69 |
+
| S02 | 63.9 | S08 | 85.6 |
|
| 70 |
+
| S03 | 70.6 | S09 | 71.2 |
|
| 71 |
+
| S04 | 58.3 | S10 | 62.2 |
|
| 72 |
+
| S05 | 86.2 | S11 | 60.9 |
|
| 73 |
+
| S06 | 61.2 | S12 | 67.1 |
|
| 74 |
+
| **Avg** | **68.6** | | |
|
| 75 |
+
|
| 76 |
+
## Citation
|
| 77 |
+
|
| 78 |
+
```bibtex
|
| 79 |
+
@article{liu2024ipad,
|
| 80 |
+
title={IPAD: Industrial Process Anomaly Detection Dataset},
|
| 81 |
+
author={Liu, Jinfan and Yan, Yichao and Li, Junjie and others},
|
| 82 |
+
journal={arXiv preprint arXiv:2404.15033},
|
| 83 |
+
year={2024}
|
| 84 |
+
}
|
| 85 |
+
```
|
| 86 |
+
|
| 87 |
+
## Links
|
| 88 |
+
|
| 89 |
+
- [Paper](https://arxiv.org/abs/2404.15033)
|
| 90 |
+
- [Original Code](https://github.com/LJF1113/IPAD)
|
| 91 |
+
- [Dataset](https://huggingface.co/datasets/MSherbinii/ipad-industrial-anomaly)
|