MSherbinii commited on
Commit
d3a99e5
Β·
verified Β·
1 Parent(s): 54712e0

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +85 -6
README.md CHANGED
@@ -1,12 +1,91 @@
1
  ---
2
- title: Ipad Vad Training
3
- emoji: 🌍
4
- colorFrom: purple
5
- colorTo: yellow
6
  sdk: gradio
7
- sdk_version: 5.49.1
8
  app_file: app.py
9
  pinned: false
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: IPAD VAD Training
3
+ emoji: 🏭
4
+ colorFrom: blue
5
+ colorTo: purple
6
  sdk: gradio
7
+ sdk_version: 4.44.0
8
  app_file: app.py
9
  pinned: false
10
+ license: mit
11
+ tags:
12
+ - video-anomaly-detection
13
+ - industrial-ai
14
+ - computer-vision
15
+ - pytorch
16
+ - swin-transformer
17
+ duplicated_from:
18
+ short_description: Train IPAD video anomaly detection models on ZeroGPU
19
+ hardware: zero-a100
20
  ---
21
 
22
+ # 🏭 IPAD: Industrial Process Anomaly Detection Training
23
+
24
+ Train state-of-the-art video anomaly detection models on industrial datasets using ZeroGPU.
25
+
26
+ ## Features
27
+
28
+ - πŸš€ **ZeroGPU Training**: Free access to NVIDIA H200 GPUs
29
+ - πŸ“Š **12 Industrial Devices**: Synthetic training data for various equipment
30
+ - 🎯 **Baseline Model**: Video Swin Transformer + Periodic Memory Module
31
+ - πŸ’Ύ **Automatic Checkpointing**: Save and upload trained models to HF Hub
32
+ - πŸ“ˆ **Real-time Monitoring**: Track training progress with Gradio interface
33
+
34
+ ## Quick Start
35
+
36
+ 1. **Download Dataset**: Click "Download Dataset" to fetch the 8.3GB IPAD dataset from HF Hub
37
+ 2. **Quick Test**: Verify GPU access with the quick test (no dataset required)
38
+ 3. **Train Baseline**: Select a device and start training with default hyperparameters
39
+
40
+ ## Dataset
41
+
42
+ The IPAD dataset contains:
43
+ - **16 Industrial Devices** (12 synthetic + 4 real)
44
+ - **597,979 Video Frames**
45
+ - **39 Anomaly Classes**
46
+ - **Frame-level Annotations**
47
+
48
+ [View Dataset β†’](https://huggingface.co/datasets/MSherbinii/ipad-industrial-anomaly)
49
+
50
+ ## Model Architecture
51
+
52
+ ```
53
+ Input (16 frames, 256Γ—256)
54
+ ↓
55
+ Video Swin Transformer Encoder
56
+ ↓
57
+ Periodic Memory Module (2000-dim)
58
+ ↓
59
+ I3D Decoder
60
+ ↓
61
+ Reconstructed Output
62
+ ```
63
+
64
+ ## Baseline Results
65
+
66
+ | Device | AUC (%) | Device | AUC (%) |
67
+ |--------|---------|--------|---------|
68
+ | S01 | 69.5 | S07 | 60.6 |
69
+ | S02 | 63.9 | S08 | 85.6 |
70
+ | S03 | 70.6 | S09 | 71.2 |
71
+ | S04 | 58.3 | S10 | 62.2 |
72
+ | S05 | 86.2 | S11 | 60.9 |
73
+ | S06 | 61.2 | S12 | 67.1 |
74
+ | **Avg** | **68.6** | | |
75
+
76
+ ## Citation
77
+
78
+ ```bibtex
79
+ @article{liu2024ipad,
80
+ title={IPAD: Industrial Process Anomaly Detection Dataset},
81
+ author={Liu, Jinfan and Yan, Yichao and Li, Junjie and others},
82
+ journal={arXiv preprint arXiv:2404.15033},
83
+ year={2024}
84
+ }
85
+ ```
86
+
87
+ ## Links
88
+
89
+ - [Paper](https://arxiv.org/abs/2404.15033)
90
+ - [Original Code](https://github.com/LJF1113/IPAD)
91
+ - [Dataset](https://huggingface.co/datasets/MSherbinii/ipad-industrial-anomaly)