# Swin Transformer for Video
## Model Architecture
- **Type**: Video Swin Transformer (Tiny - Swin3D-T)
- **Source**: Torchvision (`torchvision.models.video.swin3d_t`).
- **Modifications**: The final linear classification head is replaced to output 2 classes (binary classification).
- **Features**: Hierarchical transformer with shifted windows, adapted for 3D Video processing.
## Dataset Structure
The script expects a `Dataset` folder in the parent directory, with one subfolder per class:
```
Dataset/
├── violence/
└── no-violence/
```
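
A layout like this can be turned into labeled samples with a short directory scan. The sketch below is illustrative only: the helper `list_videos`, the label mapping (`no-violence` = 0, `violence` = 1), and the accepted file extensions are all assumptions, not details taken from `train.py`.

```python
from pathlib import Path

# Assumed label mapping and extensions; neither is stated in this README.
CLASS_TO_LABEL = {"no-violence": 0, "violence": 1}
VIDEO_EXTENSIONS = {".mp4", ".avi", ".mov", ".mkv"}


def list_videos(root):
    """Return (path, label) pairs for every video under root/<class>/."""
    samples = []
    for class_name, label in CLASS_TO_LABEL.items():
        class_dir = Path(root) / class_name
        if not class_dir.is_dir():
            continue  # skip a missing class folder instead of raising
        for path in sorted(class_dir.iterdir()):
            # Ignore non-video files such as notes or thumbnails.
            if path.is_file() and path.suffix.lower() in VIDEO_EXTENSIONS:
                samples.append((path, label))
    return samples
```

Calling `list_videos("../Dataset")` from the project folder would then yield the list of files to decode, assuming the dataset sits one level up as described.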
## How to Run
1. Install dependencies: `torch`, `opencv-python`, `scikit-learn`, `numpy`, `torchvision`.
2. Run `python train.py`.