Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,32 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language: en
|
| 3 |
+
license: mit
|
| 4 |
+
tags:
|
| 5 |
+
- environment
|
| 6 |
+
- computer-vision
|
| 7 |
+
- vit
|
| 8 |
+
- climate-change
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# wildfire_smoke_segmentation_vit
|
| 12 |
+
|
| 13 |
+
## Overview
|
| 14 |
+
This model is a Vision Transformer (ViT) designed for the early detection of wildfires via satellite and aerial imagery. By identifying specific smoke patterns and thermal anomalies, it provides real-time alerts for environmental monitoring agencies.
|
| 15 |
+
|
| 16 |
+
|
| 17 |
+
|
| 18 |
+
## Model Architecture
|
| 19 |
+
The model is based on the **ViT-Base** (Patch 16) architecture:
|
| 20 |
+
- **Patching**: Divides input images into 16x16 patches to capture global spatial dependencies.
|
| 21 |
+
- **Attention**: Uses multi-head self-attention to distinguish between cloud cover and low-density smoke plumes.
|
| 22 |
+
- **Pre-training**: Initialized on ImageNet-21k and fine-tuned on the FIRESAT dataset.
|
| 23 |
+
|
| 24 |
+
## Intended Use
|
| 25 |
+
- **Remote Sensing**: Automated monitoring of vast forested areas via Sentinel-2 or Landsat imagery.
|
| 26 |
+
- **Early Warning Systems**: Integration into IoT-enabled lookout towers for local fire departments.
|
| 27 |
+
- **Post-Fire Analysis**: Assessing the spread and intensity of smoke for environmental impact studies.
|
| 28 |
+
|
| 29 |
+
## Limitations
|
| 30 |
+
- **Atmospheric Conditions**: Heavy cloud cover or fog can lead to false positives.
|
| 31 |
+
- **Resolution**: Accuracy drops significantly for images where the smoke plume is smaller than 32x32 pixels.
|
| 32 |
+
- **Time of Day**: Optimized for daytime multi-spectral imagery; night-time performance relies on thermal band availability.
|