---
language: en
license: mit
tags:
- environment
- computer-vision
- vit
- climate-change
---

# wildfire_smoke_segmentation_vit

## Overview
This model is a Vision Transformer (ViT) designed for the early detection of wildfires via satellite and aerial imagery. By identifying specific smoke patterns and thermal anomalies, it provides real-time alerts for environmental monitoring agencies.


## Model Architecture
The model is based on the **ViT-Base** (Patch 16) architecture:
- **Patching**: Divides input images into 16x16 patches to capture global spatial dependencies.
- **Attention**: Uses multi-head self-attention to distinguish between cloud cover and low-density smoke plumes.
- **Pre-training**: Initialized on ImageNet-21k and fine-tuned on the FIRESAT dataset.

## Intended Use
- **Remote Sensing**: Automated monitoring of vast forested areas via Sentinel-2 or Landsat imagery.
- **Early Warning Systems**: Integration into IoT-enabled lookout towers for local fire departments.
- **Post-Fire Analysis**: Assessing the spread and intensity of smoke for environmental impact studies.

## Limitations
- **Atmospheric Conditions**: Heavy cloud cover or fog can lead to false positives.
- **Resolution**: Accuracy drops significantly for images where the smoke plume is smaller than 32x32 pixels.
- **Time of Day**: Optimized for daytime multi-spectral imagery; night-time performance relies on thermal band availability.