---
license: cc-by-nc-4.0
---

Checkpoints for the models evaluated in the CanadaWildFireDaily benchmark.

Each model is evaluated across three independent runs, with the random seed derived from the run index. We provide the best weights from each run as best_checkpoint.pt, organized in the directory structure modelName/checkpoints_modelName_runN/,
where N denotes the run index. For example, the UNet checkpoints are provided under unet/checkpoints_unet_run1, unet/checkpoints_unet_run2, and unet/checkpoints_unet_run3.
This structure applies to all evaluated models: unet_age, unet_attention, unet_convlstm, unet_segformer, UTAE
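
The layout described above can be turned into paths programmatically. The following is a minimal sketch, not part of the released code: the helper names `checkpoint_path` and `all_checkpoint_paths` and the `root` argument are hypothetical, and the directory name must match the repository exactly (including capitalization, e.g. for the U-TAE folder).

```python
from pathlib import Path

# Run indices described in this README: three independent runs per model.
RUNS = (1, 2, 3)


def checkpoint_path(model_name: str, run: int, root: str = ".") -> Path:
    """Build modelName/checkpoints_modelName_runN/best_checkpoint.pt.

    `root` is a hypothetical local folder where the repository was downloaded.
    """
    return Path(root) / model_name / f"checkpoints_{model_name}_run{run}" / "best_checkpoint.pt"


def all_checkpoint_paths(model_name: str, root: str = ".") -> list:
    """Return the checkpoint paths for every run of one model."""
    return [checkpoint_path(model_name, run, root) for run in RUNS]


if __name__ == "__main__":
    for path in all_checkpoint_paths("unet"):
        print(path.as_posix())
```

Given the `.pt` extension, a file located this way can presumably be read with `torch.load(path, map_location="cpu")`. Whether it holds a bare state dict or a larger dictionary is not stated here, so inspect the loaded object before passing it to a model.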

## Mono-temporal models:

* Standard UNet (architecture: 'unet'): The baseline spatial U-Net model.
* Age-Encoding UNet (architecture: 'unet_age'): A U-Net that explicitly encodes the satellite age (the time gap in days between the fire event and the satellite acquisition).
* Attention UNet (architecture: 'unet_attention'): A U-Net that uses attention gates in the skip connections to help the model focus on the most relevant spatial features and suppress irrelevant background noise.
* UNet-SegFormer (architecture: 'unet_segformer'): A hybrid vision-transformer architecture that replaces the standard CNN encoder with SegFormer's Mix Vision Transformer (MiT), paired with a standard U-Net decoder for pixel-level accuracy.

## Multi-temporal models:

* Spatiotemporal UNet (architecture: 'unet_convlstm'): A U-Net featuring a ConvLSTM bottleneck for recurrent time-series processing (e.g., 3-day sliding window forecasting).
* U-TAE (architecture: 'utae'): A temporal attention encoder-decoder baseline adapted from the ICCV 2021 U-TAE model for satellite image time series. This baseline uses the offline time-series samples from Timeseries_Samples/, and the generator now stores sequence positions for the temporal attention encoder when you regenerate those samples.
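
To illustrate what a 3-day sliding window over a daily sequence looks like, here is a minimal sketch in plain Python. It is not the benchmark's sample generator: the function name `sliding_windows` and the string placeholders standing in for daily inputs are hypothetical, and a stride of one day is an assumption.

```python
def sliding_windows(days, window=3):
    """Return every run of `window` consecutive items from `days`.

    Assumes a stride of one day; sequences shorter than `window`
    yield no windows at all.
    """
    return [tuple(days[i:i + window]) for i in range(len(days) - window + 1)]


if __name__ == "__main__":
    # Placeholders for per-day inputs (e.g. one image stack per day).
    sequence = ["day1", "day2", "day3", "day4", "day5"]
    for w in sliding_windows(sequence):
        print(w)
```

Each tuple would correspond to one multi-temporal input sequence. How a forecasting target is paired with a window is not specified here, so that pairing is left out of the sketch.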