--- license: cc-by-nc-4.0 --- Checkpoints of models in CanadaWildFireDaily benchmark. Each model is evaluated across three independent runs, with the random seed derived from the run index. We provide the optimal weights as best_checkpoint.pt, organized using the directory structure modelName/checkpoints_modelName_runN/ where N denotes the run index. For example the UNet checkpoints are provided under unet/checkpoints_unet_run1, unet/checkpoints_unet_run2, and unet/checkpoints_unet_run3. This strcuture applies to all evaluated models: unet_age, unet_attention, unet_convlstm, unet_segformer, UTAE ## Mono-temporal models: * Standard UNet(architecture: 'unet'): The baseline spatial U-Net model. * Age-Encoding UNet(architecture: 'unet_age'): A U-Net that explicitly encodes the satellite age (the time gap in days between the fire event and the satellite acquisition). * Attention UNet(architecture: 'unet_attention'): A U-Net utilizing attention gates in the skip connections to help the model focus on the most critical spatial features and suppress irrelevant background noise. * UNet-SegFormer(architecture: 'unet_segformer'): A hybrid vision-transformer architecture that replaces the standard CNN encoder with SegFormer's Mix Vision Transformer (MiT), paired with a standard U-Net decoder for heavy pixel-level accuracy. ## Multi-temporal models: * Spatiotemporal UNet(architecture: 'unet_convlstm'): A U-Net featuring aConvLSTMbottleneck for recurrent time-series processing (e.g., 3-day sliding window forecasting). * UT-AE(architecture: 'utae'): A temporal attention encoder-decoder baseline adapted from the ICCV 2021 U-TAE model for satellite image time series. This baseline uses the time-series offline samples fromTimeseries_Samples/, and the generator now stores sequence positions for the temporal attention encoder when you regenerate those samples.