leharris3/satformer

leharris3 committed 8 days ago

Commit 6b4e7d9 (verified) · 1 parent: d8fe4ee

Add model card

Files changed (1)

README.md +77 -3

@@ -1,3 +1,77 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+library_name: pytorch
+tags:
+  - precipitation-nowcasting
+  - weather-forecasting
+  - video-transformer
+  - space-time-attention
+  - satellite-imagery
+pipeline_tag: image-classification
+---
+# SaTformer: A Space-Time Transformer for Precipitation Nowcasting
+**Authors:** Levi Harris, Tianlong Chen — *The University of North Carolina at Chapel Hill*
+[![arXiv](https://img.shields.io/badge/arXiv-2511.11090-b31b1b.svg)](https://arxiv.org/abs/2511.11090)
+[![NeurIPS](https://img.shields.io/badge/NeurIPS_2025-1st_Place_CUMSUM-4b44ce.svg)](https://neurips.cc/virtual/2025/loc/san-diego/135896)
+[![GitHub](https://img.shields.io/badge/GitHub-satformer-181717.svg?logo=github)](https://github.com/leharris3/satformer)
+SaTformer is a Vision Transformer adapted for spatio-temporal precipitation nowcasting from geostationary satellite (HRIT) imagery. It won **1st place** in the NeurIPS 2025 CUMSUM challenge.
+## Model Details
+| Parameter | Value |
+|---|---|
+| Architecture | Vision Transformer (adapted from TimeSformer) |
+| Attention | Joint space-time (ST²) |
+| Embedding dim | 512 |
+| Depth | 12 blocks |
+| Heads | 8 (dim 64) |
+| Input | 4 frames × 11 channels × 32×32 pixels |
+| Output | 64 precipitation bins (classification) |
+| Patch size | 4×4 pixels |
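The table fixes the input size and patch size, which together determine how many tokens the joint space-time attention sees. The following is a minimal sketch of that arithmetic. It assumes non-overlapping square patches with one token per patch per frame, as in TimeSformer-style models. Whether SaTformer adds a classification token on top of these is not stated in the card, so it is left out here.

```python
# Token arithmetic for the configuration in the Model Details table.
# Assumption: non-overlapping patches, one token per patch per frame.
num_frames = 4
channels = 11
image_size = 32
patch_size = 4

patches_per_side = image_size // patch_size      # 8
patches_per_frame = patches_per_side ** 2        # 64
tokens_total = num_frames * patches_per_frame    # 256
patch_dim = channels * patch_size ** 2           # 176 raw values per patch

# Joint space-time attention compares every token with every other token,
# so the attention matrix per head has tokens_total ** 2 entries.
attention_entries = tokens_total ** 2            # 65536

print(patches_per_frame, tokens_total, patch_dim, attention_entries)
```

Under these assumptions each 176-value patch is projected to the 512-dimensional embedding, and the sequence is short enough (256 tokens) that full joint attention stays inexpensive.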
+## Usage
+```python
+import torch
+from huggingface_hub import hf_hub_download
+# Assumes the GitHub repo (github.com/leharris3/satformer) is cloned and this runs from its root.
+from src.model.SaTformer.SaTformer import SaTformer
+model = SaTformer(
+    dim=512,
+    num_frames=4,
+    num_classes=64,
+    image_size=32,
+    patch_size=4,
+    channels=11,
+    depth=12,
+    heads=8,
+    dim_head=64,
+    attn_dropout=0.1,
+    ff_dropout=0.1,
+    rotary_emb=False,
+    attn="ST^2"
+)
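+weights = hf_hub_download(repo_id="leharris3/satformer", filename="sf-64-cls.pt")
+# map_location="cpu" lets the checkpoint load on machines without a GPU.
+state_dict = torch.load(weights, map_location="cpu", weights_only=True)
+# strict=False skips mismatched keys silently, so print what was skipped.
+result = model.load_state_dict(state_dict, strict=False)
+print("missing:", result.missing_keys, "unexpected:", result.unexpected_keys)
+model.eval()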
+with torch.no_grad():
+    x = torch.rand(1, 4, 11, 32, 32)  # (batch, frames, channels, H, W)
+    logits = model(x)                  # -> [1, 64]
+```
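The usage snippet stops at raw logits over the 64 precipitation bins. A common next step for a classification head is to turn those logits into a probability distribution and pick the most likely bin. The sketch below shows this in plain Python, with random numbers standing in for one row of `logits`. The mapping from bin index to a rainfall amount is not given in the model card, so the example stops at the bin index.

```python
import math
import random

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Stand-in for model(x)[0].tolist(); real logits come from the model.
random.seed(0)
logits = [random.gauss(0.0, 1.0) for _ in range(64)]

probs = softmax(logits)
# Index of the most probable precipitation bin.
predicted_bin = max(range(len(probs)), key=probs.__getitem__)

print(f"predicted bin: {predicted_bin}, probability: {probs[predicted_bin]:.3f}")
```

With the PyTorch tensor from the snippet above, the same two steps are `logits.softmax(dim=-1)` and `logits.argmax(dim=-1)`.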
+## Citation
+```bibtex
+@article{harris2025satformer,
+  title={A Space-Time Transformer for Precipitation Forecasting},
+  author={Harris, Levi and Chen, Tianlong},
+  journal={arXiv preprint arXiv:2511.11090},
+  year={2025}
+}
+```