pzharring commited on
Commit
0012fa6
·
verified ·
1 Parent(s): afa3712

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +240 -0
README.md ADDED
@@ -0,0 +1,240 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # PhysicsNeMo Checkpoints: StormScope-GOES-MRMS
2
+
3
+ ## Description:
4
+
5
+ StormScope is a mesoscale weather AI model that autoregressively predicts GOES (satellite) and MRMS (radar) variables.
6
+ The model can make predictions at multiple spatiotemporal resolutions, and the [HRRR](https://rapidrefresh.noaa.gov/hrrr/) model grid is used as the base 3 km grid over the continental US (CONUS).
7
+
8
+ The model can be conditioned on additional data, such as synoptic-scale 500 hPa geopotential (z500), if desired.
9
+ Models predicting MRMS are conditioned on the most recent GOES state variables.
10
+
11
+ For inference see [NVIDIA Earth2Studio](https://github.com/NVIDIA/earth2studio).
12
+
13
+ This model is for research and development only.
14
+
15
+ ### License/Terms of Use:
16
+
17
+ **Governing Terms**: Use of this model is governed by the [NVIDIA Open Model License](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/).
18
+
19
+ ### Deployment Geography:
20
+
21
+ Global
22
+
23
+ ### Use Case:
24
+
25
+ High-resolution ensemble forecasting of GOES and MRMS observations over the continental US (CONUS)
26
+
27
+ ### Release Date:
28
+
29
+ Hugging Face: 01/26/2026 via [https://huggingface.co/nvidia/stormscope-goes-mrms](https://huggingface.co/nvidia/stormscope-goes-mrms)
30
+
31
+ ## Model Architecture
32
+
33
+ **Architecture Type:** StormScope uses a diffusion transformer architecture with 2D neighborhood attention. <br>
34
+ **Network Architecture:** Diffusion Transformer (DiT)
35
+
36
+ This model provides the following checkpoints:
37
+
38
+ - 6 km grid, 1 hr DiT -- 194M Parameters
39
+ - 6 km grid, 10 min DiT -- 195M Parameters
40
+ - 3 km grid, 10 min DiT -- 260M Parameters
41
+
42
+ ## Input:
43
+
44
+ **Input Type(s):**
45
+
46
+ - Tensor (1-8 state variables from GOES/MRMS, and 1-8 optional conditioning variables.)
47
+ - DateTime (NumPy Array) <br>
48
+
49
+ **Input Format(s):** PyTorch Tensor / NumPy array <br>
50
+ **Input Parameters:**
51
+
52
+ - Five Dimensional (5D) (batch, lead time, variable, height, width) <br>
53
+ - Input DateTime (1D) <br>
54
+
55
+ **Other Properties Related to Input:**
56
+
57
+ - Input grid (height/width) dimensions are that of the HRRR model grid.
58
+ - Input lead time can be of size one, for a single input timestep, or of size 6 for a series of input timesteps.
59
+ - Input state GOES variables: `abi01c`, `abi02c`, `abi03c`, `abi07c`, `abi08c`, `abi09c`, `abi10c`, `abi13c`
60
+ - Input state MRMS variables: `refc`
61
+ - Conditioning variables (optional): `z500`, or the GOES variables listed above.
62
+
63
+ For variable naming information, review the [GOES](https://github.com/NVIDIA/earth2studio/blob/main/earth2studio/lexicon/goes.py) and [MRMS](https://github.com/NVIDIA/earth2studio/blob/main/earth2studio/lexicon/mrms.py) Lexicons in Earth2Studio.
64
+
65
+ ## Output:
66
+
67
+ **Output Type(s):** Tensor (1-8 state variables from GOES/MRMS, and 1-8 optional conditioning variable.) <br>
68
+ **Output Format:** Pytorch Tensors <br>
69
+ **Output Parameters:** Five Dimensional (5D) (batch, lead time, variable, latitude, longitude) <br>
70
+ **Other Properties Related to Output:**
71
+
72
+ - Output grid (height/width) dimensions are that of the HRRR model grid.
73
+ - Output lead time is of size 1 (model predicts one timestep)
74
+ - Output state GOES variables: `abi01c`, `abi02c`, `abi03c`, `abi07c`, `abi08c`, `abi09c`, `abi10c`, `abi13c`
75
+ - Output state MRMS variables: `refc`
76
+
77
+ Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems. By leveraging NVIDIA’s hardware (e.g. GPU cores) and software frameworks (e.g., CUDA libraries), the model achieves faster training and inference times compared to CPU-only solutions.
78
+
79
+ ## Software Integration
80
+
81
+ **Runtime Engine(s):** Not Applicable <br>
82
+ **Supported Hardware Microarchitecture Compatibility:** <br>
83
+ * NVIDIA Ampere <br>
84
+ * NVIDIA Blackwell <br>
85
+ * NVIDIA Hopper <br>
86
+
87
+ **Supported Operating System(s):**
88
+ * Linux <br>
89
+
90
+ The integration of foundation and fine-tuned models into AI systems requires additional testing using use-case-specific data to ensure safe and effective deployment. Following the V-model methodology, iterative testing and validation at both unit and system levels are essential to mitigate risks, meet technical and functional requirements, and ensure compliance with safety and ethical standards before deployment.
91
+
92
+ ## Model Version(s):
93
+
94
+ **Model Version:** v1 <br>
95
+
96
+ # Training, Testing, and Evaluation Datasets:
97
+
98
+ ## Training Dataset:
99
+
100
+ **Link:** [ERA5](https://cds.climate.copernicus.eu/) <br>
101
+
102
+ *Data Collection Method by dataset:* <br>
103
+ * Automatic/Sensors <br>
104
+
105
+ *Labeling Method by dataset:* <br>
106
+ * Automatic/Sensors <br>
107
+
108
+ *Data Modality:*
109
+ * Gridded geophysical time series
110
+
111
+ *Data Size:*
112
+ * 131 GB subset used for model training
113
+
114
+ **Properties:**
115
+ ERA5 data for the period January 2018 - December 2023. ERA5 provides hourly estimates of various
116
+ atmospheric, land, and oceanic climate variables. The data covers the Earth on a 30km
117
+ grid and resolves the atmosphere at 137 levels. <br>
118
+
119
+ **Link:** [GOES](hhttps://registry.opendata.aws/noaa-goes/) <br>
120
+
121
+ *Data Collection Method by dataset* <br>
122
+ * Automatic/Sensors <br>
123
+
124
+ *Labeling Method by dataset* <br>
125
+ * Automatic/Sensors <br>
126
+
127
+ *Data Modality:*
128
+ * Gridded geophysical time series
129
+
130
+ *Data Size:*
131
+ * 18.5 TB subset used for model training
132
+
133
+ **Properties:**
134
+ GOES imaging data for the period January 2018 - December 2023. GOES is a NOAA geostationary satellite product providing multispectral radiances at 3 km resolution over a disk covering the continental US. <br>
135
+
136
+ **Link:** [MRMS](https://registry.opendata.aws/noaa-mrms-pds/) <br>
137
+
138
+ *Data Collection Method by dataset* <br>
139
+ * Automatic/Sensors <br>
140
+
141
+ *Labeling Method by dataset* <br>
142
+ * Automatic/Sensors <br>
143
+
144
+ *Data Modality:*
145
+ * Gridded geophysical time series
146
+
147
+ *Data Size:*
148
+ * 2.3 TB subset used for model training
149
+
150
+ **Properties:**
151
+ MRMS radar data for the period January 2018 - December 2023. MRMS is a NOAA precipitation data product incorporating weather radar, surface, and other observations at high resolution over the continental US. <br>
152
+
153
+ ## Testing Dataset:
154
+
155
+ **Link:** [ERA5](https://cds.climate.copernicus.eu/) <br>
156
+
157
+ *Data Collection Method by dataset:* <br>
158
+ * Automatic/Sensors <br>
159
+
160
+ *Labeling Method by dataset:* <br>
161
+ * Automatic/Sensors <br>
162
+
163
+ **Properties:**
164
+ ERA5 data for the period January 2024 - December 2024. ERA5 provides hourly estimates of various
165
+ atmospheric, land, and oceanic climate variables. The data covers the Earth on a 30km
166
+ grid and resolves the atmosphere at 137 levels. <br>
167
+
168
+ **Link:** [GOES](https://registry.opendata.aws/noaa-goes/) <br>
169
+
170
+ *Data Collection Method by dataset* <br>
171
+ * Automatic/Sensors <br>
172
+
173
+ *Labeling Method by dataset* <br>
174
+ * Automatic/Sensors <br>
175
+
176
+ **Properties:**
177
+ GOES imaging data for the period January 2024 - December 2024. GOES is a NOAA geostationary satellite product providing multispectral radiances at 3 km resolution over a disk covering the continental US. <br>
178
+
179
+ **Link:** [MRMS](https://registry.opendata.aws/noaa-mrms-pds/) <br>
180
+
181
+ *Data Collection Method by dataset* <br>
182
+ * Automatic/Sensors <br>
183
+
184
+ *Labeling Method by dataset* <br>
185
+ * Automatic/Sensors <br>
186
+
187
+ **Properties:**
188
+ MRMS radar data for the period January 2024 - December 2024. MRMS is a NOAA precipitation data product incorporating weather radar, surface, and other observations at high resolution over the continental US. <br>
189
+
190
+ ## Evaluation Dataset:
191
+
192
+ **Link:** [ERA5](https://cds.climate.copernicus.eu/) <br>
193
+
194
+ *Data Collection Method by dataset:* <br>
195
+ * Automatic/Sensors <br>
196
+
197
+ *Labeling Method by dataset:* <br>
198
+ * Automatic/Sensors <br>
199
+
200
+ **Properties:**
201
+ ERA5 data for the period January 2025 - December 2025. ERA5 provides hourly estimates of various
202
+ atmospheric, land, and oceanic climate variables. The data covers the Earth on a 30km
203
+ grid and resolves the atmosphere at 137 levels. <br>
204
+
205
+ **Link:** [GOES](https://registry.opendata.aws/noaa-goes/) <br>
206
+
207
+ *Data Collection Method by dataset* <br>
208
+ * Automatic/Sensors <br>
209
+
210
+ *Labeling Method by dataset* <br>
211
+ * Automatic/Sensors <br>
212
+
213
+ **Properties:**
214
+ GOES imaging data for the period January 2025 - December 2025. GOES is a NOAA geostationary satellite product providing multispectral radiances at 3 km resolution over a disk covering the continental US. <br>
215
+
216
+ **Link:** [MRMS](https://registry.opendata.aws/noaa-mrms-pds/) <br>
217
+
218
+ *Data Collection Method by dataset* <br>
219
+ * Automatic/Sensors <br>
220
+
221
+ *Labeling Method by dataset* <br>
222
+ * Automatic/Sensors <br>
223
+
224
+ **Properties:**
225
+ MRMS radar data for the period January 2025 - December 2025. MRMS is a NOAA precipitation data product incorporating weather radar, surface, and other observations at high resolution over the continental US. <br>
226
+
227
+ ## Inference:
228
+
229
+ **Engine:** PyTorch <br>
230
+ **Test Hardware:**
231
+ * A100 <br>
232
+ * H100 <br>
233
+ * L40S <br>
234
+
235
+
236
+ ## Ethical Considerations:
237
+
238
+ NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
239
+
240
+ Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).