File size: 7,017 Bytes
1d0b8a3 9dbcdb8 1d0b8a3 20cd07b 3407056 1d0b8a3 a2ffac3 1d0b8a3 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 | ---
license: apache-2.0
---
# Earth-2 Checkpoints: DLESyM-V1-ERA5
## Description:
DLESyM-V1-ERA5 is an ensemble forecast model for global earth system modeling.
This model includes an atmosphere and ocean component, using atmospheric
variables as well as the sea-surface temperature on a HEALPix nside=64
(approximately 1 degree) resolution grid. This model package includes several
individual trained checkpoints for the atmosphere and ocean components, which
can be used to improve model variability in ensembles. The model architecture
is a U-Net with padding operations modified to support using the HEALPix grid.
For training recipies see [PhysicsNeMo](https://github.com/NVIDIA/physicsnemo/tree/main/examples/weather/dlwp_healpix), for inference [Earth2Studio](https://nvidia.github.io/earth2studio/examples/14_dlesym_example.html#sphx-glr-examples-14-dlesym-example-py).
This model is ready for commercial/non-commercial use.
### License/Terms of Use:
**Governing Terms**: Use of this model is governed by the [NVIDIA Community Model License](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-community-models-license/).
### Deployment Geography:
Global
### Use Case:
Industry, academic, and government research teams interested in subseasonal-to-seasonal weather forecasting, and climate modeling.
### Release Date:
NGC 05/12/2025
## Reference:
- [Advancing Parsimonious Deep Learning Weather Prediction using the HEALPix Mesh](https://arxiv.org/abs/2311.06253) <br>
- [A Deep Learning Earth System Model for Efficient Simulation of the Observed Climate](https://arxiv.org/abs/2409.16247) <br>
## Model Architecture:
**Architecture Type:** DLESyM uses two UNet architectures adapted to the HEALPix grid,
one for each of the atmosphere and ocean components. <br>
**Network Architecture:** UNet <br>
## Input:
**Input Type:**
- Tensor (9 surface and pressure-level variables)
**Input Format:** PyTorch Tensor <br>
**Input Parameters:**
- Six Dimensional (6D) (batch, lead time, variable, face, height, width) <br>
**Other Properties Related to Input:**
- Input latitude/longitude grid: 0.25 degree 721 x 1440, regridded to HEALPix nside=64
grid in "XY" format with a north origin and clockwise order
- See `HEALPIX_PAD_XY` in the [earth2grid package](https://github.com/NVlabs/earth2grid)
for specific details
- Input state weather variables: `z500`, `tau300-700`, `z1000`, `t2m`, `tcwv`, `t850`,
`z250`, `ws10m`, `sst`
- `tau300-700` (geopotential thickness) is defined as the difference between `z300`
and `z700` geopotential levels.
- `ws10m` (wind speed at 10m above surface) is defined as the square root of the sum
of the squared zonal and meridional wind components, i.e. `sqrt(u10m **2 + v10m **2)`.
For variable name information, review the `HRRR` Lexicon at [Earth2Studio](https://github.com/NVIDIA/earth2studio).
Review the `config.yaml` provided in the model package for information on the input lead
times required by the model.
## Output:
**Output Type:** Tensor (9 surface and pressure-level variables) <br>
**Output Format:** Pytorch Tensor <br>
**Output Parameters:** Six Dimensional (6D) (batch, lead time, variable, face, height,
width) <br>
**Other Properties Related to Output:**
- Output latitude/longitude grid: 0.25 degree 721 x 1440, regridded to HEALPix nside=64
grid in "XY" format with a north origin and clockwise order.
- See `HEALPIX_PAD_XY` in the [earth2grid package](https://github.com/NVlabs/earth2grid)
for specific details
- Output state weather variables: `z500`, `tau300-700`, `z1000`, `t2m`, `tcwv`, `t850`,
`z250`, `ws10m`, `sst` <br>
- `tau300-700` (geopotential thickness) is defined as the difference between `z300`
and `z700` geopotential levels.
- `ws10m` (wind speed at 10m above surface) is defined as the square root of the sum
of the squared zonal and meridional wind components, i.e. `sqrt(u10m **2 + v10m **2)`.
Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems.
By leveraging NVIDIA’s hardware (e.g. GPU cores) and software frameworks (e.g., CUDA
libraries), the model achieves faster training and inference times compared to
CPU-only solutions.
## Software Integration
**Runtime Engine:** Pytorch <br>
**Supported Hardware Microarchitecture Compatibility:** <br>
* NVIDIA Ampere <br>
* NVIDIA Hopper <br>
* NVIDIA Turing <br>
**Supported Operating System:**
* Linux <br>
## Model Version:
**Model Version:** v1 <br>
# Training, Testing, and Evaluation Datasets:
**Total size (in number of data points):** 110,960 <br>
**Total number of datasets:** 1<br>
**Dataset partition:** training 90%, testing 5%, validation 5% <br>
## Training Dataset:
**Link:** [ERA5](https://cds.climate.copernicus.eu/) <br>
**Data Collection Method by dataset** <br>
* Automatic/Sensors <br>
**Labeling Method by dataset** <br>
* Automatic/Sensors <br>
**Properties:**
ERA5 data for the period 1980-2015. ERA5 provides hourly estimates of various
atmospheric, land, and oceanic climate variables. The data covers the Earth on a 30km
grid and resolves the atmosphere at 137 levels. We re-grid to a healpix-64 grid that
corresponds to an approximate 1 degree lat/lon grid. <br>
## Testing Dataset:
**Link:** [ERA5](https://cds.climate.copernicus.eu/) <br>
**Data Collection Method by dataset** <br>
* Automatic/Sensors <br>
**Labeling Method by dataset** <br>
* Automatic/Sensors <br>
**Properties:**
ERA5 data for the period 2016-2017. ERA5 provides hourly estimates of various
atmospheric, land, and oceanic climate variables. The data covers the Earth on a 30km
grid and resolves the atmosphere at 137 levels. We re-grid to a healpix-64 grid that
corresponds to an approximate 1 degree lat/lon grid. <br>
## Evaluation Dataset:
**Link:** [ERA5](https://cds.climate.copernicus.eu/) <br>
**Data Collection Method by dataset** <br>
* Automatic/Sensors <br>
**Labeling Method by dataset** <br>
* Automatic/Sensors <br>
**Properties:**
ERA5 data for the period 2018-2019. ERA5 provides hourly estimates of various
atmospheric, land, and oceanic climate variables. The data covers the Earth on a 30km
grid and resolves the atmosphere at 137 levels. We re-grid to a healpix-64 grid that
corresponds to an approximate 1 degree lat/lon grid. <br>
## Inference:
**Acceleration Engine:** PhysicsNeMo, PyTorch <br>
**Test Hardware:**
* A100 <br>
* H100 <br>
* L40S <br>
## Ethical Considerations:
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/). |