# NequIP

## Reference

Simon Batzner, Albert Musaelian, Lixin Sun, Mario Geiger, Jonathan P. Mailoa,
Mordechai Kornbluth, Nicola Molinari, Tess E. Smidt, and Boris Kozinsky.
E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials.
Nature Communications, 13(1), May 2022. ISSN: 2041-1723. URL: https://dx.doi.org/10.1038/s41467-022-29939-5.
## Hyperparameters, model configurations and training strategies

### Model architecture
| Parameter | Value | Description |
|---------------------------|-----------------------------------------------|---------------------------------------------|
| `num_layers` | `5` | Number of NequIP layers. |
| `node_irreps` | `64x0e + 64x0o + 32x1e + 32x1o + 4x2e + 4x2o` | O3 representation space of node features. |
| `l_max` | `2` | Maximal degree of spherical harmonics. |
| `num_bessel` | `8` | Number of Bessel basis functions. |
| `radial_net_nonlinearity` | `swish` | Activation function for radial MLP. |
| `radial_net_n_hidden` | `64` | Number of hidden features in radial MLP. |
| `radial_net_n_layers` | `2` | Number of layers in radial MLP. |
| `radial_envelope` | `polynomial_envelope` | Radial envelope function. |
| `scalar_mlp_std` | `4` | Standard deviation of weight initialisation.|
| `atomic_energies` | `None` | Treatment of the atomic energies. |
| `avg_num_neighbors` | `None` | Mean number of neighbors. |
### Training
| Parameter | Value | Description |
|--------------------------|--------|--------------------------------------------------|
| `num_epochs` | `220` | Number of epochs to run. |
| `ema_decay` | `0.99` | The EMA decay rate. |
| `eval_num_graphs` | `None` | Number of validation set graphs to evaluate on. |
| `use_ema_params_for_eval`| `True` | Whether to use the EMA parameters for evaluation.|
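With `ema_decay` set to `0.99` and `use_ema_params_for_eval` enabled, evaluation runs on an exponential moving average of the weights rather than the raw training weights. A minimal sketch of one EMA step, using a hypothetical `update_ema` helper on flat parameter dicts (not the mlip implementation, which operates on full parameter trees):

```python
def update_ema(ema_params, params, decay=0.99):
    """One EMA step: ema <- decay * ema + (1 - decay) * params.

    With decay=0.99 the evaluated weights are a smoothed trail of the
    training weights, which typically reduces evaluation noise.
    """
    return {name: decay * ema_params[name] + (1.0 - decay) * params[name]
            for name in params}
```

In practice this update would be applied leaf-wise over the whole parameter tree (e.g. with `jax.tree_util.tree_map`).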
### Optimizer
| Parameter | Value | Description |
|----------------------------------|----------------|-----------------------------------------------------------------|
| `init_learning_rate` | `0.002` | Initial learning rate. |
| `peak_learning_rate` | `0.002` | Peak learning rate. |
| `final_learning_rate` | `0.002` | Final learning rate. |
| `weight_decay` | `0` | Weight decay. |
| `warmup_steps` | `4000` | Number of optimizer warm-up steps. |
| `transition_steps` | `360000` | Number of optimizer transition steps. |
| `grad_norm` | `500` | Gradient norm used for gradient clipping. |
| `num_gradient_accumulation_steps`| `1` | Steps to accumulate before taking an optimizer step. |
| `algorithm` | `optax.amsgrad`| The AMSGrad optimizer. |
| `b1` | `0.9` | Exponential decay rate to track first moment of past gradients. |
| `b2` | `0.999` | Exponential decay rate to track second moment of past gradients.|
| `eps` | `1e-8` | Constant applied to denominator outside the square root. |
| `eps_root` | `0.0` | Constant applied to denominator inside the square root. |
### Huber Loss Energy weight schedule
| Parameter | Value | Description |
|------------------------|------------------------------------|-------------------------------------------------------------------------------------------------|
| `schedule` | `optax.piecewise_constant_schedule`| Piecewise constant schedule with scaled jumps at specific boundaries. |
| `init_value` | `40` | Initial value. |
| `boundaries_and_scale` | `{115: 25}` | Dictionary of {step: scale} where scale is multiplied into the schedule value at the given step.|

### Huber Loss Force weight schedule
| Parameter | Value | Description |
|------------------------|------------------------------------|-------------------------------------------------------------------------------------------------|
| `schedule` | `optax.piecewise_constant_schedule`| Piecewise constant schedule with scaled jumps at specific boundaries. |
| `init_value` | `1000` | Initial value. |
| `boundaries_and_scale` | `{115: 0.04}` | Dictionary of {step: scale} where scale is multiplied into the schedule value at the given step.|
### Dataset
| Parameter | Value | Description |
|-----------------------------|-------|--------------------------------------------|
| `graph_cutoff_angstrom` | `5` | Graph cutoff distance (in Å). |
| `max_n_node` | `32` | Maximum number of nodes allowed in a batch.|
| `max_n_edge` | `288` | Maximum number of edges allowed in a batch.|
| `batch_size` | `16` | Number of graphs in a batch. |
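One plausible reading of these caps, sketched as a hypothetical greedy packer: close a batch as soon as adding the next graph would exceed the node, edge, or graph-count budget. The real batching lives inside the mlip library and may differ (e.g. padding each batch to fixed shapes for JAX compilation).

```python
def pack_graphs(graphs, max_n_node=32, max_n_edge=288, batch_size=16):
    """Greedily pack (n_node, n_edge) graph sizes into batches that
    respect the per-batch node, edge, and graph-count caps above.

    Assumes every single graph already fits within the caps.
    """
    batches, current, n_node, n_edge = [], [], 0, 0
    for g_nodes, g_edges in graphs:
        if current and (len(current) == batch_size
                        or n_node + g_nodes > max_n_node
                        or n_edge + g_edges > max_n_edge):
            batches.append(current)          # close the full batch
            current, n_node, n_edge = [], 0, 0
        current.append((g_nodes, g_edges))
        n_node += g_nodes
        n_edge += g_edges
    if current:
        batches.append(current)
    return batches
```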
This model was trained on the [SPICE2-curated dataset](https://huggingface.co/datasets/InstaDeepAI/SPICE2-curated).

## How to Use

For complete usage instructions and more information, please refer to our [documentation](https://instadeep.github.io/mlip).
## License summary

1. The Licensed Models are **only** available under this License for Non-Commercial Purposes.