---
license: cc-by-4.0
---

# Description

DPA-2.4-7M is trained using a multitask strategy on the [OpenLAM datasets](https://www.aissquare.com/datasets/detail?pageType=datasets&name=OpenLAM-TrainingSet-v1&id=308). Benchmark results are available on the [LAMBench website](https://www.aissquare.com/openlam?tab=Benchmark).

Please make sure to use the code version (**v3.1.0a0**) that corresponds to this model.

# How to use

## Installation

To use the pretrained model, first install the corresponding version of DeePMD-kit. The quickest option is a pip installation:

```bash
pip install torch torchvision torchaudio
pip install git+https://github.com/deepmodeling/deepmd-kit@v3.1.0a0
```

For other installation options, visit the [Releases page](https://github.com/deepmodeling/deepmd-kit/releases/tag/v3.1.0a0) to download the offline package for **deepmd-kit v3.1.0a0**, and refer to the [official documentation](https://docs.deepmodeling.com/projects/deepmd/en/v3.0.0rc0/install/easy-install.html) for offline installation instructions.

## Use the pretrained model

The pretrained model can be used directly for prediction tasks, for example as an ASE calculator.

This model is pretrained in a multi-task manner, featuring a unified backbone (referred to as the unified descriptor) and several fitting nets for different datasets. For details, refer to the [DPA-2 paper](https://arxiv.org/abs/2312.15492).

The first step is to select a specific fitting net from the model to make predictions. To list the available fitting nets (`model-branch`) in this pretrained model, use the following command:

```bash
dp --pt show dpa-2.4-7M.pt model-branch
```

This will generate an output similar to the following:

```plaintext
Available model branches are ['Domains_Alloy', 'Domains_Anode', 'Domains_Cluster',
'Domains_Drug', 'Domains_FerroEle', 'Domains_SSE_PBE', 'Domains_SemiCond', 'H2O_H2O_PD',
'Metals_AlMgCu', 'Metals_Cu', 'Metals_Sn', 'Metals_Ti', 'Metals_V', 'Metals_W', 'Others_C12H26',
'Others_HfO2', 'Domains_SSE_PBESol', 'Domains_Transition1x', 'H2O_H2O_DPLR',
'H2O_H2O_PBE0TS_MD', 'H2O_H2O_PBE0TS', 'H2O_H2O_SCAN0', 'Metals_AgAu_PBED3', 'Others_In2Se3',
'MP_traj_v024_alldata_mixu', 'Alloy_tongqi', 'SSE_ABACUS', 'Hybrid_Perovskite',
'solvated_protein_fragments', 'Electrolyte', 'ODAC23', 'Alex2D', 'Omat24', 'SPICE2', 'OC20M',
'OC22', 'Organic_Reactions', 'RANDOM'], where 'RANDOM' means using a randomly initialized
fitting net.
```

Select a fitting net that closely matches your system. If you are conducting direct predictions or simulations, ensure that all elements in your system are included in the corresponding fitting net. For more information on the pretraining datasets, refer to [this page](https://www.aissquare.com/models/detail?pageType=models&name=DPA-2.3.1-v3.0.0rc0&id=287#data-used-for-pretraining). **Note**: the model branches "Metals_Cu", "Others_C12H26", "H2O_H2O_DPLR", "H2O_H2O_PBE0TS_MD", "H2O_H2O_PBE0TS", and "H2O_H2O_SCAN0" are deprecated (their training datasets were removed during the training process).

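The element-coverage check above is easy to automate. The sketch below is illustrative only: the branch-to-element mapping is a hypothetical example you would fill in yourself from the pretraining-dataset documentation, not something read from the model file.

```python
# Minimal sketch: check that every element in your system is covered by a branch.
# NOTE: this branch -> elements mapping is a hypothetical, hand-filled example.
branch_elements = {
    "H2O_H2O_PD": {"H", "O"},
    "Domains_Alloy": {"Al", "Cu", "Mg", "Ni"},  # illustrative subset only
}

def branch_covers(system_elements, branch):
    """Return True if all elements of the system appear in the branch's element set."""
    return set(system_elements) <= branch_elements[branch]

print(branch_covers(["H", "O"], "H2O_H2O_PD"))        # True
print(branch_covers(["H", "O", "Na"], "H2O_H2O_PD"))  # False: Na is not covered
```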
Assuming you choose `H2O_H2O_PD`, you can first freeze that branch out of the multi-task pretrained model:

```bash
dp --pt freeze -c dpa-2.4-7M.pt -o frozen_model.pth --head H2O_H2O_PD
```

Then you can use the following Python code for prediction or optimization:

```python
# Compute the potential energy
from ase import Atoms
from ase.optimize import BFGS
from deepmd.calculator import DP as DPCalculator

dp = DPCalculator("frozen_model.pth")
water = Atoms(
    "H2O",
    positions=[(0.7601, 1.9270, 1.0), (1.9575, 1.0, 1.0), (1.0, 1.0, 1.0)],
    cell=[100, 100, 100],
)
water.calc = dp
print(water.get_potential_energy())
print(water.get_forces())

# Run a BFGS structure optimization
dyn = BFGS(water)
dyn.run(fmax=1e-6)
print(water.get_positions())
```

## Zero-Shot Inference

Since energy labels calculated via DFT can differ by an arbitrary constant, LAMs are conventionally used to predict the energy difference between the label and a dummy model that estimates the potential energy solely from the chemical formula.
Specifically, the dummy model is defined as

E = Σₑ nₑ · b̂ₑ,

where **E** is the predicted energy, **nₑ** is the number of atoms of element **e**, and **b̂ₑ** is the per-atom energy of element **e** obtained via least-squares fitting:

b̂ = argmin Σᵢ ( yᵢ − Σₑ nᵢ,ₑ bₑ )²,

with **nᵢ,ₑ** denoting the number of atoms of element **e** in sample **i**, and **yᵢ** the corresponding DFT energy label.
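The least-squares fit above can be sketched in a few lines of NumPy. The composition matrix and energy labels below are synthetic toy values (not data from the OpenLAM datasets), chosen so the fit recovers the per-atom energies exactly.

```python
import numpy as np

# counts[i, e]: number of atoms of element e in sample i (columns: H, O).
counts = np.array([[2.0, 1.0],   # H2O
                   [4.0, 2.0],   # two H2O
                   [2.0, 2.0]])  # H2O2
b_true = np.array([-0.5, -2.0])  # per-atom energies used to build toy labels
y = counts @ b_true              # noiseless "DFT" labels for this sketch

# Least-squares fit of the per-element biases b-hat.
b_hat, *_ = np.linalg.lstsq(counts, y, rcond=None)
print(b_hat)  # recovers [-0.5, -2.0] in this noiseless toy case
```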
When applying the pretrained model to a new downstream system, it is advisable to adjust the energy bias term to better align with the DFT energy labels of that system. This adjustment can improve prediction accuracy without further training of the neural network parameters.

Specifically, given a new downstream system, use `dp --pt change-bias` for this zero-shot procedure; it inherits the neural network parameters of the pretrained multitask model while updating the energy bias to better align with the downstream system:

```bash
dp --pt change-bias dpa-2.4-7M.pt -s <your_system> --model-branch Omat24
```

## Fine-tuning

To fine-tune the pretrained model, the model configuration (`model/descriptor` and `model/fitting_net`) must match the pretrained model exactly. The `input_finetune.json` file provided on this page contains the same model configuration as the pretrained model.

To prepare for fine-tuning, replace the `model/type_map` and `training/training_data` (and `validation_data`) sections in `input_finetune.json` with your own dataset information, just as you would when training from scratch. Additionally, adjust the learning rate: a small value such as `1e-4` for `learning_rate/start_lr` is typically recommended. You can inherit a pretrained fitting net by using the `--model-branch` argument.

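For orientation, the sections to adapt in `input_finetune.json` might look like the following fragment; the key paths follow the standard DeePMD-kit input format, while the `type_map` entries and data paths are placeholders for your own dataset:

```json
{
  "model": {
    "type_map": ["O", "H"]
  },
  "learning_rate": {
    "start_lr": 1e-4
  },
  "training": {
    "training_data": { "systems": ["path/to/training/systems"] },
    "validation_data": { "systems": ["path/to/validation/systems"] }
  }
}
```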
```bash
dp --pt train input_finetune.json --finetune dpa-2.4-7M.pt --model-branch Omat24
```