giadefa commited on
Commit
b8c2e32
·
verified ·
1 Parent(s): 1eacef6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +106 -3
README.md CHANGED
@@ -1,3 +1,106 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ license_link: LICENSE
4
+ tags:
5
+ - chemistry
6
+ - molecular simulations
7
+ - machine learning potentials
8
+ - neural network potentials
9
+ - drug discovery
10
+ ---
11
+
12
+ # Acellera AceFF 2.0 - UPCOMING
13
+
14
+ **Organization(s):** Acellera Therapeutics, inc
15
+ **Contact:** info@acellera.com
16
+ **License:** apache 2.0
17
+
18
+ ---
19
+
20
+ ## Overview
21
+
22
+ Acellera AceFF 2.0 is a next-generation **machine learning interatomic potential (MLIP)** designed for **small molecules**.
23
+ It addresses key limitations of traditional molecular mechanics (MM) force fields and earlier NNP models, including restricted atom types, limited charge support, and computational inefficiencies.
24
+
25
+ The model leverages the TensorNet v2 architecture [1] and the NNP software library TorchMD-Net [2] to provide accurate predictions for diverse drug-like compounds, supporting all key chemical elements and charged molecules.
26
+ Acellera AceFF 2.0 improves the stability of molecular dynamics simulations, supports 2 fs timesteps, and achieves state-of-the-art accuracy with fewer outliers in RBFE predictions.
27
+
28
+
29
+ ## Description
30
+
31
+ Acellera AceFF 2.0 is the second version of a new family of potentials released by [Acellera](https://www.acellera.com).
32
+ It uses [TensorNet](https://proceedings.neurips.cc/paper_files/paper/2023/file/75c2ec5f98d7b2f50ad68033d2c07086-Paper-Conference.pdf) 2-layers trained
33
+ on Acellera's internal proprietary dataset of molecular forces and energies using the wB97M-V/def2-tzvppd level of theory and VV10 dispersion corrections.
34
+
35
+ The training set was built on [PubChem](https://ftp.ncbi.nlm.nih.gov/pubchem/Compound/CURRENT-Full/SDF).
36
+ We extracted the SMILES and generated molecules, filtering out molecules larger than 30 atoms.
37
+ We kept only molecules with the elements H, B, C, N, O, F, Si, P, S, Cl, Br, and I.
38
+
39
+ ---
40
+
41
+ ## Bechmarks
42
+
43
+ ### Wiggle150
44
+ The table shows the results on the [Wiggle150](https://pubs.acs.org/doi/10.1021/acs.jctc.5c00015) benchmark. We include AIMNet2 and ANI-2x for comparison.
45
+
46
+ | **Method** | **MAE (kcal/mol)** | **RMSE (kcal/mol)** |
47
+ |------------|--------------------|---------------------|
48
+ | AceFF-2.0 | | |
49
+ | AceFF-1.1 | 2.51 | 3.18 |
50
+ | AceFF-1.0 | 2.73 | 3.32 |
51
+ | AIMNet2 | 2.39 | 3.13 |
52
+ | ANI-2X | 4.41 | 5.41 |
53
+
54
+ *Performance of MLIPs on Wiggle150 benchmark*
55
+
56
+ ### Schrodinger ligands test set
57
+ We create our own hold-out test set by labelling 650 ligands from the [Schrodinger public binding free energy benchmark](https://github.com/schrodinger/public_binding_free_energy_benchmark) (Jacs, Merk, and charge_annhil sets) with AceFF's DFT level of theory. We evaluate the Force MAE of the AceFF predictions.
58
+ ![ligand_testset](https://media.githubusercontent.com/media/Acellera/aceff_examples/refs/heads/main/figures/ligand_testset.png)
59
+
60
+ ---
61
+
62
+ ## Key Features
63
+
64
+ - **Broad Applicability:** Supports diverse drug-like molecules, including charged species and rare chemical groups.
65
+ - **High Accuracy:** Benchmark-tested on the JACS dataset, demonstrating performance comparable to or better than MM-based methods (e.g., GAFF2, FEP+).
66
+ - **Improved Stability:** Enables a 2 fs timestep for NNP/MM simulations, significantly reducing computational costs.
67
+ - **Integration-Friendly:** Available for RBFE calculations via [HTMD](https://github.com/acellera/htmd).
68
+ - **Open Science:** The model and all benchmarking data are accessible on GitHub for not-for-profit usage.
69
+
70
+
71
+ ---
72
+
73
+ ## Usage
74
+
75
+ 1. [Example notebooks](https://github.com/Acellera/aceff_examples) are available in **Google Colab**, demonstrating the use of Acellera AceFF with OpenMM and ASE.
76
+
77
+ - Single point calculation with ASE [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Acellera/aceff_examples/blob/main/notebooks/aceff_single_point_calculation.ipynb)
78
+ - ML molecular dynamics of a small molecule with OpenMM [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Acellera/aceff_examples/blob/main/notebooks/aceff_MD_example.ipynb)
79
+ - MM/ML protein-ligand simulations with OpenMM [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Acellera/aceff_examples/blob/main/notebooks/aceff_protein_ligand.ipynb)
80
+ 2. Run ML potential molecular simulations of a small molecule using ACEMD with this [tutorial](https://software.acellera.com/acemd/nnp.html), e.g., to minimize.
81
+ 3. For a tutorial on running mixed protein-ligand simulations, refer to [NNP/MM in ACEMD](https://software.acellera.com/acemd/nnpmm.html).
82
+
83
+
84
+ ---
85
+
86
+ ## Applications
87
+
88
+ - **Drug Discovery:** Optimizing lead compounds in hit-to-lead and lead optimization stages using free energy methods.
89
+ - **Binding Free Energy Calculations:** Accurate and efficient RBFE predictions for diverse molecular systems.
90
+ - **Molecular dynamics:** Capturing higher-body terms than traditional MM force fields, Acellera AceFF can be used for structure minimization and dynamics of small molecules.
91
+
92
+ ## Limitations
93
+
94
+ - **Small molecules only**: Acellera AceFF-2.0 is trained on specifically curated and extended PubChem data. However, proteins, water, etc are not part of the dataset right now.
95
+ - **Time step**: Use time steps of 2fs to run dynamics with hydrogen mass repartitioning.
96
+ - **Only -2,-1,0,1,2 charges**: For simplicity, we have trained only on these types of charged molecules.
97
+
98
+ ---
99
+
100
+ ## References
101
+
102
+ [1] Simeon, Guillem, and Gianni De Fabritiis, Tensornet: Cartesian tensor representations for efficient learning of molecular potentials, Advances in Neural Information Processing Systems 36 (2024), https://arxiv.org/abs/2306.06482
103
+
104
+ [2] Raul P. Pelaez, Guillem Simeon, Raimondas Galvelis, Antonio Mirarchi, Peter Eastman, Stefan Doerr, Philipp Thölke, Thomas E. Markland, Gianni De Fabritiis, TorchMD-Net 2.0: Fast Neural Network Potentials for Molecular Simulations, J. Chem. Theory Comput. 2024, 20, 10, 4076–4087, https://arxiv.org/abs/2402.17660
105
+
106
+ [3] Francesc Sabanés Zariquiey, Stephen E. Farr, Stefan Doerr, Gianni De Fabritiis, QuantumBind-RBFE: Accurate Relative Binding Free Energy Calculations Using Neural Network Potentials, https://arxiv.org/abs/2501.01811 (2025).