|
|
--- |
|
|
license: apache-2.0 |
|
|
extra_gated_prompt: "The OpenFold3-preview model is released under Apache 2.0 license. You will automatically get access to the model after answering the following simple questions:" |
|
|
extra_gated_fields: |
|
|
First Name: text |
|
|
Last Name: text |
|
|
Company Name or Affiliation: text |
|
|
Role or Job Title: text |
|
|
I want to use the OpenFold-3 model for: text |
|
|
--- |
|
|
|
|
|
# OpenFold3-preview |
|
|
|
|
|
 |
|
|
|
|
|
OpenFold3-preview is a biomolecular structure prediction model aiming to be a bitwise reproduction of DeepMind's |
|
|
[AlphaFold3](https://github.com/deepmind/alphafold3), developed by the AlQuraishi Lab at Columbia University and the OpenFold consortium. This research preview is intended to gather community feedback and allow developers to start building on top of the OpenFold ecosystem. The OpenFold project is committed to long-term maintenance and open source support, and our repository is freely available for academic and commercial use under the Apache 2.0 license. |
|
|
|
|
|
For our reproduction of AlphaFold2, please refer to the original [OpenFold repository](https://github.com/aqlaboratory/openfold). |
|
|
|
|
|
## Features |
|
|
|
|
|
OpenFold3-preview replicates the input features described in the [AlphaFold3](https://www.nature.com/articles/s41586-024-07487-w) publication, as well as batch job support and efficient kernel-accelerated inference. |
|
|
|
|
|
A summary of our supported features includes: |
|
|
- Structure prediction of standard and non-canonical protein, RNA, and DNA chains, and small molecules |
|
|
- Pipelines for generating MSAs using the [ColabFold server](https://github.com/sokrypton/ColabFold) or using JackHMMER / hhblits following the AlphaFold3 protocol |
|
|
- [Structure templates](https://openfold-3.readthedocs.io/en/latest/template_how_to.html) for protein monomers |
|
|
- Kernel acceleration through [cuEquivariance](https://docs.nvidia.com/cuda/cuequivariance) and [DeepSpeed4Science](https://www.deepspeed.ai/tutorials/ds4sci_evoformerattention/) kernels - more details [here](https://openfold-3.readthedocs.io/en/latest/kernels.html) |
|
|
- Support for [multi-query jobs](https://openfold-3.readthedocs.io/en/latest/input_format.html) with [distributed predictions across multiple GPUs](https://openfold-3.readthedocs.io/en/latest/inference.html#inference-run-on-multiple-gpus) |
|
|
- Custom settings for [memory constrained GPU resources](https://openfold-3.readthedocs.io/en/latest/inference.html#inference-low-memory-mode) |
|
|
|
|
|
## Quick-Start for Inference |
|
|
|
|
|
Make your first predictions with OpenFold3-preview in a few easy steps: |
|
|
|
|
|
|
|
|
1. Install OpenFold3-preview using our pip package |
|
|
```bash |
|
|
pip install openfold3 |
|
|
mamba install kalign2 -c bioconda |
|
|
``` |
|
|
|
|
|
2. Setup your installation of OpenFold3-preview and download model parameters: |
|
|
```bash |
|
|
setup_openfold |
|
|
``` |
|
|
|
|
|
3. Run your first prediction using the ColabFold MSA server with the `run_openfold` binary |
|
|
|
|
|
```bash |
|
|
run_openfold predict --query_json=examples/example_inference_inputs/query_ubiquitin.json |
|
|
``` |
|
|
|
|
|
More information on how to customize your inference prediction can be found at our documentation home at https://openfold-3.readthedocs.io/en/latest/. More examples for inputs and outputs can be found in our [HuggingFace examples](https://huggingface.co/OpenFold/OpenFold3/tree/main/examples/common_examples). |
|
|
|
|
|
## Benchmarking |
|
|
|
|
|
OpenFold3-preview performs competitively with the state of the art in open source biomolecular structure prediction, while being the only model to match AlphaFold3's performance on monomeric RNA structures. |
|
|
|
|
|
**Preliminary results:** |
|
|
|
|
|
 |
|
|
|
|
|
Performance of OF3p and other models on a diverse set of benchmarks: |
|
|
- Protein and RNA monomers from [CASP16](https://www.biorxiv.org/content/10.1101/2025.05.06.652459v2) and [Ludaic et al](https://www.biorxiv.org/content/10.1101/2025.04.30.651414v2) |
|
|
- Proein-protein complexes from [CASP16](https://www.biorxiv.org/content/10.1101/2025.05.29.656875v1) and [FoldBench](https://www.biorxiv.org/content/10.1101/2025.05.22.655600v1) |
|
|
- Protein-ligand complexes from the [Runs and Poses](https://www.biorxiv.org/content/10.1101/2025.02.03.636309v3) |
|
|
|
|
|
For more details on inferences procedures and benchmarking methods, please refer to our [whitepaper](assets/whitepaper.pdf) . |
|
|
|
|
|
## Documentation |
|
|
Please visit our full documentation at https://openfold-3.readthedocs.io/en/latest/ |
|
|
|
|
|
## Upcoming |
|
|
The final OpenFold3 model is still in development, and we are actively working on the following features: |
|
|
- Full parity on all modalities with AlphaFold3 |
|
|
- Training documentation & dataset release |
|
|
- Workflows for training on custom non-PDB data |
|
|
|
|
|
## Contributing |
|
|
|
|
|
If you encounter problems using OpenFold3-preview, feel free to create an issue! We also |
|
|
welcome pull requests from the community. |
|
|
|
|
|
## Citing this Work |
|
|
|
|
|
Any work that cites OpenFold should also cite [AlphaFold3](https://www.nature.com/articles/s41586-024-07487-w). |
|
|
|
|
|
|
|
|
|
|
|
To download the entire data, you can use `git clone <huggingface link>`. You can download individual files by clicking the download button next each file name. |