Transformers
Safetensors
English
esmfold2
biology
esm
protein
protein-structure-prediction
structure-prediction
protein-design
3d-structure
confidence-estimation
molecular-dynamics
Instructions to use biohub/ESMFold2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use biohub/ESMFold2 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("biohub/ESMFold2", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -17,7 +17,7 @@ tags:
|
|
| 17 |
- transformers
|
| 18 |
---
|
| 19 |
|
| 20 |
-
#
|
| 21 |
|
| 22 |
## Model Details
|
| 23 |
|
|
@@ -33,8 +33,8 @@ Read more about ESMFold2 in our paper [here](https://biohub.ai/papers/esm_protei
|
|
| 33 |
|
| 34 |
| Model | MSA Conditioning | Description | Data Cutoff |
|
| 35 |
| :---- | :---- | :---- | :---- |
|
| 36 |
-
| [
|
| 37 |
-
| [
|
| 38 |
|
| 39 |
## Performance Metrics
|
| 40 |
|
|
@@ -177,13 +177,6 @@ Informed by our risk assessments, we are releasing the source code and model wei
|
|
| 177 |
|
| 178 |
## Biases and Limitations
|
| 179 |
|
| 180 |
-
### Potential Biases
|
| 181 |
-
|
| 182 |
-
- The model may reflect biases present in the training data.
|
| 183 |
-
-
|
| 184 |
-
|
| 185 |
-
### Limitations
|
| 186 |
-
|
| 187 |
- **Dataset biases**: The model may reflect biases present in the training data (PDB, AFDB), including over-representation of certain protein families, experimental conditions, or structural classes. Performance may vary for underrepresented protein types.
|
| 188 |
- **Dataset limitations:** PDB historically lacks comprehensive data on protein conformations, post-translational modifications, disordered regions, etc. Like all other structure prediction models trained on the PDB, performance may degrade on other biomolecules.
|
| 189 |
- **Computational demand:** Highest accuracy structure predictions require scaling inference time compute. Predictions made with reduced inference parameters may lead to suboptimal performance.
|
|
@@ -205,7 +198,27 @@ Should you have any security or privacy issues or questions related to this mode
|
|
| 205 |
|
| 206 |
### Citation
|
| 207 |
|
| 208 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 209 |
|
| 210 |
## Acknowledgements
|
| 211 |
|
|
|
|
| 17 |
- transformers
|
| 18 |
---
|
| 19 |
|
| 20 |
+
# ESMFold2
|
| 21 |
|
| 22 |
## Model Details
|
| 23 |
|
|
|
|
| 33 |
|
| 34 |
| Model | MSA Conditioning | Description | Data Cutoff |
|
| 35 |
| :---- | :---- | :---- | :---- |
|
| 36 |
+
| [ESMFold2](https://huggingface.co/biohub/ESMFold2) | Yes | Large model, capable of either single-sequence or MSA conditioned structure prediction for improved accuracy on difficult targets | Sept 2021 |
|
| 37 |
+
| [ESMFold2-Fast](https://huggingface.co/biohub/ESMFold2-Fast) | No | Inference optimized single-sequence structure prediction model | Sept 2021 |
|
| 38 |
|
| 39 |
## Performance Metrics
|
| 40 |
|
|
|
|
| 177 |
|
| 178 |
## Biases and Limitations
|
| 179 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 180 |
- **Dataset biases**: The model may reflect biases present in the training data (PDB, AFDB), including over-representation of certain protein families, experimental conditions, or structural classes. Performance may vary for underrepresented protein types.
|
| 181 |
- **Dataset limitations:** PDB historically lacks comprehensive data on protein conformations, post-translational modifications, disordered regions, etc. Like all other structure prediction models trained on the PDB, performance may degrade on other biomolecules.
|
| 182 |
- **Computational demand:** Highest accuracy structure predictions require scaling inference time compute. Predictions made with reduced inference parameters may lead to suboptimal performance.
|
|
|
|
| 198 |
|
| 199 |
### Citation
|
| 200 |
|
| 201 |
+
```
|
| 202 |
+
@misc{candido2026language,
|
| 203 |
+
title = {Language Modeling Materializes a World Model of Protein Biology},
|
| 204 |
+
author = {Candido, Salvatore and Hayes, Thomas and Derry, Alexander and Rao, Roshan
|
| 205 |
+
and Lin, Zeming and Verkuil, Robert and Wu, Bryan and Lee, Jin Sub
|
| 206 |
+
and Bruguera, Elise S. and Keval, Jehan A. and Kopylov, Mykhailo
|
| 207 |
+
and Pak, John E. and Wu, Wesley and Thomas, Neil and Mataraso, Samson
|
| 208 |
+
and Hsu, Alvin and Trotman-Grant, Ashton C. and Fatras, Kilian
|
| 209 |
+
and dos Santos Costa, Allan and Badkundri, Rohil and Ak{\i}n, Halil
|
| 210 |
+
and Oktay, Deniz and Deaton, Jonathan and Montabana, Elizabeth
|
| 211 |
+
and Sitwala, Hrishita and Yu, Yue and Wiggert, Marius
|
| 212 |
+
and Carlin, Dylan Alexander and Goering, Anthony W. and Blazejewski, Tomasz
|
| 213 |
+
and Sandora, McCullen and Hla, Michael and Jia, Tina Z.
|
| 214 |
+
and Kloker, Leon H. and Sofroniew, Nicholas J. and Uehara, Masatoshi
|
| 215 |
+
and Pannu, Jassi and Bachas, Sharrol and Liu, Daniel S.
|
| 216 |
+
and Sercu, Tom and Rives, Alexander},
|
| 217 |
+
year = {2026},
|
| 218 |
+
url = {https://biohub.ai/papers/esm_protein.pdf},
|
| 219 |
+
note = {Preprint}
|
| 220 |
+
}
|
| 221 |
+
```
|
| 222 |
|
| 223 |
## Acknowledgements
|
| 224 |
|