ebetica commited on
Commit
f767eed
·
verified ·
1 Parent(s): e8d47c8

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +24 -11
README.md CHANGED
@@ -17,7 +17,7 @@ tags:
17
  - transformers
18
  ---
19
 
20
- # Model Card for ESMFold2
21
 
22
  ## Model Details
23
 
@@ -33,8 +33,8 @@ Read more about ESMFold2 in our paper [here](https://biohub.ai/papers/esm_protei
33
 
34
  | Model | MSA Conditioning | Description | Data Cutoff |
35
  | :---- | :---- | :---- | :---- |
36
- | [esmfold2](https://huggingface.co/biohub/esmfold2) | Yes | Large model, capable of either single-sequence or MSA conditioned structure prediction for improved accuracy on difficult targets | Sept 2021 |
37
- | [esmfold2-fast](https://huggingface.co/biohub/esmfold2-fast) | No | Inference optimized single-sequence structure prediction model | Sept 2021 |
38
 
39
  ## Performance Metrics
40
 
@@ -177,13 +177,6 @@ Informed by our risk assessments, we are releasing the source code and model wei
177
 
178
  ## Biases and Limitations
179
 
180
- ### Potential Biases
181
-
182
- - The model may reflect biases present in the training data.
183
- -
184
-
185
- ### Limitations
186
-
187
  - **Dataset biases**: The model may reflect biases present in the training data (PDB, AFDB), including over-representation of certain protein families, experimental conditions, or structural classes. Performance may vary for underrepresented protein types.
188
  - **Dataset limitations:** PDB historically lacks comprehensive data on protein conformations, post-translational modifications, disordered regions, etc. Like all other structure prediction models trained on the PDB, performance may degrade on other biomolecules.
189
  - **Computational demand:** Highest accuracy structure predictions require scaling inference time compute. Predictions made with reduced inference parameters may lead to suboptimal performance.
@@ -205,7 +198,27 @@ Should you have any security or privacy issues or questions related to this mode
205
 
206
  ### Citation
207
 
208
- Coming soon.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
209
 
210
  ## Acknowledgements
211
 
 
17
  - transformers
18
  ---
19
 
20
+ # ESMFold2
21
 
22
  ## Model Details
23
 
 
33
 
34
  | Model | MSA Conditioning | Description | Data Cutoff |
35
  | :---- | :---- | :---- | :---- |
36
+ | [ESMFold2](https://huggingface.co/biohub/ESMFold2) | Yes | Large model, capable of either single-sequence or MSA conditioned structure prediction for improved accuracy on difficult targets | Sept 2021 |
37
+ | [ESMFold2-Fast](https://huggingface.co/biohub/ESMFold2-Fast) | No | Inference optimized single-sequence structure prediction model | Sept 2021 |
38
 
39
  ## Performance Metrics
40
 
 
177
 
178
  ## Biases and Limitations
179
 
 
 
 
 
 
 
 
180
  - **Dataset biases**: The model may reflect biases present in the training data (PDB, AFDB), including over-representation of certain protein families, experimental conditions, or structural classes. Performance may vary for underrepresented protein types.
181
  - **Dataset limitations:** PDB historically lacks comprehensive data on protein conformations, post-translational modifications, disordered regions, etc. Like all other structure prediction models trained on the PDB, performance may degrade on other biomolecules.
182
  - **Computational demand:** Highest accuracy structure predictions require scaling inference time compute. Predictions made with reduced inference parameters may lead to suboptimal performance.
 
198
 
199
  ### Citation
200
 
201
+ ```
202
+ @misc{candido2026language,
203
+ title = {Language Modeling Materializes a World Model of Protein Biology},
204
+ author = {Candido, Salvatore and Hayes, Thomas and Derry, Alexander and Rao, Roshan
205
+ and Lin, Zeming and Verkuil, Robert and Wu, Bryan and Lee, Jin Sub
206
+ and Bruguera, Elise S. and Keval, Jehan A. and Kopylov, Mykhailo
207
+ and Pak, John E. and Wu, Wesley and Thomas, Neil and Mataraso, Samson
208
+ and Hsu, Alvin and Trotman-Grant, Ashton C. and Fatras, Kilian
209
+ and dos Santos Costa, Allan and Badkundri, Rohil and Ak{\i}n, Halil
210
+ and Oktay, Deniz and Deaton, Jonathan and Montabana, Elizabeth
211
+ and Sitwala, Hrishita and Yu, Yue and Wiggert, Marius
212
+ and Carlin, Dylan Alexander and Goering, Anthony W. and Blazejewski, Tomasz
213
+ and Sandora, McCullen and Hla, Michael and Jia, Tina Z.
214
+ and Kloker, Leon H. and Sofroniew, Nicholas J. and Uehara, Masatoshi
215
+ and Pannu, Jassi and Bachas, Sharrol and Liu, Daniel S.
216
+ and Sercu, Tom and Rives, Alexander},
217
+ year = {2026},
218
+ url = {https://biohub.ai/papers/esm_protein.pdf},
219
+ note = {Preprint}
220
+ }
221
+ ```
222
 
223
  ## Acknowledgements
224