model card initial commit
Browse files
README.md
CHANGED
|
@@ -1,3 +1,30 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
## Dyno Psi Overview
|
| 5 |
+
|
| 6 |
+
Dyno Psi-1 is a generative protein design model for *de novo* binder design.
|
| 7 |
+
|
| 8 |
+
[](https://github.com/dynotx/dynopsi)
|
| 9 |
+
|
| 10 |
+
|
| 11 |
+
|
| 12 |
+
### Description
|
| 13 |
+
|
| 14 |
+
The Dyno Psi approach to binder design consists of a backbone generation model (Dyno Psi-1), a sequence design component, and *in silico* filters. Dyno Psi-1 generates backbones via a flow-matching-based denoising model. Sequences are designed post hoc using an inverse folding model, and candidates are evaluated with structure-based filters.
|
| 15 |
+
|
| 16 |
+
This Hugging Face repository and corresponding [GitHub repository](https://github.com/dynotx/dynopsi) enable the use of the Dyno Psi-1 backbone generation model.
|
| 17 |
+
|
| 18 |
+
Dyno Psi-1 was trained in multiple stages to improve its ability to generalize. Training began with large-scale monomer generation on structures from the PDB and AlphaFold Database (AFDB), followed by joint training on monomers and protein–protein interactions. To expand structural diversity beyond experimentally solved complexes, millions of synthetic intra-domain interaction pairs derived from AFDB monomers were generated and clustered to reduce redundancy. This mixture of experimental and synthetic interaction data enables the model to learn diverse geometric and interface features relevant to *de novo* binder generation.
|
| 19 |
+
|
| 20 |
+
|
| 21 |
+
### Architecture & Training
|
| 22 |
+
Dyno Psi-1 is a ~200M-parameter coordinate denoising model built on a scalable protein transformer with triangle updates, and has a maximum residue context window of 512.
|
| 23 |
+
|
| 24 |
+
#### Training Phases
|
| 25 |
+
| Phase | Steps | GPUs | Max Tokens | Data |
|
| 26 |
+
|-------|-------|------|------------|------|
|
| 27 |
+
| Monomer pretraining | 232k | 16× H200 | 256 | PDB monomers, AFDB monomers |
|
| 28 |
+
| Interface-aware pretraining | 143k | 96× H200 | 512 | PDB monomers, AFDB monomers, PDB multimers, ~ 4M synthetic intra-domain interaction pairs (~1M clusters) |
|
| 29 |
+
| Binder design fine-tuning | 154k | 96× H200 | 512 | PDB complexes, AF2-filtered synthetic interaction pairs |
|
| 30 |
+
|