logica / README.md
super-dainiu's picture
Update model card
b7d1baf verified
|
Raw
History Blame Contribute Delete
1.43 kB
---
license: mit
library_name: pytorch
tags:
- protein-ligand
- drug-target-interaction
- contrastive-learning
---
# LogiCA — pretrained protein-ligand backbones (anonymous, under review)
Anonymous artifact for double-blind review. Pretrained checkpoints for a
logit-space contrastive alignment model: a bidirectional ESM-2 (protein) +
SELFormer (ligand) backbone with a cross-attention adapter, pretrained on
BindingDB with a token-likelihood contrastive objective (40 epochs).
| Size | Protein backbone | File | Weights |
| --- | --- | --- | --- |
| 8M | `esm2_t6_8M_UR50D` | `checkpoints/8m/best.pt` | 19 MB |
| 35M | `esm2_t12_35M_UR50D` | `checkpoints/35m/best.pt` | 38 MB |
| 150M | `esm2_t30_150M_UR50D` | `checkpoints/150m/best.pt` | 64 MB |
| 650M | `esm2_t33_650M_UR50D` | `checkpoints/650m/best.pt` | 97 MB |
Ligand encoder: `HUBioDataLab/SELFormer` (all sizes).
Checkpoints are weights-only (`model_state_dict` + `epoch` + `metrics`); load with
`strict=False` into the bidirectional model. Download the size you need and pass it
to the training code as the pretrained checkpoint:
```python
from huggingface_hub import hf_hub_download
ckpt = hf_hub_download(REPO_ID, "checkpoints/150m/best.pt") # REPO_ID withheld for review
```
Training and fine-tuning code is provided as an anonymized supplement with the
submission. Repository, authors, and citation are withheld for double-blind review.