File size: 1,429 Bytes
d294de1
 
 
 
 
 
 
 
 
b7d1baf
d294de1
b7d1baf
 
 
 
d294de1
b7d1baf
 
 
 
 
 
d294de1
b7d1baf
d294de1
b7d1baf
 
 
d294de1
 
 
b7d1baf
d294de1
 
b7d1baf
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
license: mit
library_name: pytorch
tags:
  - protein-ligand
  - drug-target-interaction
  - contrastive-learning
---

# LogiCA — pretrained protein-ligand backbones (anonymous, under review)

Anonymous artifact for double-blind review. Pretrained checkpoints for a
logit-space contrastive alignment model: a bidirectional ESM-2 (protein) +
SELFormer (ligand) backbone with a cross-attention adapter, pretrained on
BindingDB with a token-likelihood contrastive objective (40 epochs).

| Size | Protein backbone | File | Weights |
| --- | --- | --- | --- |
| 8M   | `esm2_t6_8M_UR50D`    | `checkpoints/8m/best.pt`   | 19 MB |
| 35M  | `esm2_t12_35M_UR50D`  | `checkpoints/35m/best.pt`  | 38 MB |
| 150M | `esm2_t30_150M_UR50D` | `checkpoints/150m/best.pt` | 64 MB |
| 650M | `esm2_t33_650M_UR50D` | `checkpoints/650m/best.pt` | 97 MB |

Ligand encoder: `HUBioDataLab/SELFormer` (all sizes).

Checkpoints are weights-only (`model_state_dict` + `epoch` + `metrics`); load with
`strict=False` into the bidirectional model. Download the size you need and pass it
to the training code as the pretrained checkpoint:

```python
from huggingface_hub import hf_hub_download
ckpt = hf_hub_download(REPO_ID, "checkpoints/150m/best.pt")  # REPO_ID withheld for review
```

Training and fine-tuning code is provided as an anonymized supplement with the
submission. Repository, authors, and citation are withheld for double-blind review.