--- license: mit library_name: pytorch tags: - protein-ligand - drug-target-interaction - contrastive-learning --- # LogiCA — pretrained protein-ligand backbones (anonymous, under review) Anonymous artifact for double-blind review. Pretrained checkpoints for a logit-space contrastive alignment model: a bidirectional ESM-2 (protein) + SELFormer (ligand) backbone with a cross-attention adapter, pretrained on BindingDB with a token-likelihood contrastive objective (40 epochs). | Size | Protein backbone | File | Weights | | --- | --- | --- | --- | | 8M | `esm2_t6_8M_UR50D` | `checkpoints/8m/best.pt` | 19 MB | | 35M | `esm2_t12_35M_UR50D` | `checkpoints/35m/best.pt` | 38 MB | | 150M | `esm2_t30_150M_UR50D` | `checkpoints/150m/best.pt` | 64 MB | | 650M | `esm2_t33_650M_UR50D` | `checkpoints/650m/best.pt` | 97 MB | Ligand encoder: `HUBioDataLab/SELFormer` (all sizes). Checkpoints are weights-only (`model_state_dict` + `epoch` + `metrics`); load with `strict=False` into the bidirectional model. Download the size you need and pass it to the training code as the pretrained checkpoint: ```python from huggingface_hub import hf_hub_download ckpt = hf_hub_download(REPO_ID, "checkpoints/150m/best.pt") # REPO_ID withheld for review ``` Training and fine-tuning code is provided as an anonymized supplement with the submission. Repository, authors, and citation are withheld for double-blind review.