kouroshSA commited on
Commit
eefe55e
·
verified ·
1 Parent(s): ff8cc28

Add MED4 epoch-4 checkpoint used for low-confidence Y2H screening, with sidecar note

Browse files
Files changed (1) hide show
  1. checkpoints/ppiBTPE_epoch_4.md +34 -0
checkpoints/ppiBTPE_epoch_4.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ppiBTPE_epoch_4.pth
2
+
3
+ **Checkpoint used for screening low-confidence Y2H pairs in the *Prochlorococcus marinus* MED4 interactome.**
4
+
5
+ ## Provenance
6
+
7
+ | | |
8
+ |---|---|
9
+ | Model | ppiBTEP (Siamese / twin-branch ESM-1b-inspired transformer, trained from scratch) |
10
+ | Architecture | 12 transformer layers |
11
+ | Epoch | 4 |
12
+ | File size | ~913 MB |
13
+ | Training run | `out_june17_MED4_l12` (June 17, 2025) |
14
+ | Training set | `train_MED4_ppiBTEPM-pseudo_Int_combo1-2-3.csv` (≈13,008 pairs, pre-clean — see note below) |
15
+ | Validation set | `val_MED4_100_Y2H-RND_ppiBRTPM.csv` |
16
+
17
+ ## Intended use
18
+
19
+ Inference / screening of candidate MED4 protein–protein interactions that
20
+ were originally flagged as **low-confidence Y2H hits**. The model is run on
21
+ each candidate pair and its softmax probability is used (in concert with the
22
+ other tri-model components, ppiDCE and ppiGPLM) to retain or discard the pair.
23
+
24
+ ## Notes
25
+
26
+ - This checkpoint was produced **before** the PRS/RRS de-overlapping pass on
27
+ `train.csv` (see [`MED4-PPIs-low-confidence_ppiTEPM_prompts.csv`](../MED4-PPIs-low-confidence_ppiTEPM_prompts.csv) and the cleaned
28
+ `train.clean.csv` / `train.clean2x.csv` companions). Approximately 608 of
29
+ the 13,008 training rows (4.67 %) overlap with the PRS+RRS evaluation pairs
30
+ in either orientation. Treat metrics on those pairs accordingly.
31
+ - Loading: use `train_ppiBTPE3b.py` / `inference_ppiBTPE_2GPU.py` from the
32
+ parent repo with `--num_layers 12` and `--model_config
33
+ facebook/esm1b_t33_650M_UR50S` (config-only; weights are loaded from this
34
+ checkpoint, not from the HF ESM-1b release).