ProtoMechModels / README.md
ktalreja's picture
Update README.md
9f2d115 verified
---
language:
- en
base_model:
- facebook/esm2_t6_8M_UR50D
---
This repository contains models from the paper "Protein Circuit Tracing via Cross-Layer Transcoders" by Darin Tsui, Kunal Talreja, Danial Saeedi, and Amirali Aghazadeh,to be presented at ICML 2026. You can find the preprint [here](https://arxiv.org/abs/2602.12026).
# ProtoMech Models
1. `CLT_L{num_layers}_D{clt_dim}`: CLT model with checkpoints. Can be loaded with `CLT_L{num_layers}_D{clt_dim}/checkpoints/last.ckpt`
2. `PLT_L{num_layers}_D{plt_dim}`: PLT model with checkpoints. Can be loaded with `PLT_L{num_layers}_D{clt_dim}/checkpoints/last.ckpt`
3. `esm2_t6_8M_UR50D.pt`: Base ESM model used for experiments (same for `esm2_t12_35M_UR50D.pt`)
4. `eval_models`: CNN evaluation models used for probe steering, one per DMS assay (same for `eval_models_35M`)
5. `probe`: probe evaluation models for function circuit, one per DMS assay and split configuration