File size: 927 Bytes
4a12549 6f26557 4a12549 4504315 00f2585 aa58dee 9f2d115 00f2585 b279e20 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 | ---
language:
- en
base_model:
- facebook/esm2_t6_8M_UR50D
---
This repository contains models from the paper "Protein Circuit Tracing via Cross-Layer Transcoders" by Darin Tsui, Kunal Talreja, Danial Saeedi, and Amirali Aghazadeh,to be presented at ICML 2026. You can find the preprint [here](https://arxiv.org/abs/2602.12026).
# ProtoMech Models
1. `CLT_L{num_layers}_D{clt_dim}`: CLT model with checkpoints. Can be loaded with `CLT_L{num_layers}_D{clt_dim}/checkpoints/last.ckpt`
2. `PLT_L{num_layers}_D{plt_dim}`: PLT model with checkpoints. Can be loaded with `PLT_L{num_layers}_D{clt_dim}/checkpoints/last.ckpt`
3. `esm2_t6_8M_UR50D.pt`: Base ESM model used for experiments (same for `esm2_t12_35M_UR50D.pt`)
4. `eval_models`: CNN evaluation models used for probe steering, one per DMS assay (same for `eval_models_35M`)
5. `probe`: probe evaluation models for function circuit, one per DMS assay and split configuration |