---
language:
- en
tags:
- protein-language-models
- sparse-autoencoder
license: mit
---

# Sparse Autoencoders for ESM-2 (650M)

Interpret protein language model representations using sparse autoencoders (SAEs) trained on individual layers of ESM-2 650M. These SAEs decompose complex neural representations into interpretable features, enabling a deeper understanding of how protein language models process sequence information.

* 📄 Model details in the [InterPLM pre-print](https://www.biorxiv.org/content/10.1101/2024.11.14.623630v1)
* 👩‍💻 Training and analysis code in the [GitHub repo](https://github.com/ElanaPearl/InterPLM)
* 🧬 Explore features at [InterPLM.ai](https://www.interplm.ai)

## Model Details

- Base Model: ESM-2 650M (33 layers)
- Architecture: Sparse Autoencoder
- Input Dimension: 1,280
- Feature Dimension: 10,240
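
For intuition, here is a minimal, generic sketch of a ReLU sparse autoencoder with these dimensions (1,280-dimensional inputs, a 10,240-feature dictionary). It is illustrative only; the actual implementation lives in the [InterPLM repo](https://github.com/ElanaPearl/InterPLM) and may differ in details such as biases, weight tying, or normalization.

```python
# Illustrative sketch only: a generic ReLU sparse autoencoder with the
# dimensions listed above (1,280 -> 10,240 -> 1,280). Not the exact
# InterPLM implementation.
import torch
import torch.nn as nn


class SparseAutoencoderSketch(nn.Module):
    def __init__(self, d_input: int = 1280, d_features: int = 10240):
        super().__init__()
        self.encoder = nn.Linear(d_input, d_features)
        self.decoder = nn.Linear(d_features, d_input)

    def encode(self, x: torch.Tensor) -> torch.Tensor:
        # Sparse, non-negative feature activations
        return torch.relu(self.encoder(x))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Reconstruct the original embedding from the sparse features
        return self.decoder(self.encode(x))


# One residue embedding (1,280-dim) maps to 10,240 feature activations
x = torch.randn(1, 1280)
feats = SparseAutoencoderSketch().encode(x)  # shape: (1, 10240)
```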

## Available Models

We provide SAE models trained on different layers of ESM-2 650M:

| Model name | ESM-2 model | ESM-2 layer |
|------------|-------------|-------------|
| [InterPLM-esm2-650m-l1](https://huggingface.co/Elana/InterPLM-esm2-650m/tree/main/layer_1) | esm2_t33_650M_UR50D | 1 |
| [InterPLM-esm2-650m-l9](https://huggingface.co/Elana/InterPLM-esm2-650m/tree/main/layer_9) | esm2_t33_650M_UR50D | 9 |
| [InterPLM-esm2-650m-l18](https://huggingface.co/Elana/InterPLM-esm2-650m/tree/main/layer_18) | esm2_t33_650M_UR50D | 18 |
| [InterPLM-esm2-650m-l24](https://huggingface.co/Elana/InterPLM-esm2-650m/tree/main/layer_24) | esm2_t33_650M_UR50D | 24 |
| [InterPLM-esm2-650m-l30](https://huggingface.co/Elana/InterPLM-esm2-650m/tree/main/layer_30) | esm2_t33_650M_UR50D | 30 |
| [InterPLM-esm2-650m-l33](https://huggingface.co/Elana/InterPLM-esm2-650m/tree/main/layer_33) | esm2_t33_650M_UR50D | 33 |

All models share the same architecture and dictionary size (10,240). SAEs trained on ESM-2 8M are available [here](https://huggingface.co/Elana/InterPLM-esm2-8m). The 650M SAEs capture more known biological concepts than the 8M SAEs, but they require additional compute for both ESM embedding and SAE feature extraction.
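
If you prefer to fetch weights directly rather than through the `interplm` helpers shown in the Usage section below, each layer's files can be downloaded from this repo with `huggingface_hub`. The `layer_18/ae_normalized.pt` path here is inferred from the table above and the Model Normalization section; check the repo's file listing if your layout differs.

```python
# Sketch: download one layer's SAE weights directly from the Hub.
# The layer_*/ae_normalized.pt layout is inferred from this card; verify
# against the repo's file tree before relying on it.
from huggingface_hub import hf_hub_download
import torch

weights_path = hf_hub_download(
    repo_id="Elana/InterPLM-esm2-650m",
    filename="layer_18/ae_normalized.pt",
)
state = torch.load(weights_path, map_location="cpu")
```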

## Usage

```python
from interplm.sae.inference import load_sae_from_hf
from interplm.esm.embed import embed_single_sequence

# Get ESM-2 embeddings for a protein sequence
embeddings = embed_single_sequence(
    sequence="MRWQEMGYIFYPRKLR",
    model_name="esm2_t33_650M_UR50D",
    layer=18,  # choose the ESM layer (1, 9, 18, 24, 30, 33)
)

# Load the matching SAE and extract features
sae = load_sae_from_hf(plm_model="esm2-650m", plm_layer=18)
features = sae.encode(embeddings)
```
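
As a quick follow-up, you can inspect which features fire most strongly at each residue. This sketch assumes `features` from the snippet above is a `(sequence_length, 10240)` tensor; adjust it if your version returns a different shape.

```python
# Assumes `features` from the snippet above is a (sequence_length, 10240) tensor.
import torch

values, feature_ids = torch.topk(features, k=5, dim=-1)  # top 5 features per residue
for pos, (v, ids) in enumerate(zip(values, feature_ids)):
    print(f"residue {pos}: features {ids.tolist()} with activations {v.tolist()}")
```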

For details on training and analyzing SAEs on PLMs, see the [GitHub README](https://github.com/ElanaPearl/InterPLM/blob/main/README.md).

## Model Normalization |
|
|
The SAEs we've trained have arbitrary scales between features since encoder/decoder weights could be linearly scaled without changing reconstructions. To make features comparable, we normalize them to activate between 0-1 based on max activation values from Swiss-Prot (since this is our primary analysis dataset). By default, use our pre-normalized SAEs (`ae_normalized.pt`). As this might not perfectly scale features not present in Swiss-Prot proteins, for custom normalization use `ae_unnormalized.pt` with [this code](https://github.com/ElanaPearl/InterPLM/blob/main/interplm/sae/normalize.py). |