SAELens
English
sparse-autoencoder
SAE
interpretability
deception-detection
mechanistic-interpretability
neuronpedia
Solshine's picture
Initial public release: SAE weights, cfg, and model card
2325115