--- license: mit --- **Sparse Autoencoders for *Evo 2*** — BatchTopK sparse autoencoders for Arc Institute's Evo 2 genomic foundation model. Evo 2 is a genomic foundation model capable of generalist prediction and design tasks across DNA, RNA, and proteins. It uses a frontier deep learning architecture to enable modeling of biological sequences at single-nucleotide resolution with near-linear scaling of compute and memory relative to context length. Evo 2 is trained with 40 billion parameters and 1 megabase context length on over 9 trillion nucleotides of diverse eukaryotic and prokaryotic genomes. This repository contains the layer 26 SAE mixed prokaryote/eukaryote SAE used in the Evo 2 paper. [More on Evo 2](https://arcinstitute.org/tools/evo)