| --- |
| license: apache-2.0 |
| tags: |
| - biology |
| - genomics |
| - DNA |
| --- |
| |
| <img src="https://cdn-uploads.huggingface.co/production/uploads/649aee789fc303937a045f6a/IGUfG31MMvDzhdjRK-nlJ.jpeg" width="70%" /> |
|
|
| ## Evo 2 |
|
|
| Evo 2 is a state-of-the-art DNA language model trained autoregressively on trillions of DNA tokens. |
|
|
| For instructions, details, and examples, please refer to the [github](https://github.com/ArcInstitute/evo2) and [paper](). |
|
|
| Evo 2 40B and 7B checkpoints, trained up to 1 million sequence length, are available here: |
| | Checkpoint name | Num layers | Num parameters | |
| |------------------------------|----|----------| |
| | [evo2_40b](https://huggingface.co/arcinstitute/evo2_40b) | 50 | 40B | |
| | [evo2_7b](https://huggingface.co/arcinstitute/evo2_7b) | 32 | 7B | |
|
|
| We also share 40B, 7B, and 1B base checkpoints trained on 8192 context length: |
| | Checkpoint name | Num layers | Num parameters | |
| |------------------------------|----|----------| |
| | [evo2_40b_base](https://huggingface.co/arcinstitute/evo2_40b_base) | 50 | 40B | |
| | [evo2_7b_base](https://huggingface.co/arcinstitute/evo2_7b_base) | 32 | 7B | |
| | [evo2_1b_base](https://huggingface.co/arcinstitute/evo2_1b_base) | 25 | 1B | |