Update README.md
README.md
CHANGED
@@ -10,6 +10,9 @@ arxiv: 2502.07272
 ---
 # GENERator-eukaryote-1.2b-base model
 
+## Important Notice !!!
+An issue was identified in the `model.safetensors` file of the initial release, likely caused by an unstable internet connection during upload. If you downloaded **GENERator-eukaryote-1.2b-base** before **February 26, 2025**, please re-download the model to ensure optimal and reliable performance.
+
 ## Abouts
 In this repository, we present GENERator, a generative genomic foundation model featuring a context length of 98k base pairs and 1.2B parameters, trained on an expansive dataset comprising 386 billion base pairs of eukaryotic DNA. Our evaluations demonstrate that the GENERator consistently achieves state-of-the-art performance across a wide spectrum of benchmarks, including [Genomic Benchmarks](https://huggingface.co/datasets/katielink/genomic-benchmarks/tree/main), [NT tasks](https://huggingface.co/datasets/InstaDeepAI/nucleotide_transformer_downstream_tasks_revised), and our newly proposed [Gener tasks](https://huggingface.co/GenerTeam).
 
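The notice added in this change asks anyone who downloaded `model.safetensors` before February 26, 2025 to fetch it again. As a rough sketch of how a local copy could be checked, the snippet below compares a file against a SHA-256 digest when one is available and otherwise falls back to the file's modification time. The names `sha256_of_file`, `needs_redownload`, and `FIX_DATE` are hypothetical helpers, not part of the repository; treating the cutoff as midnight UTC is an assumption, and the expected digest must be supplied by the reader since the README does not state one.

```python
import hashlib
from datetime import datetime, timezone
from pathlib import Path

# Cutoff date taken from the notice; midnight UTC is an assumption.
FIX_DATE = datetime(2025, 2, 26, tzinfo=timezone.utc)


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def needs_redownload(path, expected_sha256=None, cutoff=FIX_DATE):
    """Heuristic check for a stale or damaged local weights file.

    Returns True when the file is missing, when its digest differs from
    expected_sha256 (if given), or, with no digest available, when its
    modification time is earlier than the cutoff.
    """
    path = Path(path)
    if not path.is_file():
        return True
    if expected_sha256 is not None:
        return sha256_of_file(path) != expected_sha256.lower()
    modified = datetime.fromtimestamp(path.stat().st_mtime, tz=timezone.utc)
    return modified < cutoff
```

The modification-time fallback is only a weak signal, since copying or syncing a file can change its timestamp; a digest comparison is more reliable when a trusted value is at hand. If the check returns True, one option is to download the files again, for example with `snapshot_download(..., force_download=True)` from `huggingface_hub`, passing the repository id of this model.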