Update README.md
Browse files
README.md
CHANGED
|
@@ -18,8 +18,6 @@ model-index:
|
|
| 18 |
|
| 19 |
**TropicBERT** is the first genomic foundation model series specifically designed for **tropical fruit crop genome sequences**. We pre-trained BERT models using the **MLM (Masked Language Modeling)** objective on datasets comprising **13 different species combinations**, releasing a total of **13 pre-trained model variants** covering 1, 5, and 10 species.
|
| 20 |
|
| 21 |
-
This project aims to explore the impact of genomic diversity and data scale on model capability. All models are trained on high-quality genome assemblies and serve as general feature extractors for plant genomics research.
|
| 22 |
-
|
| 23 |
The models are developed based on **TropicBERT-LLMs_One_stop_tutorial**, a reproducible pipeline that successfully transfers the "pre-training and fine-tuning" paradigm from NLP to genomics, significantly lowering the barrier for developing plant genomic large language models.
|
| 24 |
|
| 25 |
🔗 **Related Resources**:
|
|
|
|
| 18 |
|
| 19 |
**TropicBERT** is the first genomic foundation model series specifically designed for **tropical fruit crop genome sequences**. We pre-trained BERT models using the **MLM (Masked Language Modeling)** objective on datasets comprising **13 different species combinations**, releasing a total of **13 pre-trained model variants** covering 1, 5, and 10 species.
|
| 20 |
|
|
|
|
|
|
|
| 21 |
The models are developed based on **TropicBERT-LLMs_One_stop_tutorial**, a reproducible pipeline that successfully transfers the "pre-training and fine-tuning" paradigm from NLP to genomics, significantly lowering the barrier for developing plant genomic large language models.
|
| 22 |
|
| 23 |
🔗 **Related Resources**:
|