Update README.md
Browse files
README.md
CHANGED
|
@@ -9,11 +9,22 @@ tags:
|
|
| 9 |
- biology
|
| 10 |
- protein
|
| 11 |
---
|
| 12 |
-
#
|
| 13 |
|
| 14 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 15 |
|
| 16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
|
| 18 |
## Model Details
|
| 19 |
|
|
@@ -23,13 +34,13 @@ This modelcard aims to be a base template for new models. It has been generated
|
|
| 23 |
|
| 24 |
|
| 25 |
|
| 26 |
-
- **Developed by:** [
|
| 27 |
-
- **Funded by [optional]:** [
|
| 28 |
-
- **Shared by [optional]:** [More Information
|
| 29 |
-
- **Model type:** [
|
| 30 |
-
- **Language(s) (NLP):** [
|
| 31 |
-
- **License:** [
|
| 32 |
-
- **Finetuned from model [optional]:** [
|
| 33 |
|
| 34 |
### Model Sources [optional]
|
| 35 |
|
|
|
|
| 9 |
- biology
|
| 10 |
- protein
|
| 11 |
---
|
| 12 |
+
# Contrastively Learned Attention based Stratified PTM Predictor (CLASPP) a unified PTM prediction model
|
| 13 |
|
| 14 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 15 |
|
| 16 |
+
Post-Translational Modifications (PTMs) are a fundamental mechanism for regulating cellular functions and increasing the functional diversity of the proteome.
|
| 17 |
+
Despite the identification of hundreds of unique PTMs through mass-spectrometry (MS) studies, accurately predicting many PTM types based on sequence data alone
|
| 18 |
+
remains a significant challenge. Existing PTM prediction models predominantly focus on either single PTM types or employ ensemble methods that combine multiple
|
| 19 |
+
models to predict different PTM types. This fragmentation is largely driven by the vast imbalance in data availability across PTM types making it difficult to
|
| 20 |
+
predict multiple PTM types with a single model. To address this limitation, we present the Contrastively Learned Attention-Based Stratified PTM Predictor (CLASPP),
|
| 21 |
+
a unified PTM prediction model. CLASPP overcomes data imbalance challenges by leveraging unsupervised clustering-based under-sampling and incorporating a novel
|
| 22 |
+
contrastive learning framework tailored to PTM data. Drawing inspiration from advancements in image and natural language processing, the CLASPP model employs
|
| 23 |
+
a multi-stage training strategy and utilizes a high-quality curated training dataset to improve PTM prediction accuracy compared to existing multi-PTM prediction
|
| 24 |
+
models. Existing PTM prediction models predominantly focus on either single PTM types or employ ensemble methods that combine multiple
|
| 25 |
+
models to predict different PTM types. This fragmentation is largely driven by the vast imbalance in data availability across PTM types making it difficult to
|
| 26 |
+
predict multiple PTM types with a single model. To address this limitation, we present the Contrastively Learned Attention-Based Stratified PTM Predictor (CLASPP),
|
| 27 |
+
a unified PTM prediction model.
|
| 28 |
|
| 29 |
## Model Details
|
| 30 |
|
|
|
|
| 34 |
|
| 35 |
|
| 36 |
|
| 37 |
+
- **Developed by:** [Nathan Gravel]
|
| 38 |
+
- **Funded by [optional]:** [NIH]
|
| 39 |
+
- **Shared by [optional]:** [More Information Neede]
|
| 40 |
+
- **Model type:** [Text classication]
|
| 41 |
+
- **Language(s) (NLP):** [Protein Sequence]
|
| 42 |
+
- **License:** [MIT]
|
| 43 |
+
- **Finetuned from model [optional]:** [ESM-2 150M]
|
| 44 |
|
| 45 |
### Model Sources [optional]
|
| 46 |
|