gravelcompbio commited on
Commit
e73bf00
·
verified ·
1 Parent(s): 96a687b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -9
README.md CHANGED
@@ -9,11 +9,22 @@ tags:
9
  - biology
10
  - protein
11
  ---
12
- # Model Card for Model ID
13
 
14
  <!-- Provide a quick summary of what the model is/does. -->
15
 
16
- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
 
 
 
 
 
 
 
 
 
 
 
17
 
18
  ## Model Details
19
 
@@ -23,13 +34,13 @@ This modelcard aims to be a base template for new models. It has been generated
23
 
24
 
25
 
26
- - **Developed by:** [More Information Needed]
27
- - **Funded by [optional]:** [More Information Needed]
28
- - **Shared by [optional]:** [More Information Needed]
29
- - **Model type:** [More Information Needed]
30
- - **Language(s) (NLP):** [More Information Needed]
31
- - **License:** [More Information Needed]
32
- - **Finetuned from model [optional]:** [More Information Needed]
33
 
34
  ### Model Sources [optional]
35
 
 
9
  - biology
10
  - protein
11
  ---
12
+ # Contrastively Learned Attention based Stratified PTM Predictor (CLASPP) a unified PTM prediction model
13
 
14
  <!-- Provide a quick summary of what the model is/does. -->
15
 
16
+ Post-Translational Modifications (PTMs) are a fundamental mechanism for regulating cellular functions and increasing the functional diversity of the proteome.
17
+ Despite the identification of hundreds of unique PTMs through mass-spectrometry (MS) studies, accurately predicting many PTM types based on sequence data alone
18
+ remains a significant challenge. Existing PTM prediction models predominantly focus on either single PTM types or employ ensemble methods that combine multiple
19
+ models to predict different PTM types. This fragmentation is largely driven by the vast imbalance in data availability across PTM types making it difficult to
20
+ predict multiple PTM types with a single model. To address this limitation, we present the Contrastively Learned Attention-Based Stratified PTM Predictor (CLASPP),
21
+ a unified PTM prediction model. CLASPP overcomes data imbalance challenges by leveraging unsupervised clustering-based under-sampling and incorporating a novel
22
+ contrastive learning framework tailored to PTM data. Drawing inspiration from advancements in image and natural language processing, the CLASPP model employs
23
+ a multi-stage training strategy and utilizes a high-quality curated training dataset to improve PTM prediction accuracy compared to existing multi-PTM prediction
24
+ models. Existing PTM prediction models predominantly focus on either single PTM types or employ ensemble methods that combine multiple
25
+ models to predict different PTM types. This fragmentation is largely driven by the vast imbalance in data availability across PTM types making it difficult to
26
+ predict multiple PTM types with a single model. To address this limitation, we present the Contrastively Learned Attention-Based Stratified PTM Predictor (CLASPP),
27
+ a unified PTM prediction model.
28
 
29
  ## Model Details
30
 
 
34
 
35
 
36
 
37
+ - **Developed by:** [Nathan Gravel]
38
+ - **Funded by [optional]:** [NIH]
39
+ - **Shared by [optional]:** [More Information Neede]
40
+ - **Model type:** [Text classication]
41
+ - **Language(s) (NLP):** [Protein Sequence]
42
+ - **License:** [MIT]
43
+ - **Finetuned from model [optional]:** [ESM-2 150M]
44
 
45
  ### Model Sources [optional]
46