Update pipeline tag, add specific tags, and refine citation

#2
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +15 -9
README.md CHANGED
@@ -1,7 +1,12 @@
1
  ---
2
- license: mit
3
  library_name: transformers
4
- pipeline_tag: feature-extraction
 
 
 
 
 
 
5
  ---
6
 
7
  # PDeepPP: A Comprehensive Protein Language Model Hub
@@ -10,7 +15,7 @@ This repository contains the model as presented in [A general language model for
10
 
11
  PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
12
 
13
- The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP/tree/main])
14
 
15
  ---
16
 
@@ -27,7 +32,7 @@ This repository contains links to multiple task-specific PDeepPP models. These m
27
  - **Flexible Architecture**: Combines self-attention and convolutional operations for robust feature extraction.
28
  - **Task-Specific Models**: Includes pre-trained models for PTM prediction, bioactivity classification, and more.
29
  - **Dataset Support**: Models are validated on datasets such as PTM and BPS, ensuring performance on real-world tasks.
30
- - **Extensibility**: Users can fine-tune the models on custom datasets for new tasks.
31
 
32
  ---
33
 
@@ -120,12 +125,13 @@ Refer to the `PDeepPPConfig` class in the source repository for details on avail
120
 
121
  If you use any of the PDeepPP models in your research, please cite the associated paper:
122
 
123
- ```
124
- @article{your_reference,
125
  title={A general language model for peptide identification},
126
- author={Author Name},
127
- journal={Journal Name},
128
- year={2025}
 
129
  }
130
  ```
131
 
 
1
  ---
 
2
  library_name: transformers
3
+ license: mit
4
+ pipeline_tag: text-classification
5
+ tags:
6
+ - peptide
7
+ - protein
8
+ - bioinformatics
9
+ - biology
10
  ---
11
 
12
  # PDeepPP: A Comprehensive Protein Language Model Hub
 
15
 
16
  PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
17
 
18
+ The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP])
19
 
20
  ---
21
 
 
32
  - **Flexible Architecture**: Combines self-attention and convolutional operations for robust feature extraction.
33
  - **Task-Specific Models**: Includes pre-trained models for PTM prediction, bioactivity classification, and more.
34
  - **Dataset Support**: Models are validated on datasets such as PTM and BPS, ensuring performance on real-world tasks.
35
+ - **Extensibility**: Users can fine-tune the model on custom datasets for new tasks.
36
 
37
  ---
38
 
 
125
 
126
  If you use any of the PDeepPP models in your research, please cite the associated paper:
127
 
128
+ ```bibtex
129
+ @article{pdeeppp2025peptide,
130
  title={A general language model for peptide identification},
131
+ author={{PDeepPP Authors}},
132
+ journal={arXiv preprint arXiv:2502.15610},
133
+ year={2025},
134
+ url={https://arxiv.org/abs/2502.15610}
135
  }
136
  ```
137