Add metadata, link to paper and code

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +15 -4
README.md CHANGED
@@ -1,5 +1,13 @@
 
 
 
 
 
 
1
  # PDeepPP: A Comprehensive Protein Language Model Hub
2
 
 
 
3
  PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
4
 
5
  The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP/tree/main])
@@ -87,7 +95,7 @@ To use any of the models, you need to install the required dependencies, such as
87
  pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
88
  pip install transformers
89
  ```
90
- Here’s a quick example of how to load a model(The use of models with specific biological features can be found in Task-Specific Models.):
91
 
92
  ```python
93
  from transformers import AutoModel, AutoTokenizer
@@ -109,13 +117,16 @@ Refer to the `PDeepPPConfig` class in the source repository for details on avail
109
 
110
  ---
111
  ## Citation
112
- If you use any of the PDeepPP models in your research, please cite the associated paper or repository:
 
113
 
114
  ```
115
- @article{your_reference,
116
  title={A general language model for peptide identification},
117
  author={Author Name},
118
  journal={Journal Name},
119
  year={2025}
120
  }
121
- ```
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: transformers
4
+ pipeline_tag: feature-extraction
5
+ ---
6
+
7
  # PDeepPP: A Comprehensive Protein Language Model Hub
8
 
9
+ This repository contains the model as presented in [PDeepPP:A Deep learning framework with Pretrained Protein language for peptide classification](https://huggingface.co/papers/2502.15610).
10
+
11
  PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
12
 
13
  The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP/tree/main])
 
95
  pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
96
  pip install transformers
97
  ```
98
+ Here’s a quick example of how to load a model (The use of models with specific biological features can be found in Task-Specific Models.):
99
 
100
  ```python
101
  from transformers import AutoModel, AutoTokenizer
 
117
 
118
  ---
119
  ## Citation
120
+
121
+ If you use any of the PDeepPP models in your research, please cite the associated paper:
122
 
123
  ```
124
+ @article{your_reference,
125
  title={A general language model for peptide identification},
126
  author={Author Name},
127
  journal={Journal Name},
128
  year={2025}
129
  }
130
+ ```
131
+
132
+ Code: [https://github.com/fondress/PDeepPP](https://github.com/fondress/PDeepPP)