Add metadata, link to paper and code
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,5 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# PDeepPP: A Comprehensive Protein Language Model Hub
|
| 2 |
|
|
|
|
|
|
|
| 3 |
PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
|
| 4 |
|
| 5 |
The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP/tree/main])
|
|
@@ -87,7 +95,7 @@ To use any of the models, you need to install the required dependencies, such as
|
|
| 87 |
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
|
| 88 |
pip install transformers
|
| 89 |
```
|
| 90 |
-
Here’s a quick example of how to load a model(The use of models with specific biological features can be found in Task-Specific Models.):
|
| 91 |
|
| 92 |
```python
|
| 93 |
from transformers import AutoModel, AutoTokenizer
|
|
@@ -109,13 +117,16 @@ Refer to the `PDeepPPConfig` class in the source repository for details on avail
|
|
| 109 |
|
| 110 |
---
|
| 111 |
## Citation
|
| 112 |
-
|
|
|
|
| 113 |
|
| 114 |
```
|
| 115 |
-
|
| 116 |
title={A general language model for peptide identification},
|
| 117 |
author={Author Name},
|
| 118 |
journal={Journal Name},
|
| 119 |
year={2025}
|
| 120 |
}
|
| 121 |
-
```
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
library_name: transformers
|
| 4 |
+
pipeline_tag: feature-extraction
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
# PDeepPP: A Comprehensive Protein Language Model Hub
|
| 8 |
|
| 9 |
+
This repository contains the model as presented in [PDeepPP:A Deep learning framework with Pretrained Protein language for peptide classification](https://huggingface.co/papers/2502.15610).
|
| 10 |
+
|
| 11 |
PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
|
| 12 |
|
| 13 |
The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP/tree/main])
|
|
|
|
| 95 |
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
|
| 96 |
pip install transformers
|
| 97 |
```
|
| 98 |
+
Here’s a quick example of how to load a model (The use of models with specific biological features can be found in Task-Specific Models.):
|
| 99 |
|
| 100 |
```python
|
| 101 |
from transformers import AutoModel, AutoTokenizer
|
|
|
|
| 117 |
|
| 118 |
---
|
| 119 |
## Citation
|
| 120 |
+
|
| 121 |
+
If you use any of the PDeepPP models in your research, please cite the associated paper:
|
| 122 |
|
| 123 |
```
|
| 124 |
+
@article{your_reference,
|
| 125 |
title={A general language model for peptide identification},
|
| 126 |
author={Author Name},
|
| 127 |
journal={Journal Name},
|
| 128 |
year={2025}
|
| 129 |
}
|
| 130 |
+
```
|
| 131 |
+
|
| 132 |
+
Code: [https://github.com/fondress/PDeepPP](https://github.com/fondress/PDeepPP)
|