Update pipeline tag, add specific tags, and refine citation
#2
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,7 +1,12 @@
|
|
| 1 |
---
|
| 2 |
-
license: mit
|
| 3 |
library_name: transformers
|
| 4 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
# PDeepPP: A Comprehensive Protein Language Model Hub
|
|
@@ -10,7 +15,7 @@ This repository contains the model as presented in [A general language model for
|
|
| 10 |
|
| 11 |
PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
|
| 12 |
|
| 13 |
-
The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP
|
| 14 |
|
| 15 |
---
|
| 16 |
|
|
@@ -27,7 +32,7 @@ This repository contains links to multiple task-specific PDeepPP models. These m
|
|
| 27 |
- **Flexible Architecture**: Combines self-attention and convolutional operations for robust feature extraction.
|
| 28 |
- **Task-Specific Models**: Includes pre-trained models for PTM prediction, bioactivity classification, and more.
|
| 29 |
- **Dataset Support**: Models are validated on datasets such as PTM and BPS, ensuring performance on real-world tasks.
|
| 30 |
-
- **Extensibility**: Users can fine-tune the
|
| 31 |
|
| 32 |
---
|
| 33 |
|
|
@@ -120,12 +125,13 @@ Refer to the `PDeepPPConfig` class in the source repository for details on avail
|
|
| 120 |
|
| 121 |
If you use any of the PDeepPP models in your research, please cite the associated paper:
|
| 122 |
|
| 123 |
-
```
|
| 124 |
-
@article{
|
| 125 |
title={A general language model for peptide identification},
|
| 126 |
-
author={
|
| 127 |
-
journal={
|
| 128 |
-
year={2025}
|
|
|
|
| 129 |
}
|
| 130 |
```
|
| 131 |
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
library_name: transformers
|
| 3 |
+
license: mit
|
| 4 |
+
pipeline_tag: text-classification
|
| 5 |
+
tags:
|
| 6 |
+
- peptide
|
| 7 |
+
- protein
|
| 8 |
+
- bioinformatics
|
| 9 |
+
- biology
|
| 10 |
---
|
| 11 |
|
| 12 |
# PDeepPP: A Comprehensive Protein Language Model Hub
|
|
|
|
| 15 |
|
| 16 |
PDeepPP is a hybrid protein language model designed to predict post-translational modification (PTM) sites, analyze biologically relevant features, and support a wide range of protein sequence analysis tasks. This repository serves as the central hub for accessing and exploring various specialized PDeepPP models, each fine-tuned for specific tasks, such as PTM site prediction, bioactivity analysis, and more.
|
| 17 |
|
| 18 |
+
The models in this repository can all be used on their corresponding datasets on GitHub ([https://github.com/fondress/PDeepPP])
|
| 19 |
|
| 20 |
---
|
| 21 |
|
|
|
|
| 32 |
- **Flexible Architecture**: Combines self-attention and convolutional operations for robust feature extraction.
|
| 33 |
- **Task-Specific Models**: Includes pre-trained models for PTM prediction, bioactivity classification, and more.
|
| 34 |
- **Dataset Support**: Models are validated on datasets such as PTM and BPS, ensuring performance on real-world tasks.
|
| 35 |
+
- **Extensibility**: Users can fine-tune the model on custom datasets for new tasks.
|
| 36 |
|
| 37 |
---
|
| 38 |
|
|
|
|
| 125 |
|
| 126 |
If you use any of the PDeepPP models in your research, please cite the associated paper:
|
| 127 |
|
| 128 |
+
```bibtex
|
| 129 |
+
@article{pdeeppp2025peptide,
|
| 130 |
title={A general language model for peptide identification},
|
| 131 |
+
author={{PDeepPP Authors}},
|
| 132 |
+
journal={arXiv preprint arXiv:2502.15610},
|
| 133 |
+
year={2025},
|
| 134 |
+
url={https://arxiv.org/abs/2502.15610}
|
| 135 |
}
|
| 136 |
```
|
| 137 |
|