Update README.md
Browse files
README.md
CHANGED
|
@@ -7,16 +7,16 @@ tags:
|
|
| 7 |
pipeline_tag: text-classification
|
| 8 |
---
|
| 9 |
# Model descriptions
|
| 10 |
-
PPPSL-ESM2(PPPSL, Prediction of prokaryotic protein subcellular localization) is a protein language model fine-tuned from ESM2 pretrained model [facebook/esm2_t36_3B_UR50D](https://huggingface.co/facebook/esm2_t36_3B_UR50D) on a prokaryotic protein subcellular localization dataset. It achieves the following results
|
| 11 |
Train Loss: 0.0015
|
| 12 |
Train Accuracy: 0.9893
|
| 13 |
Validation Loss: 0.0155
|
| 14 |
Validation Accuracy: 0.9702
|
| 15 |
Epoch: 20
|
| 16 |
-
# The dataset for training PPPSL-ESM2
|
| 17 |
The full dataset contains 11,970 protein sequences, including Cellwall (87), Cytoplasmic (6,905), CYtoplasmic Membrane (2,567), Extracellular (1,085), Outer Membrane (758), and Periplasmic (568).
|
| 18 |
The highly imbalanced sample sizes across the six categories in this dataset pose a significant challenge for classification.
|
| 19 |
-
# How to use
|
| 20 |
|
| 21 |
### An example
|
| 22 |
Pytorch and transformers libraries should be installed in your system.
|
|
|
|
| 7 |
pipeline_tag: text-classification
|
| 8 |
---
|
| 9 |
# Model descriptions
|
| 10 |
+
**PPPSL-ESM2**(PPPSL, Prediction of prokaryotic protein subcellular localization) is a protein language model fine-tuned from ESM2 pretrained model [facebook/esm2_t36_3B_UR50D](https://huggingface.co/facebook/esm2_t36_3B_UR50D) on a prokaryotic protein subcellular localization dataset. It achieves the following results:
|
| 11 |
Train Loss: 0.0015
|
| 12 |
Train Accuracy: 0.9893
|
| 13 |
Validation Loss: 0.0155
|
| 14 |
Validation Accuracy: 0.9702
|
| 15 |
Epoch: 20
|
| 16 |
+
# The dataset for training **PPPSL-ESM2**
|
| 17 |
The full dataset contains 11,970 protein sequences, including Cellwall (87), Cytoplasmic (6,905), CYtoplasmic Membrane (2,567), Extracellular (1,085), Outer Membrane (758), and Periplasmic (568).
|
| 18 |
The highly imbalanced sample sizes across the six categories in this dataset pose a significant challenge for classification.
|
| 19 |
+
# How to use **PPPSL-ESM2**
|
| 20 |
|
| 21 |
### An example
|
| 22 |
Pytorch and transformers libraries should be installed in your system.
|