sihuapeng commited on
Commit
e6f4ace
·
verified ·
1 Parent(s): cbd5ac0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -7,16 +7,16 @@ tags:
7
  pipeline_tag: text-classification
8
  ---
9
  # Model descriptions
10
- PPPSL-ESM2(PPPSL, Prediction of prokaryotic protein subcellular localization) is a protein language model fine-tuned from ESM2 pretrained model [facebook/esm2_t36_3B_UR50D](https://huggingface.co/facebook/esm2_t36_3B_UR50D) on a prokaryotic protein subcellular localization dataset. It achieves the following results on the evaluation set:
11
  Train Loss: 0.0015
12
  Train Accuracy: 0.9893
13
  Validation Loss: 0.0155
14
  Validation Accuracy: 0.9702
15
  Epoch: 20
16
- # The dataset for training PPPSL-ESM2
17
  The full dataset contains 11,970 protein sequences, including Cellwall (87), Cytoplasmic (6,905), CYtoplasmic Membrane (2,567), Extracellular (1,085), Outer Membrane (758), and Periplasmic (568).
18
  The highly imbalanced sample sizes across the six categories in this dataset pose a significant challenge for classification.
19
- # How to use
20
 
21
  ### An example
22
  Pytorch and transformers libraries should be installed in your system.
 
7
  pipeline_tag: text-classification
8
  ---
9
  # Model descriptions
10
+ **PPPSL-ESM2**(PPPSL, Prediction of prokaryotic protein subcellular localization) is a protein language model fine-tuned from ESM2 pretrained model [facebook/esm2_t36_3B_UR50D](https://huggingface.co/facebook/esm2_t36_3B_UR50D) on a prokaryotic protein subcellular localization dataset. It achieves the following results:
11
  Train Loss: 0.0015
12
  Train Accuracy: 0.9893
13
  Validation Loss: 0.0155
14
  Validation Accuracy: 0.9702
15
  Epoch: 20
16
+ # The dataset for training **PPPSL-ESM2**
17
  The full dataset contains 11,970 protein sequences, including Cellwall (87), Cytoplasmic (6,905), CYtoplasmic Membrane (2,567), Extracellular (1,085), Outer Membrane (758), and Periplasmic (568).
18
  The highly imbalanced sample sizes across the six categories in this dataset pose a significant challenge for classification.
19
+ # How to use **PPPSL-ESM2**
20
 
21
  ### An example
22
  Pytorch and transformers libraries should be installed in your system.