Update README.md
Browse files
README.md
CHANGED
|
@@ -5,14 +5,7 @@ tags:
|
|
| 5 |
# Model description
|
| 6 |
**MHC-II-EpiPred** (MHC-II-EpiPred, MHC II molecular epitope prediction) is a protein language model fine-tuned from [**ESM2**](https://github.com/facebookresearch/esm) pretrained model [(***facebook/esm2_t33_650M_UR50D***)](https://huggingface.co/facebook/esm2_t33_650M_UR50D) on a T cell MHC II epitope dataset.
|
| 7 |
|
| 8 |
-
**MHC-II-EpiPred** is a classification model for predicting the class of MHC II epitope.
|
| 9 |
-
# Results
|
| 10 |
-
**MHC-II-EpiPred** achieved the following results:
|
| 11 |
-
Training Loss (mse): 0.1407
|
| 12 |
-
Training Accuracy: 0.9898
|
| 13 |
-
Evaluation Loss (mse): 0.0836
|
| 14 |
-
Evaluation Accuracy: 0.9703
|
| 15 |
-
Epochs: 324
|
| 16 |
|
| 17 |
# The dataset for training **MHC-II-EpiPred**
|
| 18 |
The original data was downloaded from IEDB data base at https://www.iedb.org/home_v3.php.
|
|
@@ -20,6 +13,15 @@ The full data can be downloaded at https://www.iedb.org/downloader.php?file_name
|
|
| 20 |
This dataset comprises 543,717 T-cell epitope entries, spanning a variety of species and infections caused by diverse viruses. The epitope information included encompasses a broad range of potential sources, including data relevant to disease immunotherapy.
|
| 21 |
|
| 22 |
Finally, the dataset we used to train the model contains 60,256 positive and negative samples, which is stored in https://github.com/pengsihua2023/MHC-II-EpiPred/tree/main/data.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
# Model training code at GitHub
|
| 24 |
https://github.com/pengsihua2023/MHC-II-EpiPred
|
| 25 |
|
|
|
|
| 5 |
# Model description
|
| 6 |
**MHC-II-EpiPred** (MHC-II-EpiPred, MHC II molecular epitope prediction) is a protein language model fine-tuned from [**ESM2**](https://github.com/facebookresearch/esm) pretrained model [(***facebook/esm2_t33_650M_UR50D***)](https://huggingface.co/facebook/esm2_t33_650M_UR50D) on a T cell MHC II epitope dataset.
|
| 7 |
|
| 8 |
+
**MHC-II-EpiPred** is a classification model for predicting the class of MHC II epitope.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
|
| 10 |
# The dataset for training **MHC-II-EpiPred**
|
| 11 |
The original data was downloaded from IEDB data base at https://www.iedb.org/home_v3.php.
|
|
|
|
| 13 |
This dataset comprises 543,717 T-cell epitope entries, spanning a variety of species and infections caused by diverse viruses. The epitope information included encompasses a broad range of potential sources, including data relevant to disease immunotherapy.
|
| 14 |
|
| 15 |
Finally, the dataset we used to train the model contains 60,256 positive and negative samples, which is stored in https://github.com/pengsihua2023/MHC-II-EpiPred/tree/main/data.
|
| 16 |
+
|
| 17 |
+
# Results
|
| 18 |
+
**MHC-II-EpiPred** achieved the following results:
|
| 19 |
+
Training Loss (mse): 0.1407
|
| 20 |
+
Training Accuracy: 0.9898
|
| 21 |
+
Evaluation Loss (mse): 0.0836
|
| 22 |
+
Evaluation Accuracy: 0.9703
|
| 23 |
+
Epochs: 324
|
| 24 |
+
|
| 25 |
# Model training code at GitHub
|
| 26 |
https://github.com/pengsihua2023/MHC-II-EpiPred
|
| 27 |
|