Spaces:
Sleeping
Sleeping
Update templates/about.html
Browse files- templates/about.html +70 -67
templates/about.html
CHANGED
|
@@ -5,73 +5,76 @@
|
|
| 5 |
<section class="card about-flex">
|
| 6 |
|
| 7 |
<div class="about-text">
|
| 8 |
-
|
| 9 |
-
|
| 10 |
-
|
| 11 |
-
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
|
| 22 |
-
|
| 23 |
-
|
| 24 |
-
|
| 25 |
-
|
| 26 |
-
|
| 27 |
-
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
|
| 35 |
-
<
|
| 36 |
-
|
| 37 |
-
|
| 38 |
-
|
| 39 |
-
|
| 40 |
-
|
| 41 |
-
|
| 42 |
-
|
| 43 |
-
|
| 44 |
-
|
| 45 |
-
|
| 46 |
-
|
| 47 |
-
|
| 48 |
-
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
<
|
| 53 |
-
|
| 54 |
-
|
| 55 |
-
|
| 56 |
-
|
| 57 |
-
<
|
| 58 |
-
|
| 59 |
-
<
|
| 60 |
-
|
| 61 |
-
|
| 62 |
-
|
| 63 |
-
|
| 64 |
-
|
| 65 |
-
|
| 66 |
-
|
| 67 |
-
|
| 68 |
-
|
| 69 |
-
|
| 70 |
-
|
| 71 |
-
|
| 72 |
-
|
| 73 |
-
|
| 74 |
-
|
|
|
|
|
|
|
|
|
|
| 75 |
|
| 76 |
</section>
|
| 77 |
</div>
|
|
|
|
| 5 |
<section class="card about-flex">
|
| 6 |
|
| 7 |
<div class="about-text">
|
| 8 |
+
<h2>About CANLoc</h2>
|
| 9 |
+
|
| 10 |
+
<p>
|
| 11 |
+
CANLoc is a machine-learning system designed to predict the subcellular
|
| 12 |
+
localization of proteins directly from the protein sequence. It combines
|
| 13 |
+
transformer-based embeddings from the <b>ESM2</b> model
|
| 14 |
+
with an optimized <b>XGBoost</b> classifier trained on curated
|
| 15 |
+
protein datasets.
|
| 16 |
+
</p>
|
| 17 |
+
|
| 18 |
+
<h3>Performance & Evaluation</h3>
|
| 19 |
+
|
| 20 |
+
<p>
|
| 21 |
+
CANLoc achieves high accuracy, precision, recall, and F1-scores across all
|
| 22 |
+
classes. We additionally validate the model using:
|
| 23 |
+
</p>
|
| 24 |
+
|
| 25 |
+
<ul>
|
| 26 |
+
<li>Train/test split evaluation</li>
|
| 27 |
+
<li>10-fold stratified cross-validation</li>
|
| 28 |
+
<li>ROC curves for each class</li>
|
| 29 |
+
<li>Sensitivity and specificity analysis</li>
|
| 30 |
+
</ul>
|
| 31 |
+
|
| 32 |
+
<p>
|
| 33 |
+
These evaluations confirm that CANLoc predictions are reliable for academic
|
| 34 |
+
and research workflows.
|
| 35 |
+
</p>
|
| 36 |
+
|
| 37 |
+
<h3>Intended Use</h3>
|
| 38 |
+
|
| 39 |
+
<ul>
|
| 40 |
+
<li>Functional protein studies</li>
|
| 41 |
+
<li>Localization-oriented drug delivery strategy</li>
|
| 42 |
+
</ul>
|
| 43 |
+
|
| 44 |
+
<h3>Model Strengths</h3>
|
| 45 |
+
<ul>
|
| 46 |
+
<li>Fast and scalable for single or batch prediction</li>
|
| 47 |
+
<li>Transformer embeddings provide rich biological context</li>
|
| 48 |
+
<li>High accuracy with interpretable confidence metrics</li>
|
| 49 |
+
<li>No alignment or preprocessing required beyond the raw sequence</li>
|
| 50 |
+
</ul>
|
| 51 |
+
|
| 52 |
+
<h3>Limitations</h3>
|
| 53 |
+
<ul>
|
| 54 |
+
<li>Performance depends on sequence length and quality</li>
|
| 55 |
+
<li>Ambiguous sequences may reduce confidence</li>
|
| 56 |
+
<li>Designed for four major classes only</li>
|
| 57 |
+
</ul>
|
| 58 |
+
|
| 59 |
+
<p>
|
| 60 |
+
CANLoc represents a balance between modern deep learning and classical
|
| 61 |
+
machine learning methods, producing a system that is both
|
| 62 |
+
<b>reliable</b> and <b>lightweight enough to deploy</b>
|
| 63 |
+
in real-world web applications.
|
| 64 |
+
</p>
|
| 65 |
+
</div>
|
| 66 |
+
|
| 67 |
+
<div class="about-image">
|
| 68 |
+
<figure>
|
| 69 |
+
<img src="{{ url_for('static', filename='cell_diagram.png') }}"
|
| 70 |
+
alt="Eukaryotic cell diagram showing Subcellular Location">
|
| 71 |
+
<figcaption>
|
| 72 |
+
<strong>Figure 1.</strong> Schematic representation of a cell
|
| 73 |
+
highlighting major subcellular locations relevant to protein
|
| 74 |
+
localization prediction.
|
| 75 |
+
</figcaption>
|
| 76 |
+
</figure>
|
| 77 |
+
</div>
|
| 78 |
|
| 79 |
</section>
|
| 80 |
</div>
|