Biocoder09 commited on
Commit
e477b98
·
verified ·
1 Parent(s): ab3333c

Update templates/about.html

Browse files
Files changed (1) hide show
  1. templates/about.html +70 -67
templates/about.html CHANGED
@@ -5,73 +5,76 @@
5
  <section class="card about-flex">
6
 
7
  <div class="about-text">
8
- <h2>About CANLoc</h2>
9
-
10
- <p>
11
- CANLoc is a machine-learning system designed to predict the subcellular
12
- localization of proteins directly from the protein sequence. It combines
13
- transformer-based embeddings from the <b>ESM2</b> model
14
- with an optimized <b>XGBoost</b> classifier trained on curated
15
- protein datasets.
16
- </p>
17
- </div>
18
-
19
- <div class="about-image">
20
- <img src="{{ url_for('static', filename=cell_diagram.png') }}"
21
- alt="cell diagram showing Subcellular Location">
22
- </div>
23
-
24
-
25
- <h3>Performance & Evaluation</h3>
26
-
27
- <p>
28
- CANLoc achieves high accuracy, precision, recall, and F1-scores across all
29
- classes. We additionally validate the model using:
30
- </p>
31
-
32
- <ul>
33
- <li>Train/test split evaluation</li>
34
- <li>10-fold stratified cross-validation</li>
35
- <li>ROC curves for each class</li>
36
- <li>Sensitivity and specificity analysis</li>
37
- </ul>
38
-
39
- <p>
40
- These evaluations confirm that CANLoc predictions are reliable for academic
41
- and research workflows.
42
- </p>
43
-
44
- <h3>Intended Use</h3>
45
-
46
- <p>
47
- CANLoc is designed for:
48
- </p>
49
-
50
- <ul>
51
- <li>Functional protein studies</li>
52
- <li>Localization-oriented drug delivery strategy</li>
53
- </ul>
54
-
55
- <h3>Model Strengths</h3>
56
- <ul>
57
- <li>Fast and scalable for single or batch prediction</li>
58
- <li>Transformer embeddings provide rich biological context</li>
59
- <li>High accuracy with interpretable confidence metrics</li>
60
- <li>No alignment or preprocessing required beyond the raw sequence</li>
61
- </ul>
62
-
63
- <h3>Limitations</h3>
64
- <ul>
65
- <li>Performance depends on sequence length and quality</li>
66
- <li>Ambiguous sequences may reduce confidence</li>
67
- <li>Designed for four major classes only</li>
68
- </ul>
69
-
70
- <p>
71
- CANLoc represents a balance between modern deep learning and classical machine
72
- learning methods, producing a system that is both <b>reliable</b> and
73
- <b>lightweight enough to deploy</b> in real-world web applications.
74
- </p>
 
 
 
75
 
76
  </section>
77
  </div>
 
5
  <section class="card about-flex">
6
 
7
  <div class="about-text">
8
+ <h2>About CANLoc</h2>
9
+
10
+ <p>
11
+ CANLoc is a machine-learning system designed to predict the subcellular
12
+ localization of proteins directly from the protein sequence. It combines
13
+ transformer-based embeddings from the <b>ESM2</b> model
14
+ with an optimized <b>XGBoost</b> classifier trained on curated
15
+ protein datasets.
16
+ </p>
17
+
18
+ <h3>Performance & Evaluation</h3>
19
+
20
+ <p>
21
+ CANLoc achieves high accuracy, precision, recall, and F1-scores across all
22
+ classes. We additionally validate the model using:
23
+ </p>
24
+
25
+ <ul>
26
+ <li>Train/test split evaluation</li>
27
+ <li>10-fold stratified cross-validation</li>
28
+ <li>ROC curves for each class</li>
29
+ <li>Sensitivity and specificity analysis</li>
30
+ </ul>
31
+
32
+ <p>
33
+ These evaluations confirm that CANLoc predictions are reliable for academic
34
+ and research workflows.
35
+ </p>
36
+
37
+ <h3>Intended Use</h3>
38
+
39
+ <ul>
40
+ <li>Functional protein studies</li>
41
+ <li>Localization-oriented drug delivery strategy</li>
42
+ </ul>
43
+
44
+ <h3>Model Strengths</h3>
45
+ <ul>
46
+ <li>Fast and scalable for single or batch prediction</li>
47
+ <li>Transformer embeddings provide rich biological context</li>
48
+ <li>High accuracy with interpretable confidence metrics</li>
49
+ <li>No alignment or preprocessing required beyond the raw sequence</li>
50
+ </ul>
51
+
52
+ <h3>Limitations</h3>
53
+ <ul>
54
+ <li>Performance depends on sequence length and quality</li>
55
+ <li>Ambiguous sequences may reduce confidence</li>
56
+ <li>Designed for four major classes only</li>
57
+ </ul>
58
+
59
+ <p>
60
+ CANLoc represents a balance between modern deep learning and classical
61
+ machine learning methods, producing a system that is both
62
+ <b>reliable</b> and <b>lightweight enough to deploy</b>
63
+ in real-world web applications.
64
+ </p>
65
+ </div>
66
+
67
+ <div class="about-image">
68
+ <figure>
69
+ <img src="{{ url_for('static', filename='cell_diagram.png') }}"
70
+ alt="Eukaryotic cell diagram showing Subcellular Location">
71
+ <figcaption>
72
+ <strong>Figure 1.</strong> Schematic representation of a cell
73
+ highlighting major subcellular locations relevant to protein
74
+ localization prediction.
75
+ </figcaption>
76
+ </figure>
77
+ </div>
78
 
79
  </section>
80
  </div>