Upload model_info.txt with huggingface_hub
Browse files- model_info.txt +56 -0
model_info.txt
ADDED
|
@@ -0,0 +1,56 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
DNA Mutation Pathogenicity Predictor - Model Information
|
| 3 |
+
=========================================================
|
| 4 |
+
|
| 5 |
+
Model Type: Convolutional Neural Network (CNN)
|
| 6 |
+
Parameters: 191,522
|
| 7 |
+
Input Dimension: 1101
|
| 8 |
+
Output: Binary (Pathogenic/Benign) + Importance Score
|
| 9 |
+
|
| 10 |
+
DATASET:
|
| 11 |
+
--------
|
| 12 |
+
Source: ClinVar (GRCh38)
|
| 13 |
+
Total Variants: 16,000
|
| 14 |
+
- Pathogenic: 8,000
|
| 15 |
+
- Benign: 8,000
|
| 16 |
+
|
| 17 |
+
Chromosomes: 1, 2, 3, 5, 6, 7, 11, 13, 17, 19, 22, X
|
| 18 |
+
|
| 19 |
+
PERFORMANCE:
|
| 20 |
+
-----------
|
| 21 |
+
Test Accuracy: 57.54%
|
| 22 |
+
Test AUC-ROC: 0.6166
|
| 23 |
+
Precision: 0.5705
|
| 24 |
+
Recall: 0.6100
|
| 25 |
+
F1 Score: 0.5896
|
| 26 |
+
|
| 27 |
+
ARCHITECTURE:
|
| 28 |
+
------------
|
| 29 |
+
Input: 1101-dim encoding
|
| 30 |
+
- Reference sequence: 99bp × 5 nucleotides = 495
|
| 31 |
+
- Mutated sequence: 99bp × 5 nucleotides = 495
|
| 32 |
+
- Difference mask: 99
|
| 33 |
+
- Mutation type: 12
|
| 34 |
+
|
| 35 |
+
Layers:
|
| 36 |
+
1. Conv1D (11→64) + BatchNorm + ReLU
|
| 37 |
+
2. Conv1D (64→128) + BatchNorm + ReLU
|
| 38 |
+
3. Conv1D (128→256) + BatchNorm + ReLU
|
| 39 |
+
4. Global Average Pooling
|
| 40 |
+
5. FC layers (256+32 → 128 → 64 → 1)
|
| 41 |
+
6. Importance head (256 → 1)
|
| 42 |
+
|
| 43 |
+
Outputs:
|
| 44 |
+
- Pathogenic logit (apply sigmoid for probability)
|
| 45 |
+
- Importance score (mutation position explainability)
|
| 46 |
+
|
| 47 |
+
TRAINING:
|
| 48 |
+
--------
|
| 49 |
+
Optimizer: Adam (lr=0.0005)
|
| 50 |
+
Loss: BCE + 0.1 × Contrastive Importance Loss
|
| 51 |
+
Epochs: 60
|
| 52 |
+
Batch Size: 32
|
| 53 |
+
Split: 70% train / 15% val / 15% test
|
| 54 |
+
|
| 55 |
+
Created: February 18, 2026
|
| 56 |
+
Framework: PyTorch
|