Files changed (1) hide show
  1. README.md +130 -3
README.md CHANGED
@@ -1,3 +1,130 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: pytorch
3
+ framework: pytorch
4
+ tags:
5
+ - pytorch
6
+ - pytorch-lightning
7
+ - bioinformatics
8
+ - rna-binding-proteins
9
+ - explainability
10
+ - alternative-splicing
11
+ - deep-learning
12
+ license: mit
13
+ ---
14
+
15
+ # DeepRBP Predictor (pretrained)
16
+
17
+ This repository provides a **pretrained DeepRBP predictor model**, a deep learning framework designed to infer **RNA-binding protein (RBP)–transcript and RBP–gene regulatory relationships** from expression data.
18
+
19
+ DeepRBP was introduced in the following preprint:
20
+
21
+ > **DeepRBP: A deep neural network for inferring splicing regulation**
22
+ > https://doi.org/10.1101/2024.04.11.589004
23
+
24
+ The model is intended to be used **directly for inference and explainability**, without retraining.
25
+
26
+ ---
27
+
28
+ ## Model overview
29
+
30
+ DeepRBP is composed of two conceptual stages:
31
+
32
+ 1. **Prediction stage**
33
+ A neural network predicts transcript abundances from:
34
+ - RBP expression
35
+ - Gene expression
36
+
37
+ 2. **Explainability stage**
38
+ Feature attribution methods (e.g. DeepLIFT) are applied on the trained predictor to compute:
39
+ - Transcript × RBP (TxRBP) scores
40
+ - Gene × RBP (GxRBP) scores
41
+
42
+ This repository contains **only the pretrained predictor and its required preprocessing artifacts**.
43
+
44
+ ---
45
+
46
+ ## Files in this repository
47
+
48
+ ⚠️ **All files are required for correct inference and explainability.**
49
+
50
+ | File | Description |
51
+ |-----|-------------|
52
+ | `model.ckpt` | PyTorch Lightning checkpoint of the pretrained DeepRBP predictor |
53
+ | `scaler.joblib` | Fitted input scaler used during model training |
54
+ | `sigma.npy` | Scaling parameter required to reconstruct transcript abundance values |
55
+
56
+ The scaler and sigma are **part of the trained model state** and must be used together with the checkpoint.
57
+
58
+ ---
59
+
60
+ ## Intended use
61
+
62
+ This pretrained model is intended for:
63
+
64
+ - Computing transcript abundance predictions
65
+ - Running explainability analyses (e.g. DeepLIFT-based attribution)
66
+ - Identifying candidate RBP–transcript and RBP–gene regulatory relationships
67
+ - Downstream biological interpretation and hypothesis generation
68
+
69
+ Typical applications include:
70
+ - Cancer transcriptomics (e.g. TCGA)
71
+ - Perturbation studies (e.g. RBP knockdowns)
72
+ - Comparative regulatory analyses across conditions
73
+
74
+ ---
75
+
76
+ ## Usage
77
+
78
+ This repository **does not provide a standalone inference script**.
79
+
80
+ Please refer to the **main DeepRBP code repository** for:
81
+ - Data preprocessing
82
+ - Model loading
83
+ - Running prediction and explainability pipelines
84
+
85
+ 👉 **Main repository:**
86
+ https://github.com/ML4BM-Lab/DeepRBP
87
+
88
+ The main repository contains:
89
+ - End-to-end examples
90
+ - Command-line interfaces
91
+ - Explainability workflows
92
+ - Validation pipelines
93
+
94
+ ---
95
+
96
+ ## Reproducibility notes
97
+
98
+ - The model was trained on public datasets (TCGA, GTEx and related resources).
99
+ - The provided scaler and sigma ensure:
100
+ - Consistent input normalization
101
+ - Comparable predictions and explainability scores across users
102
+ - Using a different scaler or recomputing normalization **will break comparability**.
103
+
104
+ ---
105
+
106
+ ## Limitations
107
+
108
+ - The model was trained on bulk RNA-seq data and may not generalize to:
109
+ - Single-cell RNA-seq
110
+ - Extremely low-coverage datasets
111
+ - Predictions represent **associations**, not direct causal regulation.
112
+ - Experimental validation is required before biological conclusions.
113
+
114
+ ---
115
+
116
+ ## License
117
+
118
+ This model is released under the **MIT License**.
119
+
120
+ You are free to use, modify and redistribute it, provided that the license and copyright notice are preserved.
121
+
122
+ ---
123
+
124
+ ## Citation
125
+
126
+ If you use DeepRBP in your work, please cite:
127
+
128
+ DeepRBP: A deep neural network for inferring splicing regulation
129
+ bioRxiv (2024)
130
+ https://doi.org/10.1101/2024.04.11.589004