Update README.md
Browse files
README.md
CHANGED
|
@@ -179,50 +179,59 @@ pip3 install torch torchvision torchaudio
|
|
| 179 |
|
| 180 |
|
| 181 |
|
|
|
|
| 182 |
|
|
|
|
|
|
|
|
|
|
| 183 |
|
| 184 |
-
|
| 185 |
-
|
| 186 |
-
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 187 |
-
|
| 188 |
-
### Direct Use
|
| 189 |
-
|
| 190 |
-
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
| 191 |
-
|
| 192 |
-
[More Information Needed]
|
| 193 |
-
|
| 194 |
-
|
| 195 |
-
### Recommendations
|
| 196 |
-
|
| 197 |
-
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
| 198 |
-
|
| 199 |
-
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
| 200 |
-
|
| 201 |
-
|
| 202 |
|
| 203 |
|
| 204 |
|
|
|
|
| 205 |
|
|
|
|
| 206 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 207 |
|
| 208 |
-
|
|
|
|
| 209 |
|
| 210 |
-
|
|
|
|
|
|
|
| 211 |
|
| 212 |
-
**BibTeX:**
|
| 213 |
|
| 214 |
-
|
|
|
|
|
|
|
|
|
|
| 215 |
|
| 216 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 217 |
|
| 218 |
-
|
| 219 |
|
|
|
|
| 220 |
|
|
|
|
| 221 |
|
| 222 |
|
| 223 |
|
| 224 |
|
| 225 |
-
- **Developed by:**
|
|
|
|
|
|
|
| 226 |
- **Funded by [optional]:** [NIH]
|
| 227 |
- **Shared by [optional]:** [More Information Neede]
|
| 228 |
- **Model type:** [Text classication]
|
|
@@ -230,4 +239,4 @@ Users (both direct and downstream) should be made aware of the risks, biases and
|
|
| 230 |
- **License:** [MIT]
|
| 231 |
- **Finetuned from model [optional]:** [ESM-2 150M]
|
| 232 |
|
| 233 |
-
[More Information Needed]
|
|
|
|
| 179 |
|
| 180 |
|
| 181 |
|
| 182 |
+
We provided code to test CLASPP (see section below)
|
| 183 |
|
| 184 |
+
|
| 185 |
+
:tada: you are know ready to run the code :tada:
|
| 186 |
+
|
| 187 |
|
| 188 |
+
Use the code below to get started with the model.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 189 |
|
| 190 |
|
| 191 |
|
| 192 |
+
## Uses
|
| 193 |
|
| 194 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 195 |
|
| 196 |
+
### Direct Use
|
| 197 |
+
```
|
| 198 |
+
Usage: python3 claspp_forward.py [OPTION]... --input INPUT [FASTA_FILE or TXT_FILE]...
|
| 199 |
+
predict PTM events on peptides or full sequences
|
| 200 |
|
| 201 |
+
Example 1: python3 claspp_forward.py -B 100 -S 0 -i random.txt
|
| 202 |
+
Example 2: python3 claspp_forward.py -B 50 -S 1 -i random.fasta
|
| 203 |
|
| 204 |
+
FASTA_FILE contain protein sequences in proper fasta or a2m format
|
| 205 |
+
TXT_FILE cointain protien peptides 21 in length with the center
|
| 206 |
+
residue being the PTM modification site
|
| 207 |
|
|
|
|
| 208 |
|
| 209 |
+
Pattern selection and interpretation:
|
| 210 |
+
-B, --batch_size (int) that describes how many predictions
|
| 211 |
+
can be predicted at a time on the GPU
|
| 212 |
+
(reduce if you get run out of GPU space)
|
| 213 |
|
| 214 |
+
-S --scrape_fasta (int) should be a 1 or a 0
|
| 215 |
+
1 = read a fasta and scrape posible 21 peptides
|
| 216 |
+
that can be modified by a PTM
|
| 217 |
+
0 = read a txt file that has the 21mer already
|
| 218 |
+
sperated and all peptides should be sperated by
|
| 219 |
+
a '\\n' (can be faster) than fasta option
|
| 220 |
+
|
| 221 |
+
-h --help your reading it right now
|
| 222 |
|
| 223 |
+
-i --input location of the input fasta or txt
|
| 224 |
|
| 225 |
+
-o --output location of the output csv
|
| 226 |
|
| 227 |
+
```
|
| 228 |
|
| 229 |
|
| 230 |
|
| 231 |
|
| 232 |
+
- **Developed by:** Major author for most code Nathan Gravel. Finetuning code inspried by Zhongliang Zhou,
|
| 233 |
+
- Contrastive learing code inspried by Ruili Fang, Codebase testing and verstion controle by Austin Downes,
|
| 234 |
+
- Webtool dev Saber Soleymani
|
| 235 |
- **Funded by [optional]:** [NIH]
|
| 236 |
- **Shared by [optional]:** [More Information Neede]
|
| 237 |
- **Model type:** [Text classication]
|
|
|
|
| 239 |
- **License:** [MIT]
|
| 240 |
- **Finetuned from model [optional]:** [ESM-2 150M]
|
| 241 |
|
| 242 |
+
[More Information Needed]
|