genomenet's picture
Set space emoji to 🧬 to override default rocket icon
a3d14e7
---
title: CRISPR Array Detection
emoji: 🧬
colorFrom: gray
colorTo: gray
sdk: docker
pinned: false
license: mit
short_description: Detect CRISPR arrays in DNA sequences
---
# crispr-detect
BERT-based CRISPR array detection in prokaryotic genomes.
## Model
| | |
|---|---|
| architecture | BERT, 24 layers, 768 hidden, 430M params |
| input | DNA sequence (min 1000 bp) |
| output | per-position probability (0-1) |
## Deployment
### Push changes
```bash
cd /vol/hpcprojects/pmuench/crispr_tool/crispr-hf-space
git add -A
git commit -m "description"
git push
```
### Git credentials (first time)
```bash
git config --global credential.helper store
huggingface-cli login
# paste token from https://huggingface.co/settings/tokens
```
### Clone fresh
```bash
git clone https://huggingface.co/spaces/genomenet/crispr-array-detection
```
### Space settings (HuggingFace web UI)
- SDK: Docker
- Hardware: CPU Basic works for the default demo; T4 GPU is recommended for long sequences or low stride values
- Visibility: Public
### Model weights
Hosted at: https://huggingface.co/genomenet/crispr-bert-model
Downloaded automatically via `huggingface_hub` at startup.
## Local dev
```bash
pip install -r requirements.txt
python app.py
# http://localhost:7860
```
## Files
```
β”œβ”€β”€ app.py # gradio app
β”œβ”€β”€ inference/
β”‚ β”œβ”€β”€ model_loader.py # model download
β”‚ β”œβ”€β”€ tokenizer.py # sequence validation
β”‚ └── inference.py # prediction
β”œβ”€β”€ Dockerfile
└── requirements.txt
```
## Acknowledgements
- Ziyu Mu (HZI BIFO)
- DFG SPP 2141 (MC 172)
- BMBF GenomeNet