---
title: CRISPR Array Detection
emoji: 🧬
colorFrom: gray
colorTo: gray
sdk: docker
pinned: false
license: mit
short_description: Detect CRISPR arrays in DNA sequences
---
# crispr-detect

BERT-based CRISPR array detection in prokaryotic genomes.

## Model

| | |
|---|---|
| architecture | BERT, 24 layers, 768 hidden, 430M params |
| input | DNA sequence (min 1000 bp) |
| output | per-position probability (0-1) |
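
The input and output rows above can be illustrated with a small sketch. It assumes the input is checked for alphabet and minimum length before inference, and that the per-position probabilities are turned into regions by thresholding. The helper names, the `ACGTN` alphabet, and the 0.5 threshold are illustrative assumptions, not the Space's actual code.

```python
MIN_LENGTH = 1000  # minimum input length stated in the model table


def validate_sequence(seq, allowed="ACGTN"):
    """Normalize a DNA string and check alphabet and length (hypothetical helper)."""
    # drop whitespace and line breaks, then upper-case
    cleaned = "".join(seq.split()).upper()
    bad = set(cleaned) - set(allowed)
    if bad:
        raise ValueError(f"unexpected characters: {sorted(bad)}")
    if len(cleaned) < MIN_LENGTH:
        raise ValueError(f"sequence has {len(cleaned)} bp, need at least {MIN_LENGTH}")
    return cleaned


def call_regions(probs, threshold=0.5):
    """Turn per-position probabilities into half-open (start, end) intervals."""
    regions, start = [], None
    for i, p in enumerate(probs):
        if p >= threshold and start is None:
            start = i  # a region opens here
        elif p < threshold and start is not None:
            regions.append((start, i))  # region closes before position i
            start = None
    if start is not None:
        regions.append((start, len(probs)))  # region runs to the end
    return regions
```

For example, `call_regions([0.1, 0.9, 0.8, 0.2, 0.7])` yields two intervals, one covering positions 1 to 2 and one covering position 4.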
## Deployment

### Push changes

```bash
# local clone of the Space repository; adjust the path to your checkout
cd /vol/hpcprojects/pmuench/crispr_tool/crispr-hf-space
git add -A
git commit -m "description"
# pushing to the Space remote triggers a rebuild of the Docker image
git push
```
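### Git credentials (first time)

```bash
# store credentials on disk so git push does not prompt each time
# (note: the store helper saves them unencrypted in ~/.git-credentials)
git config --global credential.helper store
huggingface-cli login
# paste a token with write access from https://huggingface.co/settings/tokens
```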
### Clone fresh

```bash
git clone https://huggingface.co/spaces/genomenet/crispr-array-detection
```

### Space settings (HuggingFace web UI)

- SDK: Docker
- Hardware: CPU Basic works for the default demo; T4 GPU is recommended for long sequences or low stride values
- Visibility: Public
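
The hardware note can be made concrete with a rough sketch of why long sequences and low stride values cost more compute. It assumes the model is run over overlapping windows, one forward pass per window. The window size of 1000 (taken from the minimum input length) and the function name are assumptions for illustration; the Space's actual windowing may differ.

```python
def window_starts(seq_len, window=1000, stride=100):
    """Start offsets of sliding windows covering a sequence (hypothetical helper)."""
    if stride < 1:
        raise ValueError("stride must be at least 1")
    if seq_len < window:
        raise ValueError("sequence is shorter than one window")
    starts = list(range(0, seq_len - window + 1, stride))
    # add a final window so the tail of the sequence is always covered
    if starts[-1] != seq_len - window:
        starts.append(seq_len - window)
    return starts


# number of forward passes for a 10 kb sequence at different strides
for stride in (1000, 100, 10):
    print(stride, len(window_starts(10_000, stride=stride)))
```

Under these assumptions the number of windows grows roughly as sequence length divided by stride, which is why a GPU helps once the stride gets small.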
### Model weights

Hosted at https://huggingface.co/genomenet/crispr-bert-model and downloaded automatically via `huggingface_hub` at startup.
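
A minimal sketch of what such a startup download could look like, assuming the whole model repository is fetched with `snapshot_download` from `huggingface_hub`. The function name `download_weights` is hypothetical, and the real `model_loader.py` may fetch individual files instead.

```python
from typing import Optional

REPO_ID = "genomenet/crispr-bert-model"  # model repository named in this README


def download_weights(cache_dir: Optional[str] = None) -> str:
    """Fetch all files of the model repo and return the local directory path."""
    # imported lazily so this module can be imported without the dependency
    from huggingface_hub import snapshot_download

    # files are cached locally, so later startups reuse the existing download
    return snapshot_download(repo_id=REPO_ID, cache_dir=cache_dir)
```

Calling `download_weights()` once at application startup would make the weights available before the first request; it needs network access on the first run.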
## Local dev

```bash
pip install -r requirements.txt
python app.py
# the app is then served at http://localhost:7860
```
## Files

```
├── app.py # gradio app
├── inference/
│   ├── model_loader.py # model download
│   ├── tokenizer.py # sequence validation
│   └── inference.py # prediction
├── Dockerfile
└── requirements.txt
```
## Acknowledgements

- Ziyu Mu (HZI BIFO)
- DFG SPP 2141 (MC 172)
- BMBF GenomeNet