Spaces:
Sleeping
Sleeping
Commit ·
3255634
1
Parent(s): a3107bb
Deploy DeepAMR API backend
Browse filesFastAPI backend with deep learning AMR prediction:
- 11 drug classes, 84.3% Micro F1, 98.6% AUC
- FASTA/FASTQ file upload and prediction
- Bangladesh clinical guidelines (DGHS/IEDCR)
- PDF clinical report generation
- User auth, prediction history, admin dashboard
- Rate limiting and security hardening
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This view is limited to 50 files because it contains too many changes. See raw diff
- .gitignore +7 -0
- Dockerfile +30 -0
- README.md +15 -5
- data_processed/card/card_drug_class_X_test.npy +3 -0
- data_processed/card/card_drug_class_X_train.npy +3 -0
- data_processed/card/card_drug_class_X_val.npy +3 -0
- data_processed/card/card_drug_class_metadata.json +551 -0
- data_processed/card/card_drug_class_y_test.npy +3 -0
- data_processed/card/card_drug_class_y_train.npy +3 -0
- data_processed/card/card_drug_class_y_val.npy +3 -0
- data_processed/card/card_gene_family_X_test.npy +3 -0
- data_processed/card/card_gene_family_X_train.npy +3 -0
- data_processed/card/card_gene_family_X_val.npy +3 -0
- data_processed/card/card_gene_family_metadata.json +911 -0
- data_processed/card/card_gene_family_y_test.npy +3 -0
- data_processed/card/card_gene_family_y_train.npy +3 -0
- data_processed/card/card_gene_family_y_val.npy +3 -0
- data_processed/card/card_mechanism_X_test.npy +3 -0
- data_processed/card/card_mechanism_X_train.npy +3 -0
- data_processed/card/card_mechanism_X_val.npy +3 -0
- data_processed/card/card_mechanism_metadata.json +523 -0
- data_processed/card/card_mechanism_y_test.npy +3 -0
- data_processed/card/card_mechanism_y_train.npy +3 -0
- data_processed/card/card_mechanism_y_val.npy +3 -0
- data_processed/ncbi/ncbi_amr_X_test.npy +3 -0
- data_processed/ncbi/ncbi_amr_X_train.npy +3 -0
- data_processed/ncbi/ncbi_amr_X_val.npy +3 -0
- data_processed/ncbi/ncbi_amr_metadata.json +537 -0
- data_processed/ncbi/ncbi_amr_y_test.npy +3 -0
- data_processed/ncbi/ncbi_amr_y_train.npy +3 -0
- data_processed/ncbi/ncbi_amr_y_val.npy +3 -0
- data_processed/ncbi/ncbi_organism_X_test.npy +3 -0
- data_processed/ncbi/ncbi_organism_X_train.npy +3 -0
- data_processed/ncbi/ncbi_organism_X_val.npy +3 -0
- data_processed/ncbi/ncbi_organism_metadata.json +521 -0
- data_processed/ncbi/ncbi_organism_y_test.npy +3 -0
- data_processed/ncbi/ncbi_organism_y_train.npy +3 -0
- data_processed/ncbi/ncbi_organism_y_val.npy +3 -0
- data_processed/patric/patric_cefoxitin_X_test.npy +3 -0
- data_processed/patric/patric_cefoxitin_X_train.npy +3 -0
- data_processed/patric/patric_cefoxitin_X_val.npy +3 -0
- data_processed/patric/patric_cefoxitin_metadata.json +515 -0
- data_processed/patric/patric_cefoxitin_y_test.npy +3 -0
- data_processed/patric/patric_cefoxitin_y_train.npy +3 -0
- data_processed/patric/patric_cefoxitin_y_val.npy +3 -0
- data_processed/patric/patric_ciprofloxacin_X_test.npy +3 -0
- data_processed/patric/patric_ciprofloxacin_X_train.npy +3 -0
- data_processed/patric/patric_ciprofloxacin_X_val.npy +3 -0
- data_processed/patric/patric_ciprofloxacin_metadata.json +515 -0
- data_processed/patric/patric_ciprofloxacin_y_test.npy +3 -0
.gitignore
ADDED
|
@@ -0,0 +1,7 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
__pycache__/
|
| 2 |
+
*.py[codz]
|
| 3 |
+
.DS_Store
|
| 4 |
+
deepamr.db
|
| 5 |
+
deepamr.db-wal
|
| 6 |
+
deepamr.db-shm
|
| 7 |
+
.env
|
Dockerfile
ADDED
|
@@ -0,0 +1,30 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
FROM python:3.11-slim
|
| 2 |
+
|
| 3 |
+
# HF Spaces runs as uid 1000
|
| 4 |
+
RUN useradd -m -u 1000 user
|
| 5 |
+
|
| 6 |
+
WORKDIR /app
|
| 7 |
+
|
| 8 |
+
# Install system dependencies
|
| 9 |
+
RUN apt-get update && apt-get install -y --no-install-recommends \
|
| 10 |
+
gcc g++ && \
|
| 11 |
+
rm -rf /var/lib/apt/lists/*
|
| 12 |
+
|
| 13 |
+
# Install Python dependencies
|
| 14 |
+
COPY requirements.txt .
|
| 15 |
+
RUN pip install --no-cache-dir -r requirements.txt
|
| 16 |
+
|
| 17 |
+
# Copy application code
|
| 18 |
+
COPY src/ src/
|
| 19 |
+
COPY models/ models/
|
| 20 |
+
COPY data_processed/ data/processed/
|
| 21 |
+
COPY demo/ demo/
|
| 22 |
+
|
| 23 |
+
# Make everything writable for the app user (needed for SQLite DB)
|
| 24 |
+
RUN chown -R user:user /app
|
| 25 |
+
|
| 26 |
+
USER user
|
| 27 |
+
|
| 28 |
+
EXPOSE 7860
|
| 29 |
+
|
| 30 |
+
CMD ["uvicorn", "src.api.main:app", "--host", "0.0.0.0", "--port", "7860"]
|
README.md
CHANGED
|
@@ -1,12 +1,22 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
colorFrom: blue
|
| 5 |
-
colorTo:
|
| 6 |
sdk: docker
|
|
|
|
| 7 |
pinned: false
|
| 8 |
license: mit
|
| 9 |
-
short_description: '
|
| 10 |
---
|
| 11 |
|
| 12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: DeepAMR API
|
| 3 |
+
emoji: 🧬
|
| 4 |
colorFrom: blue
|
| 5 |
+
colorTo: green
|
| 6 |
sdk: docker
|
| 7 |
+
app_port: 7860
|
| 8 |
pinned: false
|
| 9 |
license: mit
|
| 10 |
+
short_description: 'Deep Learning for AMR Prediction'
|
| 11 |
---
|
| 12 |
|
| 13 |
+
# DeepAMR - Antimicrobial Resistance Prediction API
|
| 14 |
+
|
| 15 |
+
Deep Learning API for predicting antibiotic resistance from bacterial genomic sequences.
|
| 16 |
+
|
| 17 |
+
- **11 drug classes** supported
|
| 18 |
+
- **84.3% Micro F1**, **98.6% AUC**
|
| 19 |
+
- Bangladesh-specific clinical guidelines
|
| 20 |
+
- PDF report generation
|
| 21 |
+
|
| 22 |
+
API docs available at `/docs`
|
data_processed/card/card_drug_class_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:37754e2ffccec906083a21935e46f862471a3945ffa65b46fea2c87ae191bb5d
|
| 3 |
+
size 4844128
|
data_processed/card/card_drug_class_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7053e2e0734de9ae8c46483fad2fb11c8d181f371c26d250cbfe5eea2abe3355
|
| 3 |
+
size 16948128
|
data_processed/card/card_drug_class_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:562364532d2014e72213a80cd2191590b9963132f29c09843abf6b6a1fc6df1b
|
| 3 |
+
size 2424128
|
data_processed/card/card_drug_class_metadata.json
ADDED
|
@@ -0,0 +1,551 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"AAA",
|
| 4 |
+
"IGL",
|
| 5 |
+
"ALA",
|
| 6 |
+
"KTG",
|
| 7 |
+
"LLL",
|
| 8 |
+
"AAL",
|
| 9 |
+
"LAA",
|
| 10 |
+
"ISL",
|
| 11 |
+
"RLD",
|
| 12 |
+
"GLE",
|
| 13 |
+
"TGS",
|
| 14 |
+
"LEQ",
|
| 15 |
+
"ALG",
|
| 16 |
+
"AST",
|
| 17 |
+
"GPL",
|
| 18 |
+
"DLA",
|
| 19 |
+
"ALQ",
|
| 20 |
+
"SIG",
|
| 21 |
+
"AAV",
|
| 22 |
+
"RDT",
|
| 23 |
+
"LDA",
|
| 24 |
+
"PAS",
|
| 25 |
+
"LAT",
|
| 26 |
+
"LGL",
|
| 27 |
+
"ALL",
|
| 28 |
+
"STF",
|
| 29 |
+
"LAR",
|
| 30 |
+
"LDR",
|
| 31 |
+
"SAA",
|
| 32 |
+
"VDA",
|
| 33 |
+
"IPG",
|
| 34 |
+
"TFK",
|
| 35 |
+
"LPA",
|
| 36 |
+
"GDA",
|
| 37 |
+
"PLK",
|
| 38 |
+
"AVL",
|
| 39 |
+
"LAQ",
|
| 40 |
+
"LAV",
|
| 41 |
+
"GYG",
|
| 42 |
+
"VAL",
|
| 43 |
+
"ATL",
|
| 44 |
+
"LFG",
|
| 45 |
+
"ANL",
|
| 46 |
+
"ALI",
|
| 47 |
+
"LAS",
|
| 48 |
+
"QAL",
|
| 49 |
+
"GLG",
|
| 50 |
+
"TLL",
|
| 51 |
+
"GLP",
|
| 52 |
+
"LPL",
|
| 53 |
+
"VAF",
|
| 54 |
+
"GFG",
|
| 55 |
+
"SLL",
|
| 56 |
+
"RVG",
|
| 57 |
+
"AVA",
|
| 58 |
+
"LSA",
|
| 59 |
+
"VLA",
|
| 60 |
+
"LAN",
|
| 61 |
+
"SDN",
|
| 62 |
+
"LTA",
|
| 63 |
+
"PAL",
|
| 64 |
+
"SLK",
|
| 65 |
+
"TTG",
|
| 66 |
+
"AGG",
|
| 67 |
+
"LDL",
|
| 68 |
+
"YGN",
|
| 69 |
+
"DDR",
|
| 70 |
+
"TLA",
|
| 71 |
+
"LLA",
|
| 72 |
+
"AIP",
|
| 73 |
+
"GLA",
|
| 74 |
+
"QRL",
|
| 75 |
+
"GWE",
|
| 76 |
+
"GVK",
|
| 77 |
+
"ATY",
|
| 78 |
+
"LLD",
|
| 79 |
+
"GLF",
|
| 80 |
+
"LLR",
|
| 81 |
+
"IAA",
|
| 82 |
+
"LGW",
|
| 83 |
+
"GSV",
|
| 84 |
+
"RIG",
|
| 85 |
+
"ERL",
|
| 86 |
+
"SKE",
|
| 87 |
+
"YGV",
|
| 88 |
+
"TAG",
|
| 89 |
+
"EQQ",
|
| 90 |
+
"NLL",
|
| 91 |
+
"ELG",
|
| 92 |
+
"RFP",
|
| 93 |
+
"LEK",
|
| 94 |
+
"LLT",
|
| 95 |
+
"LKR",
|
| 96 |
+
"GST",
|
| 97 |
+
"AGL",
|
| 98 |
+
"GGL",
|
| 99 |
+
"SVS",
|
| 100 |
+
"LVD",
|
| 101 |
+
"FGA",
|
| 102 |
+
"LSG",
|
| 103 |
+
"KAL",
|
| 104 |
+
"DER",
|
| 105 |
+
"KRL",
|
| 106 |
+
"TLG",
|
| 107 |
+
"LAG",
|
| 108 |
+
"VGD",
|
| 109 |
+
"TPA",
|
| 110 |
+
"NAL",
|
| 111 |
+
"TYT",
|
| 112 |
+
"SAI",
|
| 113 |
+
"TLF",
|
| 114 |
+
"AIA",
|
| 115 |
+
"GGP",
|
| 116 |
+
"LAL",
|
| 117 |
+
"LGD",
|
| 118 |
+
"LGS",
|
| 119 |
+
"ALV",
|
| 120 |
+
"ERF",
|
| 121 |
+
"LTG",
|
| 122 |
+
"PVT",
|
| 123 |
+
"TAT",
|
| 124 |
+
"AER",
|
| 125 |
+
"LVI",
|
| 126 |
+
"LGI",
|
| 127 |
+
"LKG",
|
| 128 |
+
"DTT",
|
| 129 |
+
"AAN",
|
| 130 |
+
"LTL",
|
| 131 |
+
"GVL",
|
| 132 |
+
"EIG",
|
| 133 |
+
"ARS",
|
| 134 |
+
"NTA",
|
| 135 |
+
"LFE",
|
| 136 |
+
"TTP",
|
| 137 |
+
"LAE",
|
| 138 |
+
"VPA",
|
| 139 |
+
"QTL",
|
| 140 |
+
"VGP",
|
| 141 |
+
"VSK",
|
| 142 |
+
"AGN",
|
| 143 |
+
"ASA",
|
| 144 |
+
"EAA",
|
| 145 |
+
"ASK",
|
| 146 |
+
"RAS",
|
| 147 |
+
"RLY",
|
| 148 |
+
"VTP",
|
| 149 |
+
"QLG",
|
| 150 |
+
"GIA",
|
| 151 |
+
"GLV",
|
| 152 |
+
"VTA",
|
| 153 |
+
"ADI",
|
| 154 |
+
"KSL",
|
| 155 |
+
"AAR",
|
| 156 |
+
"SQR",
|
| 157 |
+
"EQL",
|
| 158 |
+
"MKA",
|
| 159 |
+
"APA",
|
| 160 |
+
"ASL",
|
| 161 |
+
"TTT",
|
| 162 |
+
"TSA",
|
| 163 |
+
"YRQ",
|
| 164 |
+
"QGL",
|
| 165 |
+
"SYG",
|
| 166 |
+
"TAF",
|
| 167 |
+
"ATT",
|
| 168 |
+
"LLS",
|
| 169 |
+
"ADL",
|
| 170 |
+
"PLQ",
|
| 171 |
+
"AAS",
|
| 172 |
+
"TLP",
|
| 173 |
+
"SKT",
|
| 174 |
+
"GKA",
|
| 175 |
+
"IGD",
|
| 176 |
+
"PLL",
|
| 177 |
+
"KAS",
|
| 178 |
+
"ALE",
|
| 179 |
+
"LLF",
|
| 180 |
+
"KTF",
|
| 181 |
+
"RRI",
|
| 182 |
+
"PAP",
|
| 183 |
+
"AQA",
|
| 184 |
+
"VLV",
|
| 185 |
+
"ALP",
|
| 186 |
+
"PAD",
|
| 187 |
+
"AVI",
|
| 188 |
+
"AIS",
|
| 189 |
+
"AYA",
|
| 190 |
+
"LGG",
|
| 191 |
+
"AIL",
|
| 192 |
+
"LIG",
|
| 193 |
+
"DAE",
|
| 194 |
+
"YVA",
|
| 195 |
+
"LVG",
|
| 196 |
+
"GAA",
|
| 197 |
+
"SLG",
|
| 198 |
+
"LNA",
|
| 199 |
+
"SAL",
|
| 200 |
+
"STL",
|
| 201 |
+
"DRP",
|
| 202 |
+
"IAR",
|
| 203 |
+
"PAG",
|
| 204 |
+
"ATA",
|
| 205 |
+
"DMT",
|
| 206 |
+
"MKK",
|
| 207 |
+
"IVA",
|
| 208 |
+
"DEV",
|
| 209 |
+
"DLL",
|
| 210 |
+
"QPQ",
|
| 211 |
+
"QLA",
|
| 212 |
+
"PGD",
|
| 213 |
+
"DGK",
|
| 214 |
+
"LLN",
|
| 215 |
+
"NDI",
|
| 216 |
+
"QDK",
|
| 217 |
+
"IAD",
|
| 218 |
+
"NKT",
|
| 219 |
+
"DKT",
|
| 220 |
+
"KLA",
|
| 221 |
+
"TGA",
|
| 222 |
+
"MLN",
|
| 223 |
+
"EAY",
|
| 224 |
+
"PGM",
|
| 225 |
+
"YSN",
|
| 226 |
+
"ARL",
|
| 227 |
+
"WQP",
|
| 228 |
+
"YTA",
|
| 229 |
+
"LCG",
|
| 230 |
+
"FTA",
|
| 231 |
+
"GAV",
|
| 232 |
+
"ANK",
|
| 233 |
+
"LEG",
|
| 234 |
+
"FPD",
|
| 235 |
+
"YPN",
|
| 236 |
+
"VQP",
|
| 237 |
+
"VGW",
|
| 238 |
+
"AVQ",
|
| 239 |
+
"KTL",
|
| 240 |
+
"LKI",
|
| 241 |
+
"LKA",
|
| 242 |
+
"DLV",
|
| 243 |
+
"ILS",
|
| 244 |
+
"ISA",
|
| 245 |
+
"GNT",
|
| 246 |
+
"FSY",
|
| 247 |
+
"ALD",
|
| 248 |
+
"YGL",
|
| 249 |
+
"SNP",
|
| 250 |
+
"QAG",
|
| 251 |
+
"PSI",
|
| 252 |
+
"QYS",
|
| 253 |
+
"GNA",
|
| 254 |
+
"LGV",
|
| 255 |
+
"IGS",
|
| 256 |
+
"GDK",
|
| 257 |
+
"KIS",
|
| 258 |
+
"KAE",
|
| 259 |
+
"SVQ",
|
| 260 |
+
"FWL",
|
| 261 |
+
"PGP",
|
| 262 |
+
"LLG",
|
| 263 |
+
"AEL",
|
| 264 |
+
"AYG",
|
| 265 |
+
"ETL",
|
| 266 |
+
"PLA",
|
| 267 |
+
"QQG",
|
| 268 |
+
"KSG",
|
| 269 |
+
"ARR",
|
| 270 |
+
"LVT",
|
| 271 |
+
"MTL",
|
| 272 |
+
"DAA",
|
| 273 |
+
"VLL",
|
| 274 |
+
"GYA",
|
| 275 |
+
"MAV",
|
| 276 |
+
"RLL",
|
| 277 |
+
"GKP",
|
| 278 |
+
"GAL",
|
| 279 |
+
"KLL",
|
| 280 |
+
"VKT",
|
| 281 |
+
"APL",
|
| 282 |
+
"FGY",
|
| 283 |
+
"NEA",
|
| 284 |
+
"TLR",
|
| 285 |
+
"LQF",
|
| 286 |
+
"ITP",
|
| 287 |
+
"NPS",
|
| 288 |
+
"IKK",
|
| 289 |
+
"VAI",
|
| 290 |
+
"YAK",
|
| 291 |
+
"LNK",
|
| 292 |
+
"GMT",
|
| 293 |
+
"EIK",
|
| 294 |
+
"QWQ",
|
| 295 |
+
"ALS",
|
| 296 |
+
"GWV",
|
| 297 |
+
"DTP",
|
| 298 |
+
"SEK",
|
| 299 |
+
"VPG",
|
| 300 |
+
"LLI",
|
| 301 |
+
"LDD",
|
| 302 |
+
"KKS",
|
| 303 |
+
"FPA",
|
| 304 |
+
"LRF",
|
| 305 |
+
"KEL",
|
| 306 |
+
"LLK",
|
| 307 |
+
"ILA",
|
| 308 |
+
"HKT",
|
| 309 |
+
"AGE",
|
| 310 |
+
"GPG",
|
| 311 |
+
"YGK",
|
| 312 |
+
"IAG",
|
| 313 |
+
"VMK",
|
| 314 |
+
"GIV",
|
| 315 |
+
"APQ",
|
| 316 |
+
"GSR",
|
| 317 |
+
"VPL",
|
| 318 |
+
"GIS",
|
| 319 |
+
"ARA",
|
| 320 |
+
"FAA",
|
| 321 |
+
"GDE",
|
| 322 |
+
"VIY",
|
| 323 |
+
"IAL",
|
| 324 |
+
"ADK",
|
| 325 |
+
"AND",
|
| 326 |
+
"DRA",
|
| 327 |
+
"QQV",
|
| 328 |
+
"STN",
|
| 329 |
+
"NAE",
|
| 330 |
+
"GVA",
|
| 331 |
+
"PLD",
|
| 332 |
+
"LPF",
|
| 333 |
+
"QFP",
|
| 334 |
+
"DIA",
|
| 335 |
+
"VPE",
|
| 336 |
+
"KDQ",
|
| 337 |
+
"TYA",
|
| 338 |
+
"TFT",
|
| 339 |
+
"DVP",
|
| 340 |
+
"STS",
|
| 341 |
+
"ADE",
|
| 342 |
+
"VNP",
|
| 343 |
+
"PVY",
|
| 344 |
+
"KDD",
|
| 345 |
+
"VYQ",
|
| 346 |
+
"ELA",
|
| 347 |
+
"GEA",
|
| 348 |
+
"LRK",
|
| 349 |
+
"TGW",
|
| 350 |
+
"NDL",
|
| 351 |
+
"ARV",
|
| 352 |
+
"GPA",
|
| 353 |
+
"EKH",
|
| 354 |
+
"QIA",
|
| 355 |
+
"AIT",
|
| 356 |
+
"AMA",
|
| 357 |
+
"AAD",
|
| 358 |
+
"IYA",
|
| 359 |
+
"TRL",
|
| 360 |
+
"YAQ",
|
| 361 |
+
"EQT",
|
| 362 |
+
"LEL",
|
| 363 |
+
"TNG",
|
| 364 |
+
"GMA",
|
| 365 |
+
"LIA",
|
| 366 |
+
"TGV",
|
| 367 |
+
"GKV",
|
| 368 |
+
"DLG",
|
| 369 |
+
"AQT",
|
| 370 |
+
"KLS",
|
| 371 |
+
"FVP",
|
| 372 |
+
"AQG",
|
| 373 |
+
"LNE",
|
| 374 |
+
"PIS",
|
| 375 |
+
"IVM",
|
| 376 |
+
"GAY",
|
| 377 |
+
"PET",
|
| 378 |
+
"QDL",
|
| 379 |
+
"AYV",
|
| 380 |
+
"ATV",
|
| 381 |
+
"LVL",
|
| 382 |
+
"VML",
|
| 383 |
+
"NGF",
|
| 384 |
+
"LLQ",
|
| 385 |
+
"VFK",
|
| 386 |
+
"ERI",
|
| 387 |
+
"GDM",
|
| 388 |
+
"EKN",
|
| 389 |
+
"AFV",
|
| 390 |
+
"QVL",
|
| 391 |
+
"FVD",
|
| 392 |
+
"VQD",
|
| 393 |
+
"MTV",
|
| 394 |
+
"FEL",
|
| 395 |
+
"EVK",
|
| 396 |
+
"AKS",
|
| 397 |
+
"GSQ",
|
| 398 |
+
"LFA",
|
| 399 |
+
"IKA",
|
| 400 |
+
"AAT",
|
| 401 |
+
"ELN",
|
| 402 |
+
"LGA",
|
| 403 |
+
"VAE",
|
| 404 |
+
"VKA",
|
| 405 |
+
"KIA",
|
| 406 |
+
"PEL",
|
| 407 |
+
"DNT",
|
| 408 |
+
"KAA",
|
| 409 |
+
"LYA",
|
| 410 |
+
"RKL",
|
| 411 |
+
"ADR",
|
| 412 |
+
"LPV",
|
| 413 |
+
"AEA",
|
| 414 |
+
"EER",
|
| 415 |
+
"KVA",
|
| 416 |
+
"KAN",
|
| 417 |
+
"ASI",
|
| 418 |
+
"LQA",
|
| 419 |
+
"WLV",
|
| 420 |
+
"VIL",
|
| 421 |
+
"PLR",
|
| 422 |
+
"AFS",
|
| 423 |
+
"PHY",
|
| 424 |
+
"HRI",
|
| 425 |
+
"VTE",
|
| 426 |
+
"DRL",
|
| 427 |
+
"AVK",
|
| 428 |
+
"FIP",
|
| 429 |
+
"VVA",
|
| 430 |
+
"RIA",
|
| 431 |
+
"AAI",
|
| 432 |
+
"YQG",
|
| 433 |
+
"LTN",
|
| 434 |
+
"YLA",
|
| 435 |
+
"AFL",
|
| 436 |
+
"ERV",
|
| 437 |
+
"LQP",
|
| 438 |
+
"QVG",
|
| 439 |
+
"GQP",
|
| 440 |
+
"GRR",
|
| 441 |
+
"HPE",
|
| 442 |
+
"LQG",
|
| 443 |
+
"LNL",
|
| 444 |
+
"SGG",
|
| 445 |
+
"TPQ",
|
| 446 |
+
"SYV",
|
| 447 |
+
"WVV",
|
| 448 |
+
"QAQ",
|
| 449 |
+
"DSV",
|
| 450 |
+
"ARG",
|
| 451 |
+
"NST",
|
| 452 |
+
"TPE",
|
| 453 |
+
"RPL",
|
| 454 |
+
"HYF",
|
| 455 |
+
"EQI",
|
| 456 |
+
"LVA",
|
| 457 |
+
"RSL",
|
| 458 |
+
"QQL",
|
| 459 |
+
"YFT",
|
| 460 |
+
"APG",
|
| 461 |
+
"GEL",
|
| 462 |
+
"FDG",
|
| 463 |
+
"SGL",
|
| 464 |
+
"SGA",
|
| 465 |
+
"AQI",
|
| 466 |
+
"QPV",
|
| 467 |
+
"FSL",
|
| 468 |
+
"GYL",
|
| 469 |
+
"PNA",
|
| 470 |
+
"RAP",
|
| 471 |
+
"QRA",
|
| 472 |
+
"LFI",
|
| 473 |
+
"ANR",
|
| 474 |
+
"GNL",
|
| 475 |
+
"VLG",
|
| 476 |
+
"LFP",
|
| 477 |
+
"QKD",
|
| 478 |
+
"GIL",
|
| 479 |
+
"EKI",
|
| 480 |
+
"SPV",
|
| 481 |
+
"DQA",
|
| 482 |
+
"LER",
|
| 483 |
+
"RLG",
|
| 484 |
+
"DAI",
|
| 485 |
+
"TAA",
|
| 486 |
+
"PDS",
|
| 487 |
+
"RDL",
|
| 488 |
+
"VTR",
|
| 489 |
+
"DAG",
|
| 490 |
+
"AEG",
|
| 491 |
+
"SLI",
|
| 492 |
+
"FKW",
|
| 493 |
+
"VAG",
|
| 494 |
+
"VAV",
|
| 495 |
+
"RFV",
|
| 496 |
+
"GAG",
|
| 497 |
+
"GLT",
|
| 498 |
+
"VKR",
|
| 499 |
+
"RQQ",
|
| 500 |
+
"RVE",
|
| 501 |
+
"KGE",
|
| 502 |
+
"TRF"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"aminocoumarin antibiotic",
|
| 506 |
+
"aminoglycoside antibiotic",
|
| 507 |
+
"antibacterial free fatty acids",
|
| 508 |
+
"antibiotic without defined classification",
|
| 509 |
+
"bicyclomycin-like antibiotic",
|
| 510 |
+
"carbapenem",
|
| 511 |
+
"cephalosporin",
|
| 512 |
+
"diaminopyrimidine antibiotic",
|
| 513 |
+
"disinfecting agents and antiseptics",
|
| 514 |
+
"elfamycin antibiotic",
|
| 515 |
+
"fluoroquinolone antibiotic",
|
| 516 |
+
"fusidane antibiotic",
|
| 517 |
+
"glycopeptide antibiotic",
|
| 518 |
+
"glycylcycline",
|
| 519 |
+
"isoniazid-like antibiotic",
|
| 520 |
+
"lincosamide antibiotic",
|
| 521 |
+
"macrolide antibiotic",
|
| 522 |
+
"moenomycin antibiotic",
|
| 523 |
+
"monobactam",
|
| 524 |
+
"mupirocin-like antibiotic",
|
| 525 |
+
"nitrofuran antibiotic",
|
| 526 |
+
"nitroimidazole antibiotic",
|
| 527 |
+
"nucleoside antibiotic",
|
| 528 |
+
"orthosomycin antibiotic",
|
| 529 |
+
"oxazolidinone antibiotic",
|
| 530 |
+
"penicillin beta-lactam",
|
| 531 |
+
"peptide antibiotic",
|
| 532 |
+
"phenicol antibiotic",
|
| 533 |
+
"phosphonic acid antibiotic",
|
| 534 |
+
"pleuromutilin antibiotic",
|
| 535 |
+
"polyamine antibiotic",
|
| 536 |
+
"rifamycin antibiotic",
|
| 537 |
+
"streptogramin A antibiotic",
|
| 538 |
+
"streptogramin B antibiotic",
|
| 539 |
+
"streptogramin antibiotic",
|
| 540 |
+
"sulfonamide antibiotic",
|
| 541 |
+
"sulfone antibiotic",
|
| 542 |
+
"tetracycline antibiotic"
|
| 543 |
+
],
|
| 544 |
+
"task_type": "multilabel",
|
| 545 |
+
"target": "drug_class",
|
| 546 |
+
"k": 3,
|
| 547 |
+
"max_features": 500,
|
| 548 |
+
"n_samples": 6054,
|
| 549 |
+
"n_features": 500,
|
| 550 |
+
"n_classes": 38
|
| 551 |
+
}
|
data_processed/card/card_drug_class_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c9bcbd653442796426386f60035cffc438caedc24f07e4465b0cd180dac83ab7
|
| 3 |
+
size 368272
|
data_processed/card/card_drug_class_y_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c97d499ebb542381bac702a8d55e11bf3f4eb8aef16a495023397b0027292a6b
|
| 3 |
+
size 1288176
|
data_processed/card/card_drug_class_y_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b7237e52cd44e3252b7ab582a21374313aad6d5b3231d8a50b2b9cd06964764b
|
| 3 |
+
size 184352
|
data_processed/card/card_gene_family_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:37754e2ffccec906083a21935e46f862471a3945ffa65b46fea2c87ae191bb5d
|
| 3 |
+
size 4844128
|
data_processed/card/card_gene_family_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7053e2e0734de9ae8c46483fad2fb11c8d181f371c26d250cbfe5eea2abe3355
|
| 3 |
+
size 16948128
|
data_processed/card/card_gene_family_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:562364532d2014e72213a80cd2191590b9963132f29c09843abf6b6a1fc6df1b
|
| 3 |
+
size 2424128
|
data_processed/card/card_gene_family_metadata.json
ADDED
|
@@ -0,0 +1,911 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"AAA",
|
| 4 |
+
"IGL",
|
| 5 |
+
"ALA",
|
| 6 |
+
"KTG",
|
| 7 |
+
"LLL",
|
| 8 |
+
"AAL",
|
| 9 |
+
"LAA",
|
| 10 |
+
"ISL",
|
| 11 |
+
"RLD",
|
| 12 |
+
"GLE",
|
| 13 |
+
"TGS",
|
| 14 |
+
"LEQ",
|
| 15 |
+
"ALG",
|
| 16 |
+
"AST",
|
| 17 |
+
"GPL",
|
| 18 |
+
"DLA",
|
| 19 |
+
"ALQ",
|
| 20 |
+
"SIG",
|
| 21 |
+
"AAV",
|
| 22 |
+
"RDT",
|
| 23 |
+
"LDA",
|
| 24 |
+
"PAS",
|
| 25 |
+
"LAT",
|
| 26 |
+
"LGL",
|
| 27 |
+
"ALL",
|
| 28 |
+
"STF",
|
| 29 |
+
"LAR",
|
| 30 |
+
"LDR",
|
| 31 |
+
"SAA",
|
| 32 |
+
"VDA",
|
| 33 |
+
"IPG",
|
| 34 |
+
"TFK",
|
| 35 |
+
"LPA",
|
| 36 |
+
"GDA",
|
| 37 |
+
"PLK",
|
| 38 |
+
"AVL",
|
| 39 |
+
"LAQ",
|
| 40 |
+
"LAV",
|
| 41 |
+
"GYG",
|
| 42 |
+
"VAL",
|
| 43 |
+
"ATL",
|
| 44 |
+
"LFG",
|
| 45 |
+
"ANL",
|
| 46 |
+
"ALI",
|
| 47 |
+
"LAS",
|
| 48 |
+
"QAL",
|
| 49 |
+
"GLG",
|
| 50 |
+
"TLL",
|
| 51 |
+
"GLP",
|
| 52 |
+
"LPL",
|
| 53 |
+
"VAF",
|
| 54 |
+
"GFG",
|
| 55 |
+
"SLL",
|
| 56 |
+
"RVG",
|
| 57 |
+
"AVA",
|
| 58 |
+
"LSA",
|
| 59 |
+
"VLA",
|
| 60 |
+
"LAN",
|
| 61 |
+
"SDN",
|
| 62 |
+
"LTA",
|
| 63 |
+
"PAL",
|
| 64 |
+
"SLK",
|
| 65 |
+
"TTG",
|
| 66 |
+
"AGG",
|
| 67 |
+
"LDL",
|
| 68 |
+
"YGN",
|
| 69 |
+
"DDR",
|
| 70 |
+
"TLA",
|
| 71 |
+
"LLA",
|
| 72 |
+
"AIP",
|
| 73 |
+
"GLA",
|
| 74 |
+
"QRL",
|
| 75 |
+
"GWE",
|
| 76 |
+
"GVK",
|
| 77 |
+
"ATY",
|
| 78 |
+
"LLD",
|
| 79 |
+
"GLF",
|
| 80 |
+
"LLR",
|
| 81 |
+
"IAA",
|
| 82 |
+
"LGW",
|
| 83 |
+
"GSV",
|
| 84 |
+
"RIG",
|
| 85 |
+
"ERL",
|
| 86 |
+
"SKE",
|
| 87 |
+
"YGV",
|
| 88 |
+
"TAG",
|
| 89 |
+
"EQQ",
|
| 90 |
+
"NLL",
|
| 91 |
+
"ELG",
|
| 92 |
+
"RFP",
|
| 93 |
+
"LEK",
|
| 94 |
+
"LLT",
|
| 95 |
+
"LKR",
|
| 96 |
+
"GST",
|
| 97 |
+
"AGL",
|
| 98 |
+
"GGL",
|
| 99 |
+
"SVS",
|
| 100 |
+
"LVD",
|
| 101 |
+
"FGA",
|
| 102 |
+
"LSG",
|
| 103 |
+
"KAL",
|
| 104 |
+
"DER",
|
| 105 |
+
"KRL",
|
| 106 |
+
"TLG",
|
| 107 |
+
"LAG",
|
| 108 |
+
"VGD",
|
| 109 |
+
"TPA",
|
| 110 |
+
"NAL",
|
| 111 |
+
"TYT",
|
| 112 |
+
"SAI",
|
| 113 |
+
"TLF",
|
| 114 |
+
"AIA",
|
| 115 |
+
"GGP",
|
| 116 |
+
"LAL",
|
| 117 |
+
"LGD",
|
| 118 |
+
"LGS",
|
| 119 |
+
"ALV",
|
| 120 |
+
"ERF",
|
| 121 |
+
"LTG",
|
| 122 |
+
"PVT",
|
| 123 |
+
"TAT",
|
| 124 |
+
"AER",
|
| 125 |
+
"LVI",
|
| 126 |
+
"LGI",
|
| 127 |
+
"LKG",
|
| 128 |
+
"DTT",
|
| 129 |
+
"AAN",
|
| 130 |
+
"LTL",
|
| 131 |
+
"GVL",
|
| 132 |
+
"EIG",
|
| 133 |
+
"ARS",
|
| 134 |
+
"NTA",
|
| 135 |
+
"LFE",
|
| 136 |
+
"TTP",
|
| 137 |
+
"LAE",
|
| 138 |
+
"VPA",
|
| 139 |
+
"QTL",
|
| 140 |
+
"VGP",
|
| 141 |
+
"VSK",
|
| 142 |
+
"AGN",
|
| 143 |
+
"ASA",
|
| 144 |
+
"EAA",
|
| 145 |
+
"ASK",
|
| 146 |
+
"RAS",
|
| 147 |
+
"RLY",
|
| 148 |
+
"VTP",
|
| 149 |
+
"QLG",
|
| 150 |
+
"GIA",
|
| 151 |
+
"GLV",
|
| 152 |
+
"VTA",
|
| 153 |
+
"ADI",
|
| 154 |
+
"KSL",
|
| 155 |
+
"AAR",
|
| 156 |
+
"SQR",
|
| 157 |
+
"EQL",
|
| 158 |
+
"MKA",
|
| 159 |
+
"APA",
|
| 160 |
+
"ASL",
|
| 161 |
+
"TTT",
|
| 162 |
+
"TSA",
|
| 163 |
+
"YRQ",
|
| 164 |
+
"QGL",
|
| 165 |
+
"SYG",
|
| 166 |
+
"TAF",
|
| 167 |
+
"ATT",
|
| 168 |
+
"LLS",
|
| 169 |
+
"ADL",
|
| 170 |
+
"PLQ",
|
| 171 |
+
"AAS",
|
| 172 |
+
"TLP",
|
| 173 |
+
"SKT",
|
| 174 |
+
"GKA",
|
| 175 |
+
"IGD",
|
| 176 |
+
"PLL",
|
| 177 |
+
"KAS",
|
| 178 |
+
"ALE",
|
| 179 |
+
"LLF",
|
| 180 |
+
"KTF",
|
| 181 |
+
"RRI",
|
| 182 |
+
"PAP",
|
| 183 |
+
"AQA",
|
| 184 |
+
"VLV",
|
| 185 |
+
"ALP",
|
| 186 |
+
"PAD",
|
| 187 |
+
"AVI",
|
| 188 |
+
"AIS",
|
| 189 |
+
"AYA",
|
| 190 |
+
"LGG",
|
| 191 |
+
"AIL",
|
| 192 |
+
"LIG",
|
| 193 |
+
"DAE",
|
| 194 |
+
"YVA",
|
| 195 |
+
"LVG",
|
| 196 |
+
"GAA",
|
| 197 |
+
"SLG",
|
| 198 |
+
"LNA",
|
| 199 |
+
"SAL",
|
| 200 |
+
"STL",
|
| 201 |
+
"DRP",
|
| 202 |
+
"IAR",
|
| 203 |
+
"PAG",
|
| 204 |
+
"ATA",
|
| 205 |
+
"DMT",
|
| 206 |
+
"MKK",
|
| 207 |
+
"IVA",
|
| 208 |
+
"DEV",
|
| 209 |
+
"DLL",
|
| 210 |
+
"QPQ",
|
| 211 |
+
"QLA",
|
| 212 |
+
"PGD",
|
| 213 |
+
"DGK",
|
| 214 |
+
"LLN",
|
| 215 |
+
"NDI",
|
| 216 |
+
"QDK",
|
| 217 |
+
"IAD",
|
| 218 |
+
"NKT",
|
| 219 |
+
"DKT",
|
| 220 |
+
"KLA",
|
| 221 |
+
"TGA",
|
| 222 |
+
"MLN",
|
| 223 |
+
"EAY",
|
| 224 |
+
"PGM",
|
| 225 |
+
"YSN",
|
| 226 |
+
"ARL",
|
| 227 |
+
"WQP",
|
| 228 |
+
"YTA",
|
| 229 |
+
"LCG",
|
| 230 |
+
"FTA",
|
| 231 |
+
"GAV",
|
| 232 |
+
"ANK",
|
| 233 |
+
"LEG",
|
| 234 |
+
"FPD",
|
| 235 |
+
"YPN",
|
| 236 |
+
"VQP",
|
| 237 |
+
"VGW",
|
| 238 |
+
"AVQ",
|
| 239 |
+
"KTL",
|
| 240 |
+
"LKI",
|
| 241 |
+
"LKA",
|
| 242 |
+
"DLV",
|
| 243 |
+
"ILS",
|
| 244 |
+
"ISA",
|
| 245 |
+
"GNT",
|
| 246 |
+
"FSY",
|
| 247 |
+
"ALD",
|
| 248 |
+
"YGL",
|
| 249 |
+
"SNP",
|
| 250 |
+
"QAG",
|
| 251 |
+
"PSI",
|
| 252 |
+
"QYS",
|
| 253 |
+
"GNA",
|
| 254 |
+
"LGV",
|
| 255 |
+
"IGS",
|
| 256 |
+
"GDK",
|
| 257 |
+
"KIS",
|
| 258 |
+
"KAE",
|
| 259 |
+
"SVQ",
|
| 260 |
+
"FWL",
|
| 261 |
+
"PGP",
|
| 262 |
+
"LLG",
|
| 263 |
+
"AEL",
|
| 264 |
+
"AYG",
|
| 265 |
+
"ETL",
|
| 266 |
+
"PLA",
|
| 267 |
+
"QQG",
|
| 268 |
+
"KSG",
|
| 269 |
+
"ARR",
|
| 270 |
+
"LVT",
|
| 271 |
+
"MTL",
|
| 272 |
+
"DAA",
|
| 273 |
+
"VLL",
|
| 274 |
+
"GYA",
|
| 275 |
+
"MAV",
|
| 276 |
+
"RLL",
|
| 277 |
+
"GKP",
|
| 278 |
+
"GAL",
|
| 279 |
+
"KLL",
|
| 280 |
+
"VKT",
|
| 281 |
+
"APL",
|
| 282 |
+
"FGY",
|
| 283 |
+
"NEA",
|
| 284 |
+
"TLR",
|
| 285 |
+
"LQF",
|
| 286 |
+
"ITP",
|
| 287 |
+
"NPS",
|
| 288 |
+
"IKK",
|
| 289 |
+
"VAI",
|
| 290 |
+
"YAK",
|
| 291 |
+
"LNK",
|
| 292 |
+
"GMT",
|
| 293 |
+
"EIK",
|
| 294 |
+
"QWQ",
|
| 295 |
+
"ALS",
|
| 296 |
+
"GWV",
|
| 297 |
+
"DTP",
|
| 298 |
+
"SEK",
|
| 299 |
+
"VPG",
|
| 300 |
+
"LLI",
|
| 301 |
+
"LDD",
|
| 302 |
+
"KKS",
|
| 303 |
+
"FPA",
|
| 304 |
+
"LRF",
|
| 305 |
+
"KEL",
|
| 306 |
+
"LLK",
|
| 307 |
+
"ILA",
|
| 308 |
+
"HKT",
|
| 309 |
+
"AGE",
|
| 310 |
+
"GPG",
|
| 311 |
+
"YGK",
|
| 312 |
+
"IAG",
|
| 313 |
+
"VMK",
|
| 314 |
+
"GIV",
|
| 315 |
+
"APQ",
|
| 316 |
+
"GSR",
|
| 317 |
+
"VPL",
|
| 318 |
+
"GIS",
|
| 319 |
+
"ARA",
|
| 320 |
+
"FAA",
|
| 321 |
+
"GDE",
|
| 322 |
+
"VIY",
|
| 323 |
+
"IAL",
|
| 324 |
+
"ADK",
|
| 325 |
+
"AND",
|
| 326 |
+
"DRA",
|
| 327 |
+
"QQV",
|
| 328 |
+
"STN",
|
| 329 |
+
"NAE",
|
| 330 |
+
"GVA",
|
| 331 |
+
"PLD",
|
| 332 |
+
"LPF",
|
| 333 |
+
"QFP",
|
| 334 |
+
"DIA",
|
| 335 |
+
"VPE",
|
| 336 |
+
"KDQ",
|
| 337 |
+
"TYA",
|
| 338 |
+
"TFT",
|
| 339 |
+
"DVP",
|
| 340 |
+
"STS",
|
| 341 |
+
"ADE",
|
| 342 |
+
"VNP",
|
| 343 |
+
"PVY",
|
| 344 |
+
"KDD",
|
| 345 |
+
"VYQ",
|
| 346 |
+
"ELA",
|
| 347 |
+
"GEA",
|
| 348 |
+
"LRK",
|
| 349 |
+
"TGW",
|
| 350 |
+
"NDL",
|
| 351 |
+
"ARV",
|
| 352 |
+
"GPA",
|
| 353 |
+
"EKH",
|
| 354 |
+
"QIA",
|
| 355 |
+
"AIT",
|
| 356 |
+
"AMA",
|
| 357 |
+
"AAD",
|
| 358 |
+
"IYA",
|
| 359 |
+
"TRL",
|
| 360 |
+
"YAQ",
|
| 361 |
+
"EQT",
|
| 362 |
+
"LEL",
|
| 363 |
+
"TNG",
|
| 364 |
+
"GMA",
|
| 365 |
+
"LIA",
|
| 366 |
+
"TGV",
|
| 367 |
+
"GKV",
|
| 368 |
+
"DLG",
|
| 369 |
+
"AQT",
|
| 370 |
+
"KLS",
|
| 371 |
+
"FVP",
|
| 372 |
+
"AQG",
|
| 373 |
+
"LNE",
|
| 374 |
+
"PIS",
|
| 375 |
+
"IVM",
|
| 376 |
+
"GAY",
|
| 377 |
+
"PET",
|
| 378 |
+
"QDL",
|
| 379 |
+
"AYV",
|
| 380 |
+
"ATV",
|
| 381 |
+
"LVL",
|
| 382 |
+
"VML",
|
| 383 |
+
"NGF",
|
| 384 |
+
"LLQ",
|
| 385 |
+
"VFK",
|
| 386 |
+
"ERI",
|
| 387 |
+
"GDM",
|
| 388 |
+
"EKN",
|
| 389 |
+
"AFV",
|
| 390 |
+
"QVL",
|
| 391 |
+
"FVD",
|
| 392 |
+
"VQD",
|
| 393 |
+
"MTV",
|
| 394 |
+
"FEL",
|
| 395 |
+
"EVK",
|
| 396 |
+
"AKS",
|
| 397 |
+
"GSQ",
|
| 398 |
+
"LFA",
|
| 399 |
+
"IKA",
|
| 400 |
+
"AAT",
|
| 401 |
+
"ELN",
|
| 402 |
+
"LGA",
|
| 403 |
+
"VAE",
|
| 404 |
+
"VKA",
|
| 405 |
+
"KIA",
|
| 406 |
+
"PEL",
|
| 407 |
+
"DNT",
|
| 408 |
+
"KAA",
|
| 409 |
+
"LYA",
|
| 410 |
+
"RKL",
|
| 411 |
+
"ADR",
|
| 412 |
+
"LPV",
|
| 413 |
+
"AEA",
|
| 414 |
+
"EER",
|
| 415 |
+
"KVA",
|
| 416 |
+
"KAN",
|
| 417 |
+
"ASI",
|
| 418 |
+
"LQA",
|
| 419 |
+
"WLV",
|
| 420 |
+
"VIL",
|
| 421 |
+
"PLR",
|
| 422 |
+
"AFS",
|
| 423 |
+
"PHY",
|
| 424 |
+
"HRI",
|
| 425 |
+
"VTE",
|
| 426 |
+
"DRL",
|
| 427 |
+
"AVK",
|
| 428 |
+
"FIP",
|
| 429 |
+
"VVA",
|
| 430 |
+
"RIA",
|
| 431 |
+
"AAI",
|
| 432 |
+
"YQG",
|
| 433 |
+
"LTN",
|
| 434 |
+
"YLA",
|
| 435 |
+
"AFL",
|
| 436 |
+
"ERV",
|
| 437 |
+
"LQP",
|
| 438 |
+
"QVG",
|
| 439 |
+
"GQP",
|
| 440 |
+
"GRR",
|
| 441 |
+
"HPE",
|
| 442 |
+
"LQG",
|
| 443 |
+
"LNL",
|
| 444 |
+
"SGG",
|
| 445 |
+
"TPQ",
|
| 446 |
+
"SYV",
|
| 447 |
+
"WVV",
|
| 448 |
+
"QAQ",
|
| 449 |
+
"DSV",
|
| 450 |
+
"ARG",
|
| 451 |
+
"NST",
|
| 452 |
+
"TPE",
|
| 453 |
+
"RPL",
|
| 454 |
+
"HYF",
|
| 455 |
+
"EQI",
|
| 456 |
+
"LVA",
|
| 457 |
+
"RSL",
|
| 458 |
+
"QQL",
|
| 459 |
+
"YFT",
|
| 460 |
+
"APG",
|
| 461 |
+
"GEL",
|
| 462 |
+
"FDG",
|
| 463 |
+
"SGL",
|
| 464 |
+
"SGA",
|
| 465 |
+
"AQI",
|
| 466 |
+
"QPV",
|
| 467 |
+
"FSL",
|
| 468 |
+
"GYL",
|
| 469 |
+
"PNA",
|
| 470 |
+
"RAP",
|
| 471 |
+
"QRA",
|
| 472 |
+
"LFI",
|
| 473 |
+
"ANR",
|
| 474 |
+
"GNL",
|
| 475 |
+
"VLG",
|
| 476 |
+
"LFP",
|
| 477 |
+
"QKD",
|
| 478 |
+
"GIL",
|
| 479 |
+
"EKI",
|
| 480 |
+
"SPV",
|
| 481 |
+
"DQA",
|
| 482 |
+
"LER",
|
| 483 |
+
"RLG",
|
| 484 |
+
"DAI",
|
| 485 |
+
"TAA",
|
| 486 |
+
"PDS",
|
| 487 |
+
"RDL",
|
| 488 |
+
"VTR",
|
| 489 |
+
"DAG",
|
| 490 |
+
"AEG",
|
| 491 |
+
"SLI",
|
| 492 |
+
"FKW",
|
| 493 |
+
"VAG",
|
| 494 |
+
"VAV",
|
| 495 |
+
"RFV",
|
| 496 |
+
"GAG",
|
| 497 |
+
"GLT",
|
| 498 |
+
"VKR",
|
| 499 |
+
"RQQ",
|
| 500 |
+
"RVE",
|
| 501 |
+
"KGE",
|
| 502 |
+
"TRF"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"16S rRNA methyltransferase (A1408)",
|
| 506 |
+
"16S rRNA methyltransferase (G1405)",
|
| 507 |
+
"AAC(2')",
|
| 508 |
+
"AAC(3)",
|
| 509 |
+
"AAC(6')",
|
| 510 |
+
"AAC(6');AAC(6')-Ib-cr",
|
| 511 |
+
"AAK beta-lactamase",
|
| 512 |
+
"ACC beta-lactamase",
|
| 513 |
+
"ACI beta-lactamase",
|
| 514 |
+
"ACT beta-lactamase",
|
| 515 |
+
"ACT beta-lactamase;CMY beta-lactamase;CTX-M beta-lactamase;IMP beta-lactamase;KPC beta-lactamase;MOX beta-lactamase;OXA beta-lactamase;OXA-1-like beta-lactamase;SHV beta-lactamase;TEM beta-lactamase;class A Mycobacterium abscessus beta-lactamase",
|
| 516 |
+
"ADC beta-lactamase with carbapenemase activity",
|
| 517 |
+
"ADC beta-lactamase without carbapenemase activity",
|
| 518 |
+
"ADC beta-lactamases pending classification for carbapenemase activity",
|
| 519 |
+
"AER beta-lactamase",
|
| 520 |
+
"AFM beta-lactamase",
|
| 521 |
+
"AIM beta-lactamase",
|
| 522 |
+
"ALG11 beta-lactamase",
|
| 523 |
+
"ALG6 beta-lactamases",
|
| 524 |
+
"ALI beta-lactamase",
|
| 525 |
+
"AMZ beta-lactamase",
|
| 526 |
+
"ANA beta-lactamase",
|
| 527 |
+
"ANT(2'')",
|
| 528 |
+
"ANT(3'')",
|
| 529 |
+
"ANT(4')",
|
| 530 |
+
"ANT(6)",
|
| 531 |
+
"ANT(9)",
|
| 532 |
+
"APH(2'')",
|
| 533 |
+
"APH(3'')",
|
| 534 |
+
"APH(3')",
|
| 535 |
+
"APH(4)",
|
| 536 |
+
"APH(6)",
|
| 537 |
+
"APH(7'')",
|
| 538 |
+
"APH(9)",
|
| 539 |
+
"AQU beta-lactamase",
|
| 540 |
+
"ARL Beta-lactamase",
|
| 541 |
+
"AST Beta-lactamase",
|
| 542 |
+
"ASU1 beta-lactamase",
|
| 543 |
+
"ATP-binding cassette (ABC) antibiotic efflux pump",
|
| 544 |
+
"ATP-binding cassette (ABC) antibiotic efflux pump;major facilitator superfamily (MFS) antibiotic efflux pump",
|
| 545 |
+
"ATP-binding cassette (ABC) antibiotic efflux pump;major facilitator superfamily (MFS) antibiotic efflux pump;resistance-nodulation-cell division (RND) antibiotic efflux pump",
|
| 546 |
+
"AXC beta-lactamase",
|
| 547 |
+
"B3SU1 beta-lactamase",
|
| 548 |
+
"B3SU2 beta-lactamase",
|
| 549 |
+
"BAT Beta-lactamase",
|
| 550 |
+
"BCL Beta-lactamase",
|
| 551 |
+
"BEL beta-lactamase",
|
| 552 |
+
"BES Beta-lactamase",
|
| 553 |
+
"BIC Beta-lactamase",
|
| 554 |
+
"BIL Beta-lactamase",
|
| 555 |
+
"BIM beta-lactamase",
|
| 556 |
+
"BJP beta-lactamase",
|
| 557 |
+
"BKC Beta-lactamase",
|
| 558 |
+
"BMHC beta-lactamase",
|
| 559 |
+
"BOR beta-lactamase",
|
| 560 |
+
"BPU Beta-lactamase",
|
| 561 |
+
"BRO Beta-lactamase",
|
| 562 |
+
"BSU beta-lactamase",
|
| 563 |
+
"BUT beta-lactamase",
|
| 564 |
+
"Bah amidohydrolase",
|
| 565 |
+
"BlaA beta-lactamase",
|
| 566 |
+
"BlaB beta-lactamase",
|
| 567 |
+
"BlaZ beta-lactamase",
|
| 568 |
+
"Bleomycin resistant protein",
|
| 569 |
+
"CAE beta-lactamase",
|
| 570 |
+
"CAM beta-lactamase",
|
| 571 |
+
"CAR beta-lactamase",
|
| 572 |
+
"CARB beta-lactamase",
|
| 573 |
+
"CAU beta-lactamase",
|
| 574 |
+
"CBP beta-lactamase",
|
| 575 |
+
"CDA beta-lactamase;CTX-M beta-lactamase;SHV beta-lactamase;TEM beta-lactamase",
|
| 576 |
+
"CDD beta-lactamase",
|
| 577 |
+
"CGA beta-lactamase",
|
| 578 |
+
"CGB beta-lactamase",
|
| 579 |
+
"CHM beta-lactamase",
|
| 580 |
+
"CIA beta-lactamase",
|
| 581 |
+
"CIM beta-lactamase",
|
| 582 |
+
"CKO beta-lactamase",
|
| 583 |
+
"CMA beta-lactamase",
|
| 584 |
+
"CME beta-lactamase",
|
| 585 |
+
"CMH beta-lactamase",
|
| 586 |
+
"CMY beta-lactamase",
|
| 587 |
+
"CMY beta-lactamase;CTX-M beta-lactamase;IMP beta-lactamase;KPC beta-lactamase;NDM beta-lactamase;OXA beta-lactamase;OXA-1-like beta-lactamase;OXA-48-like beta-lactamase;SHV beta-lactamase;VIM beta-lactamase",
|
| 588 |
+
"CPS beta-lactamase",
|
| 589 |
+
"CRD3 beta-lactamase",
|
| 590 |
+
"CRH beta-lactamase",
|
| 591 |
+
"CRP beta-lactamase",
|
| 592 |
+
"CSA beta-lactamase",
|
| 593 |
+
"CSP beta-lactamase",
|
| 594 |
+
"CTX-M beta-lactamase",
|
| 595 |
+
"CVI beta-lactamase",
|
| 596 |
+
"CblA beta-lactamase",
|
| 597 |
+
"CepA beta-lactamase",
|
| 598 |
+
"CepS beta-lactamase",
|
| 599 |
+
"CfiA beta-lactamase",
|
| 600 |
+
"Cfr 23S ribosomal RNA methyltransferase",
|
| 601 |
+
"CfxA beta-lactamase",
|
| 602 |
+
"CphA beta-lactamase",
|
| 603 |
+
"DES beta-lactamase",
|
| 604 |
+
"DHA beta-lactamase",
|
| 605 |
+
"DHT2 beta-lactamase",
|
| 606 |
+
"DIM beta-lactamase",
|
| 607 |
+
"DYB beta-lactamase",
|
| 608 |
+
"EAM beta-lactamase",
|
| 609 |
+
"EBR beta-lactamase",
|
| 610 |
+
"EC beta-lactamase",
|
| 611 |
+
"ECF transporter S component",
|
| 612 |
+
"ECM beta-lactamase",
|
| 613 |
+
"ECV beta-lactamase",
|
| 614 |
+
"EFM beta-lactamase",
|
| 615 |
+
"ELM beta-lactamase",
|
| 616 |
+
"ERP beta-lactamase",
|
| 617 |
+
"ESP beta-lactamase",
|
| 618 |
+
"EVM beta-lactamase",
|
| 619 |
+
"EXO beta-lactamase",
|
| 620 |
+
"Edeine acetyltransferase",
|
| 621 |
+
"Erm 23S ribosomal RNA methyltransferase",
|
| 622 |
+
"FAR beta-lactamase",
|
| 623 |
+
"FEZ beta-lactamase",
|
| 624 |
+
"FIA beta-lactamase",
|
| 625 |
+
"FIM beta-lactamase",
|
| 626 |
+
"FONA beta-lactamase",
|
| 627 |
+
"FOX beta-lactamase",
|
| 628 |
+
"FPH beta-lactamase",
|
| 629 |
+
"FRI beta-lactamase",
|
| 630 |
+
"FTU beta-lactamase",
|
| 631 |
+
"Fom phosphotransferase family",
|
| 632 |
+
"GES beta-lactamase",
|
| 633 |
+
"GIL beta-lactamase",
|
| 634 |
+
"GIM beta-lactamase",
|
| 635 |
+
"GMA beta-lactamase",
|
| 636 |
+
"GMB beta-lactamase",
|
| 637 |
+
"GOB beta-lactamase",
|
| 638 |
+
"GPC beta-lactamase",
|
| 639 |
+
"GRD23 beta-lactamase",
|
| 640 |
+
"GRD33 beta-lactamase",
|
| 641 |
+
"General Bacterial Porin with reduced permeability to beta-lactams",
|
| 642 |
+
"General Bacterial Porin with reduced permeability to beta-lactams;resistance-nodulation-cell division (RND) antibiotic efflux pump",
|
| 643 |
+
"General Bacterial Porin with reduced permeability to peptide antibiotics",
|
| 644 |
+
"HBL beta-lactamase",
|
| 645 |
+
"HER beta-lactamase",
|
| 646 |
+
"HMB beta-lactamase",
|
| 647 |
+
"IDC beta-lactamase",
|
| 648 |
+
"IMI beta-lactamase",
|
| 649 |
+
"IMP beta-lactamase",
|
| 650 |
+
"IND beta-lactamase",
|
| 651 |
+
"Intrinsic peptide antibiotic resistant Lps",
|
| 652 |
+
"JOHN beta-lactamase",
|
| 653 |
+
"KBL beta-lactamase",
|
| 654 |
+
"KHM beta-lactamase",
|
| 655 |
+
"KLUC beta-lactamase",
|
| 656 |
+
"KPC beta-lactamase",
|
| 657 |
+
"L1 family beta-lactamase",
|
| 658 |
+
"LAP beta-lactamase",
|
| 659 |
+
"LAQ beta lactamase",
|
| 660 |
+
"LCR beta-lactamase",
|
| 661 |
+
"LEN beta-lactamase",
|
| 662 |
+
"LHK beta-lactamase",
|
| 663 |
+
"LMB beta-lactamase",
|
| 664 |
+
"LRG beta-lactamase",
|
| 665 |
+
"LUS beta-lactamase",
|
| 666 |
+
"LUT beta-lactamase",
|
| 667 |
+
"Llm 23S ribosomal RNA methyltransferase",
|
| 668 |
+
"MAL beta-lactamase",
|
| 669 |
+
"MBL beta-lactamase",
|
| 670 |
+
"MCR phosphoethanolamine transferase",
|
| 671 |
+
"MIR beta-lactamase",
|
| 672 |
+
"MOC beta-lactamase",
|
| 673 |
+
"MOR beta-lactamase",
|
| 674 |
+
"MOX beta-lactamase",
|
| 675 |
+
"MSI beta-lactamase",
|
| 676 |
+
"MSI-OXA family beta-lactamase",
|
| 677 |
+
"MUN beta-lactamase",
|
| 678 |
+
"MUS beta-lactamase",
|
| 679 |
+
"MYO beta-lactamase",
|
| 680 |
+
"MYX beta-lactamase",
|
| 681 |
+
"Miscellaneous ABC-F subfamily ATP-binding cassette ribosomal protection proteins",
|
| 682 |
+
"NDM beta-lactamase",
|
| 683 |
+
"NPS beta-lactamase",
|
| 684 |
+
"NWM beta-lactamase",
|
| 685 |
+
"OCH beta-lactamase",
|
| 686 |
+
"OHIO beta-lactamase",
|
| 687 |
+
"OKP beta-lactamase",
|
| 688 |
+
"ORN beta-lactamase",
|
| 689 |
+
"ORR beta-lactamase",
|
| 690 |
+
"OXA beta-lactamase",
|
| 691 |
+
"OXA beta-lactamase;OXA-1-like beta-lactamase",
|
| 692 |
+
"OXA beta-lactamase;OXA-10-like beta-lactamase",
|
| 693 |
+
"OXA beta-lactamase;OXA-114-like beta-lactamase",
|
| 694 |
+
"OXA beta-lactamase;OXA-12-like beta-lactamase",
|
| 695 |
+
"OXA beta-lactamase;OXA-134-like beta-lactamase",
|
| 696 |
+
"OXA beta-lactamase;OXA-143-like beta-lactamase",
|
| 697 |
+
"OXA beta-lactamase;OXA-184-like beta-lactamase",
|
| 698 |
+
"OXA beta-lactamase;OXA-198-like beta-lactamase",
|
| 699 |
+
"OXA beta-lactamase;OXA-2-like beta-lactamase",
|
| 700 |
+
"OXA beta-lactamase;OXA-211-like beta-lactamase",
|
| 701 |
+
"OXA beta-lactamase;OXA-213-like beta-lactamase",
|
| 702 |
+
"OXA beta-lactamase;OXA-214-like beta-lactamase",
|
| 703 |
+
"OXA beta-lactamase;OXA-22-like beta-lactamase",
|
| 704 |
+
"OXA beta-lactamase;OXA-229-like beta-lactamase",
|
| 705 |
+
"OXA beta-lactamase;OXA-23-like beta-lactamase",
|
| 706 |
+
"OXA beta-lactamase;OXA-24-like beta-lactamase",
|
| 707 |
+
"OXA beta-lactamase;OXA-266-like beta-lactamase",
|
| 708 |
+
"OXA beta-lactamase;OXA-274-like beta-lactamase",
|
| 709 |
+
"OXA beta-lactamase;OXA-286-like beta-lactamase",
|
| 710 |
+
"OXA beta-lactamase;OXA-294-like beta-lactamase",
|
| 711 |
+
"OXA beta-lactamase;OXA-364-like beta-lactamase",
|
| 712 |
+
"OXA beta-lactamase;OXA-372-like beta-lactamase",
|
| 713 |
+
"OXA beta-lactamase;OXA-42-like beta-lactamase",
|
| 714 |
+
"OXA beta-lactamase;OXA-427-like beta-lactamase",
|
| 715 |
+
"OXA beta-lactamase;OXA-46-like beta-lactamase",
|
| 716 |
+
"OXA beta-lactamase;OXA-48-like beta-lactamase",
|
| 717 |
+
"OXA beta-lactamase;OXA-493-like beta-lactamase",
|
| 718 |
+
"OXA beta-lactamase;OXA-5-like beta-lactamase",
|
| 719 |
+
"OXA beta-lactamase;OXA-50-like beta-lactamase",
|
| 720 |
+
"OXA beta-lactamase;OXA-51-like beta-lactamase",
|
| 721 |
+
"OXA beta-lactamase;OXA-548-like beta-lactamase",
|
| 722 |
+
"OXA beta-lactamase;OXA-55-like beta-lactamase",
|
| 723 |
+
"OXA beta-lactamase;OXA-58-like beta-lactamase",
|
| 724 |
+
"OXA beta-lactamase;OXA-60-like beta-lactamase",
|
| 725 |
+
"OXA beta-lactamase;OXA-61-like beta-lactamase",
|
| 726 |
+
"OXA beta-lactamase;OXA-62-like beta-lactamase",
|
| 727 |
+
"OXA beta-lactamase;OXA-63-like beta-lactamase",
|
| 728 |
+
"OXA beta-lactamase;OXA-679-like beta-lactamase",
|
| 729 |
+
"OXA beta-lactamase;OXA-727-like beta-lactamase",
|
| 730 |
+
"OXA beta-lactamase;OXA-9-like beta-lactamase",
|
| 731 |
+
"OXY beta-lactamase",
|
| 732 |
+
"Outer Membrane Porin (Opr)",
|
| 733 |
+
"Outer Membrane Porin (Opr);resistance-nodulation-cell division (RND) antibiotic efflux pump",
|
| 734 |
+
"PAC beta-lactamase",
|
| 735 |
+
"PAD beta-lactamase",
|
| 736 |
+
"PAM beta-lactamase",
|
| 737 |
+
"PAU beta-lactamase",
|
| 738 |
+
"PDC beta-lactamase",
|
| 739 |
+
"PEN-A beta-lactamase",
|
| 740 |
+
"PEN-B beta-lactamase",
|
| 741 |
+
"PER beta-lactamase",
|
| 742 |
+
"PFM beta-lactamase",
|
| 743 |
+
"PJM beta-lactamase",
|
| 744 |
+
"PLA beta-lactamase",
|
| 745 |
+
"PLN beta-lactamase",
|
| 746 |
+
"PME beta-lactamase",
|
| 747 |
+
"PNC beta-lactamase",
|
| 748 |
+
"PNGM beta-lactamase",
|
| 749 |
+
"POM beta-lactamase",
|
| 750 |
+
"PRC beta-lactamase",
|
| 751 |
+
"PST beta-lactamase",
|
| 752 |
+
"PSV beta-lactamase",
|
| 753 |
+
"PSZ beta-lactamase",
|
| 754 |
+
"R39 beta-lactamase",
|
| 755 |
+
"RAA beta-lactamase",
|
| 756 |
+
"RAD beta-lactamase",
|
| 757 |
+
"RAHN beta-lactamase",
|
| 758 |
+
"RASA beta-lactamase",
|
| 759 |
+
"RATA beta-lactamase",
|
| 760 |
+
"RCP beta-lactamase",
|
| 761 |
+
"ROB beta-lactamase",
|
| 762 |
+
"RSA beta-lactamase",
|
| 763 |
+
"RSA2 beta-lactamase",
|
| 764 |
+
"RSC1 beta-lactamase",
|
| 765 |
+
"RSD1",
|
| 766 |
+
"RSD2 beta-lactamase",
|
| 767 |
+
"RUB beta-lactamase",
|
| 768 |
+
"RbpA bacterial RNA polymerase-binding protein",
|
| 769 |
+
"Rm3 family beta-lactamase",
|
| 770 |
+
"SCO beta-lactamase",
|
| 771 |
+
"SED beta-lactamase",
|
| 772 |
+
"SFC beta-lactamase",
|
| 773 |
+
"SFDC beta-lactamase",
|
| 774 |
+
"SFH beta-lactamase",
|
| 775 |
+
"SFO beta-lactamase",
|
| 776 |
+
"SGM beta-lactamase",
|
| 777 |
+
"SHD beta-lactamase",
|
| 778 |
+
"SHN beta-lactamase",
|
| 779 |
+
"SHV beta-lactamase",
|
| 780 |
+
"SHW beta-lactamase",
|
| 781 |
+
"SIE beta-lactamase",
|
| 782 |
+
"SIM beta-lactamase",
|
| 783 |
+
"SMB beta-lactamase",
|
| 784 |
+
"SME beta-lactamase",
|
| 785 |
+
"SPG beta-lactamase",
|
| 786 |
+
"SPM beta-lactamase",
|
| 787 |
+
"SPN79 beta-lactamase",
|
| 788 |
+
"SPR beta-lactamase",
|
| 789 |
+
"SPS beta-lactamase",
|
| 790 |
+
"SPU beta-lactamase",
|
| 791 |
+
"SRT beta-lactamase",
|
| 792 |
+
"SSA beta-lactamase",
|
| 793 |
+
"SST beta-lactamase",
|
| 794 |
+
"STA beta-lactamase",
|
| 795 |
+
"Serine/threonine kinases",
|
| 796 |
+
"Subclass B1 Vibrio cholerae varG beta-lactamase",
|
| 797 |
+
"TEM beta-lactamase",
|
| 798 |
+
"TER beta-lactamase",
|
| 799 |
+
"THIN-B beta-lactamase",
|
| 800 |
+
"TLA beta-lactamase",
|
| 801 |
+
"TMB beta-lactamase",
|
| 802 |
+
"TRU beta-lactamase",
|
| 803 |
+
"TTU beta-lactamase",
|
| 804 |
+
"TUS beta-lactamase",
|
| 805 |
+
"Target protecting FusB-type protein conferring resistance to Fusidic acid",
|
| 806 |
+
"VAM beta-lactamase",
|
| 807 |
+
"VCC beta-lactamase",
|
| 808 |
+
"VEB beta-lactamase",
|
| 809 |
+
"VHH beta-lactamase",
|
| 810 |
+
"VHW beta-lactamase",
|
| 811 |
+
"VIM beta-lactamase",
|
| 812 |
+
"VMB beta-lactamase",
|
| 813 |
+
"Van ligase;glycopeptide resistance gene cluster",
|
| 814 |
+
"WUS beta-lactamase",
|
| 815 |
+
"YEM beta-lactamase",
|
| 816 |
+
"YOC beta-lactamase",
|
| 817 |
+
"YRC beta-lactamase",
|
| 818 |
+
"ZOG beta-lactamase",
|
| 819 |
+
"alm glycyl carrier protein;polymyxin resistance operon",
|
| 820 |
+
"alm glycyltransferase;polymyxin resistance operon",
|
| 821 |
+
"aminoglycoside bifunctional resistance protein",
|
| 822 |
+
"ampC-type beta-lactamase",
|
| 823 |
+
"antibiotic-resistant isoleucyl-tRNA synthetase (ileS)",
|
| 824 |
+
"antibiotic-resistant murA transferase",
|
| 825 |
+
"blaF family beta-lactamase",
|
| 826 |
+
"blaS",
|
| 827 |
+
"capreomycin phosphotransferase",
|
| 828 |
+
"chloramphenicol acetyltransferase (CAT)",
|
| 829 |
+
"chloramphenicol phosphotransferase",
|
| 830 |
+
"class A Bacillus anthracis Bla beta-lactamase",
|
| 831 |
+
"class A Bacillus cereus Bc beta-lactamase",
|
| 832 |
+
"class A LRA beta-lactamase",
|
| 833 |
+
"class A Mycobacterium abscessus beta-lactamase",
|
| 834 |
+
"class A Mycobacterium tuberculosis bla beta-lactamase",
|
| 835 |
+
"class C LRA beta-lactamase",
|
| 836 |
+
"class C LRA beta-lactamase;class D LRA beta-lactamase",
|
| 837 |
+
"cpa acetyltransferase",
|
| 838 |
+
"defensin resistant mprF",
|
| 839 |
+
"fosC phosphotransferase family",
|
| 840 |
+
"fosfomycin thiol transferase",
|
| 841 |
+
"fusidic acid inactivation enzyme",
|
| 842 |
+
"gimA family macrolide glycosyltransferase",
|
| 843 |
+
"glycopeptide resistance gene cluster;vanH",
|
| 844 |
+
"glycopeptide resistance gene cluster;vanK",
|
| 845 |
+
"glycopeptide resistance gene cluster;vanR",
|
| 846 |
+
"glycopeptide resistance gene cluster;vanS",
|
| 847 |
+
"glycopeptide resistance gene cluster;vanT",
|
| 848 |
+
"glycopeptide resistance gene cluster;vanU",
|
| 849 |
+
"glycopeptide resistance gene cluster;vanV",
|
| 850 |
+
"glycopeptide resistance gene cluster;vanW",
|
| 851 |
+
"glycopeptide resistance gene cluster;vanX",
|
| 852 |
+
"glycopeptide resistance gene cluster;vanXY",
|
| 853 |
+
"glycopeptide resistance gene cluster;vanY",
|
| 854 |
+
"glycopeptide resistance gene cluster;vanZ",
|
| 855 |
+
"helicase-like RNA polymerase protection protein",
|
| 856 |
+
"intrinsic colistin resistant phosphoethanolamine transferase",
|
| 857 |
+
"kdpDE",
|
| 858 |
+
"lincosamide nucleotidyltransferase (LNU)",
|
| 859 |
+
"lipid A acyltransferase;polymyxin resistance operon",
|
| 860 |
+
"lipid A phosphatase",
|
| 861 |
+
"lsa-type ABC-F protein",
|
| 862 |
+
"macrolide esterase",
|
| 863 |
+
"macrolide phosphotransferase (MPH)",
|
| 864 |
+
"major facilitator superfamily (MFS) antibiotic efflux pump",
|
| 865 |
+
"major facilitator superfamily (MFS) antibiotic efflux pump;resistance-nodulation-cell division (RND) antibiotic efflux pump",
|
| 866 |
+
"metal transporters with antibiotic efflux",
|
| 867 |
+
"methicillin resistant PBP2",
|
| 868 |
+
"mgt macrolide glycotransferase",
|
| 869 |
+
"msr-type ABC-F protein",
|
| 870 |
+
"multidrug and toxic compound extrusion (MATE) transporter",
|
| 871 |
+
"nitroimidazole reductase",
|
| 872 |
+
"non-erm 23S ribosomal RNA methyltransferase (A1067)",
|
| 873 |
+
"non-erm 23S ribosomal RNA methyltransferase (G748)",
|
| 874 |
+
"ole glycosyltransferase",
|
| 875 |
+
"pmr phosphoethanolamine transferase",
|
| 876 |
+
"quinolone resistance protein (qnr)",
|
| 877 |
+
"resistance-nodulation-cell division (RND) antibiotic efflux pump",
|
| 878 |
+
"rifampin ADP-ribosyltransferase (Arr)",
|
| 879 |
+
"rifampin glycosyltransferase",
|
| 880 |
+
"rifampin monooxygenase",
|
| 881 |
+
"rifampin phosphotransferase",
|
| 882 |
+
"rifamycin-resistant beta-subunit of RNA polymerase (rpoB)",
|
| 883 |
+
"sal-type ABC-F protein",
|
| 884 |
+
"small multidrug resistance (SMR) antibiotic efflux pump",
|
| 885 |
+
"streptogramin vat acetyltransferase",
|
| 886 |
+
"streptogramin vgb lyase",
|
| 887 |
+
"streptothricin acetyltransferase (SAT)",
|
| 888 |
+
"subclass B1 Bacillus anthracis Bla beta-lactamase",
|
| 889 |
+
"subclass B1 Bacillus cereus Bc beta-lactamase",
|
| 890 |
+
"subclass B1 Bacteroides xylanisolvens crx beta-lactamase",
|
| 891 |
+
"subclass B1 PEDO beta-lactamase",
|
| 892 |
+
"subclass B3 LRA beta-lactamase",
|
| 893 |
+
"subclass B3 PEDO beta-lactamase",
|
| 894 |
+
"sulfonamide resistant sul",
|
| 895 |
+
"tetracycline inactivation enzyme",
|
| 896 |
+
"tetracycline-resistant ribosomal protection protein",
|
| 897 |
+
"trimethoprim resistant dihydrofolate reductase dfr",
|
| 898 |
+
"tunicamycin resistance protein",
|
| 899 |
+
"undecaprenyl pyrophosphate related proteins",
|
| 900 |
+
"vanJ membrane protein",
|
| 901 |
+
"vga-type ABC-F protein",
|
| 902 |
+
"viomycin phosphotransferase"
|
| 903 |
+
],
|
| 904 |
+
"task_type": "multiclass",
|
| 905 |
+
"target": "gene_family",
|
| 906 |
+
"k": 3,
|
| 907 |
+
"max_features": 500,
|
| 908 |
+
"n_samples": 6054,
|
| 909 |
+
"n_features": 500,
|
| 910 |
+
"n_classes": 398
|
| 911 |
+
}
|
data_processed/card/card_gene_family_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:81f1c621f72580ad146f53f64427bcdd7a1341a413a9e18524d1a3c429e2e58a
|
| 3 |
+
size 9816
|
data_processed/card/card_gene_family_y_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c9e2e28392fef8a81d8cee68b38bc2abbf7d6ab8794a8eb60a0479b7e71b849c
|
| 3 |
+
size 34024
|
data_processed/card/card_gene_family_y_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6ce18e6526a29f25156ee6e8de04ed9dde10ca095006621e48074c37724addab
|
| 3 |
+
size 4976
|
data_processed/card/card_mechanism_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:37754e2ffccec906083a21935e46f862471a3945ffa65b46fea2c87ae191bb5d
|
| 3 |
+
size 4844128
|
data_processed/card/card_mechanism_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7053e2e0734de9ae8c46483fad2fb11c8d181f371c26d250cbfe5eea2abe3355
|
| 3 |
+
size 16948128
|
data_processed/card/card_mechanism_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:562364532d2014e72213a80cd2191590b9963132f29c09843abf6b6a1fc6df1b
|
| 3 |
+
size 2424128
|
data_processed/card/card_mechanism_metadata.json
ADDED
|
@@ -0,0 +1,523 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"AAA",
|
| 4 |
+
"IGL",
|
| 5 |
+
"ALA",
|
| 6 |
+
"KTG",
|
| 7 |
+
"LLL",
|
| 8 |
+
"AAL",
|
| 9 |
+
"LAA",
|
| 10 |
+
"ISL",
|
| 11 |
+
"RLD",
|
| 12 |
+
"GLE",
|
| 13 |
+
"TGS",
|
| 14 |
+
"LEQ",
|
| 15 |
+
"ALG",
|
| 16 |
+
"AST",
|
| 17 |
+
"GPL",
|
| 18 |
+
"DLA",
|
| 19 |
+
"ALQ",
|
| 20 |
+
"SIG",
|
| 21 |
+
"AAV",
|
| 22 |
+
"RDT",
|
| 23 |
+
"LDA",
|
| 24 |
+
"PAS",
|
| 25 |
+
"LAT",
|
| 26 |
+
"LGL",
|
| 27 |
+
"ALL",
|
| 28 |
+
"STF",
|
| 29 |
+
"LAR",
|
| 30 |
+
"LDR",
|
| 31 |
+
"SAA",
|
| 32 |
+
"VDA",
|
| 33 |
+
"IPG",
|
| 34 |
+
"TFK",
|
| 35 |
+
"LPA",
|
| 36 |
+
"GDA",
|
| 37 |
+
"PLK",
|
| 38 |
+
"AVL",
|
| 39 |
+
"LAQ",
|
| 40 |
+
"LAV",
|
| 41 |
+
"GYG",
|
| 42 |
+
"VAL",
|
| 43 |
+
"ATL",
|
| 44 |
+
"LFG",
|
| 45 |
+
"ANL",
|
| 46 |
+
"ALI",
|
| 47 |
+
"LAS",
|
| 48 |
+
"QAL",
|
| 49 |
+
"GLG",
|
| 50 |
+
"TLL",
|
| 51 |
+
"GLP",
|
| 52 |
+
"LPL",
|
| 53 |
+
"VAF",
|
| 54 |
+
"GFG",
|
| 55 |
+
"SLL",
|
| 56 |
+
"RVG",
|
| 57 |
+
"AVA",
|
| 58 |
+
"LSA",
|
| 59 |
+
"VLA",
|
| 60 |
+
"LAN",
|
| 61 |
+
"SDN",
|
| 62 |
+
"LTA",
|
| 63 |
+
"PAL",
|
| 64 |
+
"SLK",
|
| 65 |
+
"TTG",
|
| 66 |
+
"AGG",
|
| 67 |
+
"LDL",
|
| 68 |
+
"YGN",
|
| 69 |
+
"DDR",
|
| 70 |
+
"TLA",
|
| 71 |
+
"LLA",
|
| 72 |
+
"AIP",
|
| 73 |
+
"GLA",
|
| 74 |
+
"QRL",
|
| 75 |
+
"GWE",
|
| 76 |
+
"GVK",
|
| 77 |
+
"ATY",
|
| 78 |
+
"LLD",
|
| 79 |
+
"GLF",
|
| 80 |
+
"LLR",
|
| 81 |
+
"IAA",
|
| 82 |
+
"LGW",
|
| 83 |
+
"GSV",
|
| 84 |
+
"RIG",
|
| 85 |
+
"ERL",
|
| 86 |
+
"SKE",
|
| 87 |
+
"YGV",
|
| 88 |
+
"TAG",
|
| 89 |
+
"EQQ",
|
| 90 |
+
"NLL",
|
| 91 |
+
"ELG",
|
| 92 |
+
"RFP",
|
| 93 |
+
"LEK",
|
| 94 |
+
"LLT",
|
| 95 |
+
"LKR",
|
| 96 |
+
"GST",
|
| 97 |
+
"AGL",
|
| 98 |
+
"GGL",
|
| 99 |
+
"SVS",
|
| 100 |
+
"LVD",
|
| 101 |
+
"FGA",
|
| 102 |
+
"LSG",
|
| 103 |
+
"KAL",
|
| 104 |
+
"DER",
|
| 105 |
+
"KRL",
|
| 106 |
+
"TLG",
|
| 107 |
+
"LAG",
|
| 108 |
+
"VGD",
|
| 109 |
+
"TPA",
|
| 110 |
+
"NAL",
|
| 111 |
+
"TYT",
|
| 112 |
+
"SAI",
|
| 113 |
+
"TLF",
|
| 114 |
+
"AIA",
|
| 115 |
+
"GGP",
|
| 116 |
+
"LAL",
|
| 117 |
+
"LGD",
|
| 118 |
+
"LGS",
|
| 119 |
+
"ALV",
|
| 120 |
+
"ERF",
|
| 121 |
+
"LTG",
|
| 122 |
+
"PVT",
|
| 123 |
+
"TAT",
|
| 124 |
+
"AER",
|
| 125 |
+
"LVI",
|
| 126 |
+
"LGI",
|
| 127 |
+
"LKG",
|
| 128 |
+
"DTT",
|
| 129 |
+
"AAN",
|
| 130 |
+
"LTL",
|
| 131 |
+
"GVL",
|
| 132 |
+
"EIG",
|
| 133 |
+
"ARS",
|
| 134 |
+
"NTA",
|
| 135 |
+
"LFE",
|
| 136 |
+
"TTP",
|
| 137 |
+
"LAE",
|
| 138 |
+
"VPA",
|
| 139 |
+
"QTL",
|
| 140 |
+
"VGP",
|
| 141 |
+
"VSK",
|
| 142 |
+
"AGN",
|
| 143 |
+
"ASA",
|
| 144 |
+
"EAA",
|
| 145 |
+
"ASK",
|
| 146 |
+
"RAS",
|
| 147 |
+
"RLY",
|
| 148 |
+
"VTP",
|
| 149 |
+
"QLG",
|
| 150 |
+
"GIA",
|
| 151 |
+
"GLV",
|
| 152 |
+
"VTA",
|
| 153 |
+
"ADI",
|
| 154 |
+
"KSL",
|
| 155 |
+
"AAR",
|
| 156 |
+
"SQR",
|
| 157 |
+
"EQL",
|
| 158 |
+
"MKA",
|
| 159 |
+
"APA",
|
| 160 |
+
"ASL",
|
| 161 |
+
"TTT",
|
| 162 |
+
"TSA",
|
| 163 |
+
"YRQ",
|
| 164 |
+
"QGL",
|
| 165 |
+
"SYG",
|
| 166 |
+
"TAF",
|
| 167 |
+
"ATT",
|
| 168 |
+
"LLS",
|
| 169 |
+
"ADL",
|
| 170 |
+
"PLQ",
|
| 171 |
+
"AAS",
|
| 172 |
+
"TLP",
|
| 173 |
+
"SKT",
|
| 174 |
+
"GKA",
|
| 175 |
+
"IGD",
|
| 176 |
+
"PLL",
|
| 177 |
+
"KAS",
|
| 178 |
+
"ALE",
|
| 179 |
+
"LLF",
|
| 180 |
+
"KTF",
|
| 181 |
+
"RRI",
|
| 182 |
+
"PAP",
|
| 183 |
+
"AQA",
|
| 184 |
+
"VLV",
|
| 185 |
+
"ALP",
|
| 186 |
+
"PAD",
|
| 187 |
+
"AVI",
|
| 188 |
+
"AIS",
|
| 189 |
+
"AYA",
|
| 190 |
+
"LGG",
|
| 191 |
+
"AIL",
|
| 192 |
+
"LIG",
|
| 193 |
+
"DAE",
|
| 194 |
+
"YVA",
|
| 195 |
+
"LVG",
|
| 196 |
+
"GAA",
|
| 197 |
+
"SLG",
|
| 198 |
+
"LNA",
|
| 199 |
+
"SAL",
|
| 200 |
+
"STL",
|
| 201 |
+
"DRP",
|
| 202 |
+
"IAR",
|
| 203 |
+
"PAG",
|
| 204 |
+
"ATA",
|
| 205 |
+
"DMT",
|
| 206 |
+
"MKK",
|
| 207 |
+
"IVA",
|
| 208 |
+
"DEV",
|
| 209 |
+
"DLL",
|
| 210 |
+
"QPQ",
|
| 211 |
+
"QLA",
|
| 212 |
+
"PGD",
|
| 213 |
+
"DGK",
|
| 214 |
+
"LLN",
|
| 215 |
+
"NDI",
|
| 216 |
+
"QDK",
|
| 217 |
+
"IAD",
|
| 218 |
+
"NKT",
|
| 219 |
+
"DKT",
|
| 220 |
+
"KLA",
|
| 221 |
+
"TGA",
|
| 222 |
+
"MLN",
|
| 223 |
+
"EAY",
|
| 224 |
+
"PGM",
|
| 225 |
+
"YSN",
|
| 226 |
+
"ARL",
|
| 227 |
+
"WQP",
|
| 228 |
+
"YTA",
|
| 229 |
+
"LCG",
|
| 230 |
+
"FTA",
|
| 231 |
+
"GAV",
|
| 232 |
+
"ANK",
|
| 233 |
+
"LEG",
|
| 234 |
+
"FPD",
|
| 235 |
+
"YPN",
|
| 236 |
+
"VQP",
|
| 237 |
+
"VGW",
|
| 238 |
+
"AVQ",
|
| 239 |
+
"KTL",
|
| 240 |
+
"LKI",
|
| 241 |
+
"LKA",
|
| 242 |
+
"DLV",
|
| 243 |
+
"ILS",
|
| 244 |
+
"ISA",
|
| 245 |
+
"GNT",
|
| 246 |
+
"FSY",
|
| 247 |
+
"ALD",
|
| 248 |
+
"YGL",
|
| 249 |
+
"SNP",
|
| 250 |
+
"QAG",
|
| 251 |
+
"PSI",
|
| 252 |
+
"QYS",
|
| 253 |
+
"GNA",
|
| 254 |
+
"LGV",
|
| 255 |
+
"IGS",
|
| 256 |
+
"GDK",
|
| 257 |
+
"KIS",
|
| 258 |
+
"KAE",
|
| 259 |
+
"SVQ",
|
| 260 |
+
"FWL",
|
| 261 |
+
"PGP",
|
| 262 |
+
"LLG",
|
| 263 |
+
"AEL",
|
| 264 |
+
"AYG",
|
| 265 |
+
"ETL",
|
| 266 |
+
"PLA",
|
| 267 |
+
"QQG",
|
| 268 |
+
"KSG",
|
| 269 |
+
"ARR",
|
| 270 |
+
"LVT",
|
| 271 |
+
"MTL",
|
| 272 |
+
"DAA",
|
| 273 |
+
"VLL",
|
| 274 |
+
"GYA",
|
| 275 |
+
"MAV",
|
| 276 |
+
"RLL",
|
| 277 |
+
"GKP",
|
| 278 |
+
"GAL",
|
| 279 |
+
"KLL",
|
| 280 |
+
"VKT",
|
| 281 |
+
"APL",
|
| 282 |
+
"FGY",
|
| 283 |
+
"NEA",
|
| 284 |
+
"TLR",
|
| 285 |
+
"LQF",
|
| 286 |
+
"ITP",
|
| 287 |
+
"NPS",
|
| 288 |
+
"IKK",
|
| 289 |
+
"VAI",
|
| 290 |
+
"YAK",
|
| 291 |
+
"LNK",
|
| 292 |
+
"GMT",
|
| 293 |
+
"EIK",
|
| 294 |
+
"QWQ",
|
| 295 |
+
"ALS",
|
| 296 |
+
"GWV",
|
| 297 |
+
"DTP",
|
| 298 |
+
"SEK",
|
| 299 |
+
"VPG",
|
| 300 |
+
"LLI",
|
| 301 |
+
"LDD",
|
| 302 |
+
"KKS",
|
| 303 |
+
"FPA",
|
| 304 |
+
"LRF",
|
| 305 |
+
"KEL",
|
| 306 |
+
"LLK",
|
| 307 |
+
"ILA",
|
| 308 |
+
"HKT",
|
| 309 |
+
"AGE",
|
| 310 |
+
"GPG",
|
| 311 |
+
"YGK",
|
| 312 |
+
"IAG",
|
| 313 |
+
"VMK",
|
| 314 |
+
"GIV",
|
| 315 |
+
"APQ",
|
| 316 |
+
"GSR",
|
| 317 |
+
"VPL",
|
| 318 |
+
"GIS",
|
| 319 |
+
"ARA",
|
| 320 |
+
"FAA",
|
| 321 |
+
"GDE",
|
| 322 |
+
"VIY",
|
| 323 |
+
"IAL",
|
| 324 |
+
"ADK",
|
| 325 |
+
"AND",
|
| 326 |
+
"DRA",
|
| 327 |
+
"QQV",
|
| 328 |
+
"STN",
|
| 329 |
+
"NAE",
|
| 330 |
+
"GVA",
|
| 331 |
+
"PLD",
|
| 332 |
+
"LPF",
|
| 333 |
+
"QFP",
|
| 334 |
+
"DIA",
|
| 335 |
+
"VPE",
|
| 336 |
+
"KDQ",
|
| 337 |
+
"TYA",
|
| 338 |
+
"TFT",
|
| 339 |
+
"DVP",
|
| 340 |
+
"STS",
|
| 341 |
+
"ADE",
|
| 342 |
+
"VNP",
|
| 343 |
+
"PVY",
|
| 344 |
+
"KDD",
|
| 345 |
+
"VYQ",
|
| 346 |
+
"ELA",
|
| 347 |
+
"GEA",
|
| 348 |
+
"LRK",
|
| 349 |
+
"TGW",
|
| 350 |
+
"NDL",
|
| 351 |
+
"ARV",
|
| 352 |
+
"GPA",
|
| 353 |
+
"EKH",
|
| 354 |
+
"QIA",
|
| 355 |
+
"AIT",
|
| 356 |
+
"AMA",
|
| 357 |
+
"AAD",
|
| 358 |
+
"IYA",
|
| 359 |
+
"TRL",
|
| 360 |
+
"YAQ",
|
| 361 |
+
"EQT",
|
| 362 |
+
"LEL",
|
| 363 |
+
"TNG",
|
| 364 |
+
"GMA",
|
| 365 |
+
"LIA",
|
| 366 |
+
"TGV",
|
| 367 |
+
"GKV",
|
| 368 |
+
"DLG",
|
| 369 |
+
"AQT",
|
| 370 |
+
"KLS",
|
| 371 |
+
"FVP",
|
| 372 |
+
"AQG",
|
| 373 |
+
"LNE",
|
| 374 |
+
"PIS",
|
| 375 |
+
"IVM",
|
| 376 |
+
"GAY",
|
| 377 |
+
"PET",
|
| 378 |
+
"QDL",
|
| 379 |
+
"AYV",
|
| 380 |
+
"ATV",
|
| 381 |
+
"LVL",
|
| 382 |
+
"VML",
|
| 383 |
+
"NGF",
|
| 384 |
+
"LLQ",
|
| 385 |
+
"VFK",
|
| 386 |
+
"ERI",
|
| 387 |
+
"GDM",
|
| 388 |
+
"EKN",
|
| 389 |
+
"AFV",
|
| 390 |
+
"QVL",
|
| 391 |
+
"FVD",
|
| 392 |
+
"VQD",
|
| 393 |
+
"MTV",
|
| 394 |
+
"FEL",
|
| 395 |
+
"EVK",
|
| 396 |
+
"AKS",
|
| 397 |
+
"GSQ",
|
| 398 |
+
"LFA",
|
| 399 |
+
"IKA",
|
| 400 |
+
"AAT",
|
| 401 |
+
"ELN",
|
| 402 |
+
"LGA",
|
| 403 |
+
"VAE",
|
| 404 |
+
"VKA",
|
| 405 |
+
"KIA",
|
| 406 |
+
"PEL",
|
| 407 |
+
"DNT",
|
| 408 |
+
"KAA",
|
| 409 |
+
"LYA",
|
| 410 |
+
"RKL",
|
| 411 |
+
"ADR",
|
| 412 |
+
"LPV",
|
| 413 |
+
"AEA",
|
| 414 |
+
"EER",
|
| 415 |
+
"KVA",
|
| 416 |
+
"KAN",
|
| 417 |
+
"ASI",
|
| 418 |
+
"LQA",
|
| 419 |
+
"WLV",
|
| 420 |
+
"VIL",
|
| 421 |
+
"PLR",
|
| 422 |
+
"AFS",
|
| 423 |
+
"PHY",
|
| 424 |
+
"HRI",
|
| 425 |
+
"VTE",
|
| 426 |
+
"DRL",
|
| 427 |
+
"AVK",
|
| 428 |
+
"FIP",
|
| 429 |
+
"VVA",
|
| 430 |
+
"RIA",
|
| 431 |
+
"AAI",
|
| 432 |
+
"YQG",
|
| 433 |
+
"LTN",
|
| 434 |
+
"YLA",
|
| 435 |
+
"AFL",
|
| 436 |
+
"ERV",
|
| 437 |
+
"LQP",
|
| 438 |
+
"QVG",
|
| 439 |
+
"GQP",
|
| 440 |
+
"GRR",
|
| 441 |
+
"HPE",
|
| 442 |
+
"LQG",
|
| 443 |
+
"LNL",
|
| 444 |
+
"SGG",
|
| 445 |
+
"TPQ",
|
| 446 |
+
"SYV",
|
| 447 |
+
"WVV",
|
| 448 |
+
"QAQ",
|
| 449 |
+
"DSV",
|
| 450 |
+
"ARG",
|
| 451 |
+
"NST",
|
| 452 |
+
"TPE",
|
| 453 |
+
"RPL",
|
| 454 |
+
"HYF",
|
| 455 |
+
"EQI",
|
| 456 |
+
"LVA",
|
| 457 |
+
"RSL",
|
| 458 |
+
"QQL",
|
| 459 |
+
"YFT",
|
| 460 |
+
"APG",
|
| 461 |
+
"GEL",
|
| 462 |
+
"FDG",
|
| 463 |
+
"SGL",
|
| 464 |
+
"SGA",
|
| 465 |
+
"AQI",
|
| 466 |
+
"QPV",
|
| 467 |
+
"FSL",
|
| 468 |
+
"GYL",
|
| 469 |
+
"PNA",
|
| 470 |
+
"RAP",
|
| 471 |
+
"QRA",
|
| 472 |
+
"LFI",
|
| 473 |
+
"ANR",
|
| 474 |
+
"GNL",
|
| 475 |
+
"VLG",
|
| 476 |
+
"LFP",
|
| 477 |
+
"QKD",
|
| 478 |
+
"GIL",
|
| 479 |
+
"EKI",
|
| 480 |
+
"SPV",
|
| 481 |
+
"DQA",
|
| 482 |
+
"LER",
|
| 483 |
+
"RLG",
|
| 484 |
+
"DAI",
|
| 485 |
+
"TAA",
|
| 486 |
+
"PDS",
|
| 487 |
+
"RDL",
|
| 488 |
+
"VTR",
|
| 489 |
+
"DAG",
|
| 490 |
+
"AEG",
|
| 491 |
+
"SLI",
|
| 492 |
+
"FKW",
|
| 493 |
+
"VAG",
|
| 494 |
+
"VAV",
|
| 495 |
+
"RFV",
|
| 496 |
+
"GAG",
|
| 497 |
+
"GLT",
|
| 498 |
+
"VKR",
|
| 499 |
+
"RQQ",
|
| 500 |
+
"RVE",
|
| 501 |
+
"KGE",
|
| 502 |
+
"TRF"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"antibiotic efflux",
|
| 506 |
+
"antibiotic efflux;antibiotic target alteration",
|
| 507 |
+
"antibiotic efflux;reduced permeability to antibiotic",
|
| 508 |
+
"antibiotic inactivation",
|
| 509 |
+
"antibiotic target alteration",
|
| 510 |
+
"antibiotic target alteration;antibiotic target replacement",
|
| 511 |
+
"antibiotic target protection",
|
| 512 |
+
"antibiotic target replacement",
|
| 513 |
+
"reduced permeability to antibiotic",
|
| 514 |
+
"resistance by host-dependent nutrient acquisition"
|
| 515 |
+
],
|
| 516 |
+
"task_type": "multiclass",
|
| 517 |
+
"target": "mechanism",
|
| 518 |
+
"k": 3,
|
| 519 |
+
"max_features": 500,
|
| 520 |
+
"n_samples": 6054,
|
| 521 |
+
"n_features": 500,
|
| 522 |
+
"n_classes": 10
|
| 523 |
+
}
|
data_processed/card/card_mechanism_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1b9d681610c6a31460738f88eb79edb53a953e31db719fc2a9eae01a25a511c6
|
| 3 |
+
size 9816
|
data_processed/card/card_mechanism_y_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ccb1d3c6f9de05eab30cc90f7a1cac15a717f3757b177d632b1f4fc543278c01
|
| 3 |
+
size 34024
|
data_processed/card/card_mechanism_y_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a8bf2328cff6f19f393b6faab709ba383972f3125d0626107a6599e90248fc07
|
| 3 |
+
size 4976
|
data_processed/ncbi/ncbi_amr_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8b27b9af79f60246b39456a5c338cd16f9279cdf57952f701c18729e94891191
|
| 3 |
+
size 692128
|
data_processed/ncbi/ncbi_amr_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5cb410aa23501e07ab67ecfe4dffd2004ce8db235ee4595ff22efba0f3664193
|
| 3 |
+
size 2408128
|
data_processed/ncbi/ncbi_amr_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:524db56ddde448beb99d4da060ad6ac0aaee3828565b9900f5202655e9f64514
|
| 3 |
+
size 348128
|
data_processed/ncbi/ncbi_amr_metadata.json
ADDED
|
@@ -0,0 +1,537 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"AAAAAA",
|
| 4 |
+
"TTTTTT",
|
| 5 |
+
"AAAAAT",
|
| 6 |
+
"ATTTTT",
|
| 7 |
+
"TAAAAA",
|
| 8 |
+
"TTTTTA",
|
| 9 |
+
"TTTTTC",
|
| 10 |
+
"TTTAAA",
|
| 11 |
+
"GAAAAA",
|
| 12 |
+
"AAAATA",
|
| 13 |
+
"TATTTT",
|
| 14 |
+
"TTAAAA",
|
| 15 |
+
"TTTTAA",
|
| 16 |
+
"AATAAA",
|
| 17 |
+
"CTTTTT",
|
| 18 |
+
"AAAATT",
|
| 19 |
+
"AAAAAG",
|
| 20 |
+
"TTTATT",
|
| 21 |
+
"AATTTT",
|
| 22 |
+
"AAATAA",
|
| 23 |
+
"ATAAAA",
|
| 24 |
+
"TTTTAT",
|
| 25 |
+
"TTATTT",
|
| 26 |
+
"CAAAAA",
|
| 27 |
+
"GCCAGC",
|
| 28 |
+
"CGCCAG",
|
| 29 |
+
"TTTTTG",
|
| 30 |
+
"TTTTCA",
|
| 31 |
+
"CGCCGC",
|
| 32 |
+
"GCTGGC",
|
| 33 |
+
"CAGCAG",
|
| 34 |
+
"CTGCTG",
|
| 35 |
+
"TGAAAA",
|
| 36 |
+
"GCGGCG",
|
| 37 |
+
"CTGGCG",
|
| 38 |
+
"AAATTT",
|
| 39 |
+
"TTCTTT",
|
| 40 |
+
"AAAAAC",
|
| 41 |
+
"AAAGAA",
|
| 42 |
+
"TTTCTT",
|
| 43 |
+
"GTTTTT",
|
| 44 |
+
"AAGAAA",
|
| 45 |
+
"ATTAAA",
|
| 46 |
+
"GCCGCC",
|
| 47 |
+
"TTTAAT",
|
| 48 |
+
"TTTTCT",
|
| 49 |
+
"CAGCGC",
|
| 50 |
+
"TAAAAT",
|
| 51 |
+
"AGAAAA",
|
| 52 |
+
"AATAAT",
|
| 53 |
+
"ATTTTA",
|
| 54 |
+
"TCTTTT",
|
| 55 |
+
"GGCGGC",
|
| 56 |
+
"CCAGCG",
|
| 57 |
+
"ATTATT",
|
| 58 |
+
"GCGCTG",
|
| 59 |
+
"CCAGCA",
|
| 60 |
+
"AATATT",
|
| 61 |
+
"AAAAGA",
|
| 62 |
+
"GCGCCG",
|
| 63 |
+
"CATTTT",
|
| 64 |
+
"CGGCGC",
|
| 65 |
+
"AAAATC",
|
| 66 |
+
"AAATCA",
|
| 67 |
+
"GCAAAA",
|
| 68 |
+
"AAAATG",
|
| 69 |
+
"TGCTGG",
|
| 70 |
+
"CGCTGG",
|
| 71 |
+
"TCAAAA",
|
| 72 |
+
"TTAAAT",
|
| 73 |
+
"ATATTT",
|
| 74 |
+
"AAATAT",
|
| 75 |
+
"AAATTA",
|
| 76 |
+
"ATTTAA",
|
| 77 |
+
"TTTTGA",
|
| 78 |
+
"GATTTT",
|
| 79 |
+
"TTTTGC",
|
| 80 |
+
"TGATTT",
|
| 81 |
+
"AAAACA",
|
| 82 |
+
"TAATTT",
|
| 83 |
+
"TTAATT",
|
| 84 |
+
"TGCTGC",
|
| 85 |
+
"CCGCCG",
|
| 86 |
+
"GCAGCA",
|
| 87 |
+
"CAGCAA",
|
| 88 |
+
"AATTAA",
|
| 89 |
+
"ATTTTC",
|
| 90 |
+
"TCGCCG",
|
| 91 |
+
"TGTTTT",
|
| 92 |
+
"TTCTTC",
|
| 93 |
+
"AAAAGC",
|
| 94 |
+
"CAAAAT",
|
| 95 |
+
"CGGCGA",
|
| 96 |
+
"TTGCTG",
|
| 97 |
+
"CGGCGG",
|
| 98 |
+
"GCTTTT",
|
| 99 |
+
"GAAAAT",
|
| 100 |
+
"ATTTTG",
|
| 101 |
+
"TTTCAA",
|
| 102 |
+
"CGCCGG",
|
| 103 |
+
"GAAGAA",
|
| 104 |
+
"ATAAAT",
|
| 105 |
+
"CCGGCG",
|
| 106 |
+
"GCGCCA",
|
| 107 |
+
"TTTCAT",
|
| 108 |
+
"ATCAAA",
|
| 109 |
+
"TAAATA",
|
| 110 |
+
"ATTTAT",
|
| 111 |
+
"ATCGCC",
|
| 112 |
+
"TATTTA",
|
| 113 |
+
"TTTGAT",
|
| 114 |
+
"ATGAAA",
|
| 115 |
+
"TTGAAA",
|
| 116 |
+
"ATCAAT",
|
| 117 |
+
"TTGTTT",
|
| 118 |
+
"TTTATC",
|
| 119 |
+
"AAACAA",
|
| 120 |
+
"TCATCA",
|
| 121 |
+
"GATAAA",
|
| 122 |
+
"CAATTT",
|
| 123 |
+
"AACAAA",
|
| 124 |
+
"TCAGCA",
|
| 125 |
+
"CGCCGA",
|
| 126 |
+
"CGCCTG",
|
| 127 |
+
"TGGCGC",
|
| 128 |
+
"TTCAAT",
|
| 129 |
+
"ATTAAT",
|
| 130 |
+
"GGCGAT",
|
| 131 |
+
"ACAAAA",
|
| 132 |
+
"AGCAGC",
|
| 133 |
+
"TCGGCG",
|
| 134 |
+
"TTTGTT",
|
| 135 |
+
"ACCAGC",
|
| 136 |
+
"CCGCCA",
|
| 137 |
+
"TTCAAA",
|
| 138 |
+
"AAATTG",
|
| 139 |
+
"TTCATC",
|
| 140 |
+
"GCTGCT",
|
| 141 |
+
"ATTGAT",
|
| 142 |
+
"TGCTGA",
|
| 143 |
+
"ATTGAA",
|
| 144 |
+
"CGCGCC",
|
| 145 |
+
"CATCAA",
|
| 146 |
+
"CAGGCG",
|
| 147 |
+
"TTTGAA",
|
| 148 |
+
"TAAATT",
|
| 149 |
+
"GCTGAA",
|
| 150 |
+
"GCGCGC",
|
| 151 |
+
"TGATGA",
|
| 152 |
+
"TTCAGC",
|
| 153 |
+
"GCTGGT",
|
| 154 |
+
"TTTTGT",
|
| 155 |
+
"AAACCA",
|
| 156 |
+
"AAAGCA",
|
| 157 |
+
"TCATTT",
|
| 158 |
+
"AATCAA",
|
| 159 |
+
"AAATGA",
|
| 160 |
+
"GATGAA",
|
| 161 |
+
"TGCTTT",
|
| 162 |
+
"TAATAA",
|
| 163 |
+
"TAAAAC",
|
| 164 |
+
"AATTTC",
|
| 165 |
+
"TTCATT",
|
| 166 |
+
"TCCAGC",
|
| 167 |
+
"GGCGCG",
|
| 168 |
+
"TGGTTT",
|
| 169 |
+
"GCCTGC",
|
| 170 |
+
"TTATTA",
|
| 171 |
+
"AGCGCC",
|
| 172 |
+
"GCCGGC",
|
| 173 |
+
"ATTTCA",
|
| 174 |
+
"TCGCCA",
|
| 175 |
+
"TGGCGG",
|
| 176 |
+
"CTTCTT",
|
| 177 |
+
"CTGCGC",
|
| 178 |
+
"AATTTA",
|
| 179 |
+
"TTGATT",
|
| 180 |
+
"AATGAA",
|
| 181 |
+
"GTTTTA",
|
| 182 |
+
"GAAATT",
|
| 183 |
+
"TGAAAT",
|
| 184 |
+
"TTGATG",
|
| 185 |
+
"AGCAAA",
|
| 186 |
+
"GCTGGA",
|
| 187 |
+
"GCAGCG",
|
| 188 |
+
"CGCTGC",
|
| 189 |
+
"GCAGGC",
|
| 190 |
+
"TTTTAC",
|
| 191 |
+
"CAATAA",
|
| 192 |
+
"GTAAAA",
|
| 193 |
+
"ATCAGC",
|
| 194 |
+
"TGGCGA",
|
| 195 |
+
"CACCAG",
|
| 196 |
+
"GGCGCT",
|
| 197 |
+
"CTTTAA",
|
| 198 |
+
"GCGCAG",
|
| 199 |
+
"TCTTCA",
|
| 200 |
+
"AAAACT",
|
| 201 |
+
"AAGAAG",
|
| 202 |
+
"TTTGCT",
|
| 203 |
+
"TTAAAG",
|
| 204 |
+
"AAAACC",
|
| 205 |
+
"GCTGAT",
|
| 206 |
+
"TCTTTA",
|
| 207 |
+
"CAACAA",
|
| 208 |
+
"TTTTCC",
|
| 209 |
+
"ACTTTT",
|
| 210 |
+
"TAAAGA",
|
| 211 |
+
"TGCCGC",
|
| 212 |
+
"CCTGCT",
|
| 213 |
+
"CCGCGC",
|
| 214 |
+
"CTTTAT",
|
| 215 |
+
"AGTTTT",
|
| 216 |
+
"GCCGCG",
|
| 217 |
+
"TTATTG",
|
| 218 |
+
"GGAAAA",
|
| 219 |
+
"GCGCGG",
|
| 220 |
+
"TGAAGA",
|
| 221 |
+
"TCAATT",
|
| 222 |
+
"ATAAAG",
|
| 223 |
+
"CTGAAA",
|
| 224 |
+
"GGTTTT",
|
| 225 |
+
"CTGGTG",
|
| 226 |
+
"CGCGGC",
|
| 227 |
+
"AACAGC",
|
| 228 |
+
"AAAAGT",
|
| 229 |
+
"ATCTTT",
|
| 230 |
+
"GCGGCA",
|
| 231 |
+
"AAAGAT",
|
| 232 |
+
"TTGTTG",
|
| 233 |
+
"ATTTCT",
|
| 234 |
+
"AATTGA",
|
| 235 |
+
"CAGCGG",
|
| 236 |
+
"AGAAAT",
|
| 237 |
+
"CGGCAA",
|
| 238 |
+
"TTTCAG",
|
| 239 |
+
"TAAAGC",
|
| 240 |
+
"CCGCTG",
|
| 241 |
+
"CTTCAA",
|
| 242 |
+
"CCACCA",
|
| 243 |
+
"GCTTTA",
|
| 244 |
+
"CTAAAA",
|
| 245 |
+
"GCTGTT",
|
| 246 |
+
"AAGCAA",
|
| 247 |
+
"AATTGC",
|
| 248 |
+
"AACAAT",
|
| 249 |
+
"TTCGCC",
|
| 250 |
+
"TTGAAG",
|
| 251 |
+
"GCAATT",
|
| 252 |
+
"TTGCTT",
|
| 253 |
+
"CTTTTA",
|
| 254 |
+
"TAAAAG",
|
| 255 |
+
"AGCAGG",
|
| 256 |
+
"AGCAAT",
|
| 257 |
+
"TTGCCG",
|
| 258 |
+
"TTTTAG",
|
| 259 |
+
"ATTGCT",
|
| 260 |
+
"CAGCCA",
|
| 261 |
+
"GATATT",
|
| 262 |
+
"CTGCCG",
|
| 263 |
+
"AATATC",
|
| 264 |
+
"CGGCAG",
|
| 265 |
+
"CAATAT",
|
| 266 |
+
"AACTTT",
|
| 267 |
+
"ATAATT",
|
| 268 |
+
"CCATTT",
|
| 269 |
+
"CAGCTT",
|
| 270 |
+
"ATCATC",
|
| 271 |
+
"TTGCCA",
|
| 272 |
+
"CGCCAT",
|
| 273 |
+
"AATTAT",
|
| 274 |
+
"GGCGAA",
|
| 275 |
+
"TCAATA",
|
| 276 |
+
"AAGCTG",
|
| 277 |
+
"TTTCTG",
|
| 278 |
+
"CTGTTT",
|
| 279 |
+
"CCAGGC",
|
| 280 |
+
"TGCGCC",
|
| 281 |
+
"CAAATT",
|
| 282 |
+
"TGGCAA",
|
| 283 |
+
"ATATTG",
|
| 284 |
+
"TTTGCC",
|
| 285 |
+
"AAAACG",
|
| 286 |
+
"AATCAT",
|
| 287 |
+
"GGCAAA",
|
| 288 |
+
"ATCATT",
|
| 289 |
+
"CATTAA",
|
| 290 |
+
"GCCTGG",
|
| 291 |
+
"GGTAAA",
|
| 292 |
+
"CAGAAA",
|
| 293 |
+
"AAACAG",
|
| 294 |
+
"CGTTTT",
|
| 295 |
+
"TGGCTG",
|
| 296 |
+
"CCAGCC",
|
| 297 |
+
"ATTGTT",
|
| 298 |
+
"AATTTG",
|
| 299 |
+
"AAATGG",
|
| 300 |
+
"CATCAT",
|
| 301 |
+
"TTTACC",
|
| 302 |
+
"TATAAA",
|
| 303 |
+
"TATCAA",
|
| 304 |
+
"TTTATA",
|
| 305 |
+
"TATTGA",
|
| 306 |
+
"ACCGCC",
|
| 307 |
+
"AAATTC",
|
| 308 |
+
"TCACCA",
|
| 309 |
+
"TTTCCA",
|
| 310 |
+
"AAAGTT",
|
| 311 |
+
"CTGGAA",
|
| 312 |
+
"GCCAGG",
|
| 313 |
+
"CCGGCA",
|
| 314 |
+
"TTCCAG",
|
| 315 |
+
"GGCGCA",
|
| 316 |
+
"ATGGCG",
|
| 317 |
+
"TTGATA",
|
| 318 |
+
"CCTGGC",
|
| 319 |
+
"TGGTGG",
|
| 320 |
+
"ATGATT",
|
| 321 |
+
"TTAATG",
|
| 322 |
+
"CAGCAT",
|
| 323 |
+
"CAGCAC",
|
| 324 |
+
"ATGATG",
|
| 325 |
+
"GGCTGG",
|
| 326 |
+
"CGCAGC",
|
| 327 |
+
"CTTTTG",
|
| 328 |
+
"AATGAT",
|
| 329 |
+
"ATGCTG",
|
| 330 |
+
"GAATTT",
|
| 331 |
+
"TGGAAA",
|
| 332 |
+
"TTAATA",
|
| 333 |
+
"TTATCA",
|
| 334 |
+
"GATGAT",
|
| 335 |
+
"ATCACC",
|
| 336 |
+
"TTTAAC",
|
| 337 |
+
"CAAAAC",
|
| 338 |
+
"TATTAA",
|
| 339 |
+
"TGCCGG",
|
| 340 |
+
"ACCAAA",
|
| 341 |
+
"TGGTGA",
|
| 342 |
+
"GCATCA",
|
| 343 |
+
"GCTGCG",
|
| 344 |
+
"CAAAAG",
|
| 345 |
+
"TGAATA",
|
| 346 |
+
"GCCGCT",
|
| 347 |
+
"GTTAAA",
|
| 348 |
+
"AAACTT",
|
| 349 |
+
"TGATAA",
|
| 350 |
+
"CAATCA",
|
| 351 |
+
"CTGGCC",
|
| 352 |
+
"AGCGGC",
|
| 353 |
+
"TGCAAA",
|
| 354 |
+
"GGCCAG",
|
| 355 |
+
"GCCATC",
|
| 356 |
+
"GCATTT",
|
| 357 |
+
"TATTCA",
|
| 358 |
+
"TTCACC",
|
| 359 |
+
"TAAATC",
|
| 360 |
+
"AGCTTT",
|
| 361 |
+
"AAATGC",
|
| 362 |
+
"AAAGCT",
|
| 363 |
+
"GGTGAT",
|
| 364 |
+
"GGCGGT",
|
| 365 |
+
"GCTTCA",
|
| 366 |
+
"TGATGC",
|
| 367 |
+
"TCAGCG",
|
| 368 |
+
"CTTCAT",
|
| 369 |
+
"GTTTTG",
|
| 370 |
+
"GTGCTG",
|
| 371 |
+
"CCAGTT",
|
| 372 |
+
"AACTGG",
|
| 373 |
+
"ATCGGC",
|
| 374 |
+
"GCCATT",
|
| 375 |
+
"TGCAGC",
|
| 376 |
+
"CGCTGA",
|
| 377 |
+
"CAATTG",
|
| 378 |
+
"GGTGAA",
|
| 379 |
+
"TTTGCA",
|
| 380 |
+
"GCTGCC",
|
| 381 |
+
"CTGCAA",
|
| 382 |
+
"GCCGAT",
|
| 383 |
+
"CCATCA",
|
| 384 |
+
"GCTGCA",
|
| 385 |
+
"GGCAGC",
|
| 386 |
+
"TCTTCT",
|
| 387 |
+
"TTGCAG",
|
| 388 |
+
"AATGGC",
|
| 389 |
+
"CCAAAA",
|
| 390 |
+
"ACGCCG",
|
| 391 |
+
"CGGCGT",
|
| 392 |
+
"ATTCAA",
|
| 393 |
+
"AACCAA",
|
| 394 |
+
"TTTTGG",
|
| 395 |
+
"TCAACA",
|
| 396 |
+
"TTTGGT",
|
| 397 |
+
"CAGGCC",
|
| 398 |
+
"TTGAAT",
|
| 399 |
+
"ATATTC",
|
| 400 |
+
"ACCAAT",
|
| 401 |
+
"CATAAA",
|
| 402 |
+
"TTCAAC",
|
| 403 |
+
"TGATTG",
|
| 404 |
+
"GATTTA",
|
| 405 |
+
"TTGTTC",
|
| 406 |
+
"TGAAGC",
|
| 407 |
+
"ATCAAC",
|
| 408 |
+
"TTCTGC",
|
| 409 |
+
"GGCCTG",
|
| 410 |
+
"TTGGTT",
|
| 411 |
+
"CATCGC",
|
| 412 |
+
"GATGGC",
|
| 413 |
+
"ACCACC",
|
| 414 |
+
"ATGAAG",
|
| 415 |
+
"AAAGCC",
|
| 416 |
+
"GAACAA",
|
| 417 |
+
"AAATAC",
|
| 418 |
+
"GAATAT",
|
| 419 |
+
"TCGGCA",
|
| 420 |
+
"ATTGGT",
|
| 421 |
+
"GGCTTT",
|
| 422 |
+
"GCAAAT",
|
| 423 |
+
"AAGTTT",
|
| 424 |
+
"ACCTGC",
|
| 425 |
+
"ATTTGC",
|
| 426 |
+
"CACCGC",
|
| 427 |
+
"GTTGAA",
|
| 428 |
+
"TGTTGA",
|
| 429 |
+
"GTTGAT",
|
| 430 |
+
"ATTCAT",
|
| 431 |
+
"ACAGCA",
|
| 432 |
+
"GCACCA",
|
| 433 |
+
"CGCTTT",
|
| 434 |
+
"TGATCA",
|
| 435 |
+
"AATTCA",
|
| 436 |
+
"ATCTTC",
|
| 437 |
+
"CCTGCA",
|
| 438 |
+
"AACAAC",
|
| 439 |
+
"GCAGAA",
|
| 440 |
+
"AACGCC",
|
| 441 |
+
"TGCCGA",
|
| 442 |
+
"ATTCTT",
|
| 443 |
+
"CGCCAC",
|
| 444 |
+
"GTATTT",
|
| 445 |
+
"GGCAAT",
|
| 446 |
+
"TCAAAT",
|
| 447 |
+
"ATTGCC",
|
| 448 |
+
"TGCTGT",
|
| 449 |
+
"AAACTG",
|
| 450 |
+
"AGAAGA",
|
| 451 |
+
"TTTATG",
|
| 452 |
+
"CGCCAA",
|
| 453 |
+
"CTGCTT",
|
| 454 |
+
"GCAGGT",
|
| 455 |
+
"TGATGG",
|
| 456 |
+
"TGTTCA",
|
| 457 |
+
"GCCAAA",
|
| 458 |
+
"GCAACA",
|
| 459 |
+
"GGCGCC",
|
| 460 |
+
"TGGTGC",
|
| 461 |
+
"ATGCCG",
|
| 462 |
+
"CAGTTT",
|
| 463 |
+
"TGTTGC",
|
| 464 |
+
"ATGAAT",
|
| 465 |
+
"GCGATG",
|
| 466 |
+
"CACCAC",
|
| 467 |
+
"TGAATT",
|
| 468 |
+
"AAAGCG",
|
| 469 |
+
"TGAACA",
|
| 470 |
+
"CATCAG",
|
| 471 |
+
"GGTGGT",
|
| 472 |
+
"GAAAAC",
|
| 473 |
+
"TTAAAC",
|
| 474 |
+
"GCGCCT",
|
| 475 |
+
"GTTTTC",
|
| 476 |
+
"CGGCAT",
|
| 477 |
+
"GAAGAT",
|
| 478 |
+
"AGCAAC",
|
| 479 |
+
"TTTCAC",
|
| 480 |
+
"TGCAGG",
|
| 481 |
+
"ATAAAC",
|
| 482 |
+
"CTGGCA",
|
| 483 |
+
"CGCCGT",
|
| 484 |
+
"GCGGTG",
|
| 485 |
+
"CCAGCT",
|
| 486 |
+
"TTTGGC",
|
| 487 |
+
"TAATGA",
|
| 488 |
+
"CAGCGA",
|
| 489 |
+
"TCATTA",
|
| 490 |
+
"CTGATT",
|
| 491 |
+
"ATTTGA",
|
| 492 |
+
"AAGAAT",
|
| 493 |
+
"ACCATT",
|
| 494 |
+
"ACGGCG",
|
| 495 |
+
"GTTGTT",
|
| 496 |
+
"CGGCCA",
|
| 497 |
+
"AGCTGG",
|
| 498 |
+
"AAATCT",
|
| 499 |
+
"AGGCGC",
|
| 500 |
+
"CTGTTC",
|
| 501 |
+
"GCCACC",
|
| 502 |
+
"AAGCAG"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"aminoglycoside",
|
| 506 |
+
"beta-lactam",
|
| 507 |
+
"fosfomycin",
|
| 508 |
+
"glycopeptide",
|
| 509 |
+
"macrolide",
|
| 510 |
+
"phenicol",
|
| 511 |
+
"quinolone",
|
| 512 |
+
"rifampicin",
|
| 513 |
+
"sulfonamide",
|
| 514 |
+
"tetracycline",
|
| 515 |
+
"trimethoprim"
|
| 516 |
+
],
|
| 517 |
+
"task_type": "multilabel",
|
| 518 |
+
"target": "amr_drug_class",
|
| 519 |
+
"k": 6,
|
| 520 |
+
"max_features": 500,
|
| 521 |
+
"n_samples": 862,
|
| 522 |
+
"n_features": 500,
|
| 523 |
+
"n_classes": 11,
|
| 524 |
+
"drug_classes": [
|
| 525 |
+
"aminoglycoside",
|
| 526 |
+
"beta-lactam",
|
| 527 |
+
"fosfomycin",
|
| 528 |
+
"glycopeptide",
|
| 529 |
+
"macrolide",
|
| 530 |
+
"phenicol",
|
| 531 |
+
"quinolone",
|
| 532 |
+
"rifampicin",
|
| 533 |
+
"sulfonamide",
|
| 534 |
+
"tetracycline",
|
| 535 |
+
"trimethoprim"
|
| 536 |
+
]
|
| 537 |
+
}
|
data_processed/ncbi/ncbi_amr_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e9cfd43cdd5225068480adc4532ae92590f8562cae1b3d7e6dd6b7b2ec41f97
|
| 3 |
+
size 15352
|
data_processed/ncbi/ncbi_amr_y_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:14bf49fa3126d571873acfc454ca684d26aeef410c58f64df82132c326b905d1
|
| 3 |
+
size 53104
|
data_processed/ncbi/ncbi_amr_y_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3c6344c6b50da9f524d488188950a1441175619e2edaee2e953295bbe02c1bc0
|
| 3 |
+
size 7784
|
data_processed/ncbi/ncbi_organism_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4df7f95f1e87a45419e5da13dd13d80f9423f5ed1c2a72bf9ef5b2f2e6eb493c
|
| 3 |
+
size 692128
|
data_processed/ncbi/ncbi_organism_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:03a1d3b21f2a2bda4a4008d3fd7c00dedaa661ecb8513a2b1c67c17fb338b5c5
|
| 3 |
+
size 2408128
|
data_processed/ncbi/ncbi_organism_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5a312735852dca85167dd7ac425542f9dc70f91aa7266df0a196a5ad2594e775
|
| 3 |
+
size 348128
|
data_processed/ncbi/ncbi_organism_metadata.json
ADDED
|
@@ -0,0 +1,521 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"AAAAAA",
|
| 4 |
+
"TTTTTT",
|
| 5 |
+
"ATTTTT",
|
| 6 |
+
"AAAAAT",
|
| 7 |
+
"TAAAAA",
|
| 8 |
+
"TTTTTA",
|
| 9 |
+
"TTTTTC",
|
| 10 |
+
"TTTAAA",
|
| 11 |
+
"GAAAAA",
|
| 12 |
+
"AAAATA",
|
| 13 |
+
"TTAAAA",
|
| 14 |
+
"TATTTT",
|
| 15 |
+
"TTTTAA",
|
| 16 |
+
"AATAAA",
|
| 17 |
+
"CTTTTT",
|
| 18 |
+
"AATTTT",
|
| 19 |
+
"AAAAAG",
|
| 20 |
+
"TTTATT",
|
| 21 |
+
"AAAATT",
|
| 22 |
+
"AAATAA",
|
| 23 |
+
"ATAAAA",
|
| 24 |
+
"TTTTAT",
|
| 25 |
+
"TTATTT",
|
| 26 |
+
"CAAAAA",
|
| 27 |
+
"GCCAGC",
|
| 28 |
+
"CGCCAG",
|
| 29 |
+
"TTTTCA",
|
| 30 |
+
"TTTTTG",
|
| 31 |
+
"CGCCGC",
|
| 32 |
+
"GCTGGC",
|
| 33 |
+
"TGAAAA",
|
| 34 |
+
"CAGCAG",
|
| 35 |
+
"GCGGCG",
|
| 36 |
+
"CTGGCG",
|
| 37 |
+
"CTGCTG",
|
| 38 |
+
"TTCTTT",
|
| 39 |
+
"AAATTT",
|
| 40 |
+
"AAAAAC",
|
| 41 |
+
"TTTCTT",
|
| 42 |
+
"AAAGAA",
|
| 43 |
+
"ATTAAA",
|
| 44 |
+
"AAGAAA",
|
| 45 |
+
"GTTTTT",
|
| 46 |
+
"GCCGCC",
|
| 47 |
+
"TTTTCT",
|
| 48 |
+
"TTTAAT",
|
| 49 |
+
"CAGCGC",
|
| 50 |
+
"TAAAAT",
|
| 51 |
+
"AATAAT",
|
| 52 |
+
"AGAAAA",
|
| 53 |
+
"GGCGGC",
|
| 54 |
+
"CCAGCG",
|
| 55 |
+
"TCTTTT",
|
| 56 |
+
"ATTTTA",
|
| 57 |
+
"CCAGCA",
|
| 58 |
+
"ATTATT",
|
| 59 |
+
"CGGCGC",
|
| 60 |
+
"GCGCTG",
|
| 61 |
+
"AAAAGA",
|
| 62 |
+
"GCAAAA",
|
| 63 |
+
"CATTTT",
|
| 64 |
+
"AATATT",
|
| 65 |
+
"GCGCCG",
|
| 66 |
+
"AAAATG",
|
| 67 |
+
"CGCTGG",
|
| 68 |
+
"AAAATC",
|
| 69 |
+
"TTAAAT",
|
| 70 |
+
"AAATCA",
|
| 71 |
+
"TGCTGG",
|
| 72 |
+
"ATATTT",
|
| 73 |
+
"TCAAAA",
|
| 74 |
+
"AAATAT",
|
| 75 |
+
"ATTTAA",
|
| 76 |
+
"TTTTGC",
|
| 77 |
+
"GATTTT",
|
| 78 |
+
"TTTTGA",
|
| 79 |
+
"AAATTA",
|
| 80 |
+
"TGATTT",
|
| 81 |
+
"AAAACA",
|
| 82 |
+
"TAATTT",
|
| 83 |
+
"AATTAA",
|
| 84 |
+
"TTAATT",
|
| 85 |
+
"CAGCAA",
|
| 86 |
+
"CCGCCG",
|
| 87 |
+
"ATTTTC",
|
| 88 |
+
"GCAGCA",
|
| 89 |
+
"TCGCCG",
|
| 90 |
+
"TGCTGC",
|
| 91 |
+
"TGTTTT",
|
| 92 |
+
"CAAAAT",
|
| 93 |
+
"AAAAGC",
|
| 94 |
+
"TTCTTC",
|
| 95 |
+
"CGGCGA",
|
| 96 |
+
"GCTTTT",
|
| 97 |
+
"TTGCTG",
|
| 98 |
+
"CGGCGG",
|
| 99 |
+
"ATTTTG",
|
| 100 |
+
"GAAAAT",
|
| 101 |
+
"CGCCGG",
|
| 102 |
+
"ATAAAT",
|
| 103 |
+
"TTTCAA",
|
| 104 |
+
"GAAGAA",
|
| 105 |
+
"CCGGCG",
|
| 106 |
+
"GCGCCA",
|
| 107 |
+
"TTTCAT",
|
| 108 |
+
"ATCAAA",
|
| 109 |
+
"ATTTAT",
|
| 110 |
+
"TAAATA",
|
| 111 |
+
"ATGAAA",
|
| 112 |
+
"ATCGCC",
|
| 113 |
+
"TTGTTT",
|
| 114 |
+
"TTTGAT",
|
| 115 |
+
"TTGAAA",
|
| 116 |
+
"AAACAA",
|
| 117 |
+
"TATTTA",
|
| 118 |
+
"ATCAAT",
|
| 119 |
+
"AACAAA",
|
| 120 |
+
"TTTATC",
|
| 121 |
+
"CGCCTG",
|
| 122 |
+
"TCAGCA",
|
| 123 |
+
"TCATCA",
|
| 124 |
+
"GATAAA",
|
| 125 |
+
"TGGCGC",
|
| 126 |
+
"CGCCGA",
|
| 127 |
+
"CCGCCA",
|
| 128 |
+
"TTCAAA",
|
| 129 |
+
"AAATTG",
|
| 130 |
+
"TTTGTT",
|
| 131 |
+
"TTCATC",
|
| 132 |
+
"ACAAAA",
|
| 133 |
+
"ATTGAT",
|
| 134 |
+
"AGCAGC",
|
| 135 |
+
"TTCAAT",
|
| 136 |
+
"ACCAGC",
|
| 137 |
+
"TCGGCG",
|
| 138 |
+
"TGCTGA",
|
| 139 |
+
"CAATTT",
|
| 140 |
+
"ATTAAT",
|
| 141 |
+
"GCTGCT",
|
| 142 |
+
"TTTGAA",
|
| 143 |
+
"GCGCGC",
|
| 144 |
+
"CATCAA",
|
| 145 |
+
"CGCGCC",
|
| 146 |
+
"GGCGAT",
|
| 147 |
+
"TTTTGT",
|
| 148 |
+
"TTCAGC",
|
| 149 |
+
"AAAGCA",
|
| 150 |
+
"ATTGAA",
|
| 151 |
+
"AAACCA",
|
| 152 |
+
"TAAATT",
|
| 153 |
+
"CAGGCG",
|
| 154 |
+
"TGCTTT",
|
| 155 |
+
"TAAAAC",
|
| 156 |
+
"TGATGA",
|
| 157 |
+
"GCTGGT",
|
| 158 |
+
"TCATTT",
|
| 159 |
+
"AAATGA",
|
| 160 |
+
"GCTGAA",
|
| 161 |
+
"TTCATT",
|
| 162 |
+
"GATGAA",
|
| 163 |
+
"CTTCTT",
|
| 164 |
+
"TCGCCA",
|
| 165 |
+
"AATCAA",
|
| 166 |
+
"TAATAA",
|
| 167 |
+
"TCCAGC",
|
| 168 |
+
"AGCGCC",
|
| 169 |
+
"TGGTTT",
|
| 170 |
+
"ATTTCA",
|
| 171 |
+
"GGCGCG",
|
| 172 |
+
"TTGATG",
|
| 173 |
+
"GTTTTA",
|
| 174 |
+
"TTATTA",
|
| 175 |
+
"TTGATT",
|
| 176 |
+
"AATGAA",
|
| 177 |
+
"TGGCGG",
|
| 178 |
+
"GCCTGC",
|
| 179 |
+
"AATTTC",
|
| 180 |
+
"TGAAAT",
|
| 181 |
+
"AATTTA",
|
| 182 |
+
"GCCGGC",
|
| 183 |
+
"GAAATT",
|
| 184 |
+
"GCAGCG",
|
| 185 |
+
"CACCAG",
|
| 186 |
+
"CTGCGC",
|
| 187 |
+
"GCTGGA",
|
| 188 |
+
"CAATAA",
|
| 189 |
+
"CGCTGC",
|
| 190 |
+
"GCGCAG",
|
| 191 |
+
"TTTTAC",
|
| 192 |
+
"AGCAAA",
|
| 193 |
+
"ATCAGC",
|
| 194 |
+
"GTAAAA",
|
| 195 |
+
"TCTTCA",
|
| 196 |
+
"AAAACT",
|
| 197 |
+
"CTTTAA",
|
| 198 |
+
"GGCGCT",
|
| 199 |
+
"AAGAAG",
|
| 200 |
+
"TGGCGA",
|
| 201 |
+
"TTAAAG",
|
| 202 |
+
"GCAGGC",
|
| 203 |
+
"GCTGAT",
|
| 204 |
+
"TTTGCT",
|
| 205 |
+
"TTTTCC",
|
| 206 |
+
"CAACAA",
|
| 207 |
+
"TTATTG",
|
| 208 |
+
"AAAACC",
|
| 209 |
+
"CTTTAT",
|
| 210 |
+
"TAAAGA",
|
| 211 |
+
"ACTTTT",
|
| 212 |
+
"TCTTTA",
|
| 213 |
+
"CCTGCT",
|
| 214 |
+
"CCGCGC",
|
| 215 |
+
"ATAAAG",
|
| 216 |
+
"TGCCGC",
|
| 217 |
+
"GGAAAA",
|
| 218 |
+
"AGTTTT",
|
| 219 |
+
"CGGCAA",
|
| 220 |
+
"TCAATT",
|
| 221 |
+
"GCGCGG",
|
| 222 |
+
"GCGGCA",
|
| 223 |
+
"ATCTTT",
|
| 224 |
+
"ATTTCT",
|
| 225 |
+
"GCCGCG",
|
| 226 |
+
"CTGGTG",
|
| 227 |
+
"GGTTTT",
|
| 228 |
+
"CTTCAA",
|
| 229 |
+
"TGAAGA",
|
| 230 |
+
"TTTCAG",
|
| 231 |
+
"AACAGC",
|
| 232 |
+
"AAAGAT",
|
| 233 |
+
"CGCGGC",
|
| 234 |
+
"AATTGA",
|
| 235 |
+
"AAAAGT",
|
| 236 |
+
"AGAAAT",
|
| 237 |
+
"CTGAAA",
|
| 238 |
+
"TTGTTG",
|
| 239 |
+
"GCTGTT",
|
| 240 |
+
"CAGCGG",
|
| 241 |
+
"CCACCA",
|
| 242 |
+
"TAAAGC",
|
| 243 |
+
"CTAAAA",
|
| 244 |
+
"AAGCAA",
|
| 245 |
+
"CCGCTG",
|
| 246 |
+
"TTCGCC",
|
| 247 |
+
"CTTTTA",
|
| 248 |
+
"TTGCCG",
|
| 249 |
+
"AACAAT",
|
| 250 |
+
"AATTGC",
|
| 251 |
+
"TTGCTT",
|
| 252 |
+
"CTGCCG",
|
| 253 |
+
"GCTTTA",
|
| 254 |
+
"TTGAAG",
|
| 255 |
+
"GCAATT",
|
| 256 |
+
"GATATT",
|
| 257 |
+
"AGCAAT",
|
| 258 |
+
"AATATC",
|
| 259 |
+
"TTTTAG",
|
| 260 |
+
"ATTGCT",
|
| 261 |
+
"CCATTT",
|
| 262 |
+
"AGCAGG",
|
| 263 |
+
"CGCCAT",
|
| 264 |
+
"TTGCCA",
|
| 265 |
+
"TAAAAG",
|
| 266 |
+
"CAGCCA",
|
| 267 |
+
"CGGCAG",
|
| 268 |
+
"AACTTT",
|
| 269 |
+
"GGCAAA",
|
| 270 |
+
"CAGCTT",
|
| 271 |
+
"CTGTTT",
|
| 272 |
+
"ATAATT",
|
| 273 |
+
"AAGCTG",
|
| 274 |
+
"GGCGAA",
|
| 275 |
+
"ATATTG",
|
| 276 |
+
"CAATAT",
|
| 277 |
+
"CAAATT",
|
| 278 |
+
"TGCGCC",
|
| 279 |
+
"TGGCAA",
|
| 280 |
+
"ATCATC",
|
| 281 |
+
"AAACAG",
|
| 282 |
+
"TGGCTG",
|
| 283 |
+
"AATTAT",
|
| 284 |
+
"TCAATA",
|
| 285 |
+
"CATTAA",
|
| 286 |
+
"TTTCTG",
|
| 287 |
+
"CGTTTT",
|
| 288 |
+
"ATTGTT",
|
| 289 |
+
"CCAGGC",
|
| 290 |
+
"CAGAAA",
|
| 291 |
+
"TTTGCC",
|
| 292 |
+
"GCCTGG",
|
| 293 |
+
"ATCATT",
|
| 294 |
+
"TCACCA",
|
| 295 |
+
"AATTTG",
|
| 296 |
+
"TATAAA",
|
| 297 |
+
"TATCAA",
|
| 298 |
+
"AATCAT",
|
| 299 |
+
"GGTAAA",
|
| 300 |
+
"AAAACG",
|
| 301 |
+
"GCCAGG",
|
| 302 |
+
"TTTATA",
|
| 303 |
+
"CCAGCC",
|
| 304 |
+
"CATCAT",
|
| 305 |
+
"AAATGG",
|
| 306 |
+
"TTGATA",
|
| 307 |
+
"TTTACC",
|
| 308 |
+
"ACCGCC",
|
| 309 |
+
"AAATTC",
|
| 310 |
+
"CCGGCA",
|
| 311 |
+
"CTGGAA",
|
| 312 |
+
"AAAGTT",
|
| 313 |
+
"TTTCCA",
|
| 314 |
+
"GGCGCA",
|
| 315 |
+
"TTCCAG",
|
| 316 |
+
"AATGAT",
|
| 317 |
+
"TATTGA",
|
| 318 |
+
"CAGCAC",
|
| 319 |
+
"ATGATT",
|
| 320 |
+
"TTAATG",
|
| 321 |
+
"ATGGCG",
|
| 322 |
+
"CTTTTG",
|
| 323 |
+
"TGGTGG",
|
| 324 |
+
"CCTGGC",
|
| 325 |
+
"CAGCAT",
|
| 326 |
+
"ATCACC",
|
| 327 |
+
"GAATTT",
|
| 328 |
+
"TGGAAA",
|
| 329 |
+
"GCATCA",
|
| 330 |
+
"TGCCGG",
|
| 331 |
+
"GATGAT",
|
| 332 |
+
"ACCAAA",
|
| 333 |
+
"TGGTGA",
|
| 334 |
+
"TTAATA",
|
| 335 |
+
"GCTGCG",
|
| 336 |
+
"GGCTGG",
|
| 337 |
+
"ATGATG",
|
| 338 |
+
"TTTAAC",
|
| 339 |
+
"GCCGCT",
|
| 340 |
+
"CGCAGC",
|
| 341 |
+
"TTCACC",
|
| 342 |
+
"TATTAA",
|
| 343 |
+
"TTATCA",
|
| 344 |
+
"ATGCTG",
|
| 345 |
+
"GTTAAA",
|
| 346 |
+
"CAAAAC",
|
| 347 |
+
"AGCGGC",
|
| 348 |
+
"AGCTTT",
|
| 349 |
+
"CAATCA",
|
| 350 |
+
"CAAAAG",
|
| 351 |
+
"TGAATA",
|
| 352 |
+
"TGCAAA",
|
| 353 |
+
"GGCCAG",
|
| 354 |
+
"AAAGCT",
|
| 355 |
+
"CTGGCC",
|
| 356 |
+
"TATTCA",
|
| 357 |
+
"TGATAA",
|
| 358 |
+
"GTGCTG",
|
| 359 |
+
"AAATGC",
|
| 360 |
+
"GGTGAT",
|
| 361 |
+
"AAACTT",
|
| 362 |
+
"AACTGG",
|
| 363 |
+
"GCATTT",
|
| 364 |
+
"GCTTCA",
|
| 365 |
+
"TGCAGC",
|
| 366 |
+
"TGATGC",
|
| 367 |
+
"GTTTTG",
|
| 368 |
+
"ACGCCG",
|
| 369 |
+
"TAAATC",
|
| 370 |
+
"GCCATC",
|
| 371 |
+
"AACCAA",
|
| 372 |
+
"ATTCAA",
|
| 373 |
+
"GCTGCC",
|
| 374 |
+
"CGCTGA",
|
| 375 |
+
"GCCATT",
|
| 376 |
+
"GGCGGT",
|
| 377 |
+
"CCAAAA",
|
| 378 |
+
"CAGGCC",
|
| 379 |
+
"CCATCA",
|
| 380 |
+
"CCAGTT",
|
| 381 |
+
"CTTCAT",
|
| 382 |
+
"ATCGGC",
|
| 383 |
+
"TTTTGG",
|
| 384 |
+
"CTGCAA",
|
| 385 |
+
"TCTTCT",
|
| 386 |
+
"TCAGCG",
|
| 387 |
+
"GGCAGC",
|
| 388 |
+
"CAATTG",
|
| 389 |
+
"TCAACA",
|
| 390 |
+
"CATAAA",
|
| 391 |
+
"ATATTC",
|
| 392 |
+
"TTGAAT",
|
| 393 |
+
"ACCACC",
|
| 394 |
+
"GGTGAA",
|
| 395 |
+
"TTCAAC",
|
| 396 |
+
"AATGGC",
|
| 397 |
+
"TTTGCA",
|
| 398 |
+
"ACCAAT",
|
| 399 |
+
"GCTGCA",
|
| 400 |
+
"CATCGC",
|
| 401 |
+
"TGATTG",
|
| 402 |
+
"GATTTA",
|
| 403 |
+
"GCCGAT",
|
| 404 |
+
"TTGCAG",
|
| 405 |
+
"TTTGGT",
|
| 406 |
+
"ATGAAG",
|
| 407 |
+
"CGGCGT",
|
| 408 |
+
"GAACAA",
|
| 409 |
+
"TCGGCA",
|
| 410 |
+
"TTCTGC",
|
| 411 |
+
"GATGGC",
|
| 412 |
+
"CGCTTT",
|
| 413 |
+
"TGAAGC",
|
| 414 |
+
"ATCAAC",
|
| 415 |
+
"AAGTTT",
|
| 416 |
+
"TTGGTT",
|
| 417 |
+
"TTGTTC",
|
| 418 |
+
"AAAGCC",
|
| 419 |
+
"GGCCTG",
|
| 420 |
+
"AAATAC",
|
| 421 |
+
"ATTGGT",
|
| 422 |
+
"GAATAT",
|
| 423 |
+
"ACCTGC",
|
| 424 |
+
"GTTGAT",
|
| 425 |
+
"ATCTTC",
|
| 426 |
+
"GGCTTT",
|
| 427 |
+
"GTTGAA",
|
| 428 |
+
"GCAAAT",
|
| 429 |
+
"GGCAAT",
|
| 430 |
+
"TGTTGA",
|
| 431 |
+
"ATTCAT",
|
| 432 |
+
"CCTGCA",
|
| 433 |
+
"AACGCC",
|
| 434 |
+
"ACAGCA",
|
| 435 |
+
"GCAGAA",
|
| 436 |
+
"GCACCA",
|
| 437 |
+
"AATTCA",
|
| 438 |
+
"TGCCGA",
|
| 439 |
+
"TGATCA",
|
| 440 |
+
"CTGCTT",
|
| 441 |
+
"CACCGC",
|
| 442 |
+
"GCCAAA",
|
| 443 |
+
"ATTTGC",
|
| 444 |
+
"TGCTGT",
|
| 445 |
+
"ATTGCC",
|
| 446 |
+
"TTTATG",
|
| 447 |
+
"GTATTT",
|
| 448 |
+
"AAACTG",
|
| 449 |
+
"GCAGGT",
|
| 450 |
+
"TGTTCA",
|
| 451 |
+
"GCGATG",
|
| 452 |
+
"AACAAC",
|
| 453 |
+
"CGCCAC",
|
| 454 |
+
"CGCCAA",
|
| 455 |
+
"TGATGG",
|
| 456 |
+
"ATTCTT",
|
| 457 |
+
"TCAAAT",
|
| 458 |
+
"CAGTTT",
|
| 459 |
+
"TGGTGC",
|
| 460 |
+
"TGAATT",
|
| 461 |
+
"GCAACA",
|
| 462 |
+
"TGAACA",
|
| 463 |
+
"TTTCAC",
|
| 464 |
+
"ATGAAT",
|
| 465 |
+
"AGAAGA",
|
| 466 |
+
"GTTTTC",
|
| 467 |
+
"GAAAAC",
|
| 468 |
+
"CGGCAT",
|
| 469 |
+
"ATGCCG",
|
| 470 |
+
"GAAGAT",
|
| 471 |
+
"TGCAGG",
|
| 472 |
+
"GCGCCT",
|
| 473 |
+
"AAAGCG",
|
| 474 |
+
"CATCAG",
|
| 475 |
+
"GGTGGT",
|
| 476 |
+
"CACCAC",
|
| 477 |
+
"GGCGCC",
|
| 478 |
+
"TGTTGC",
|
| 479 |
+
"TTTGGC",
|
| 480 |
+
"AGCAAC",
|
| 481 |
+
"CCAGCT",
|
| 482 |
+
"ATTTGA",
|
| 483 |
+
"TGCCAG",
|
| 484 |
+
"TTAAAC",
|
| 485 |
+
"ACCATT",
|
| 486 |
+
"TCATTA",
|
| 487 |
+
"ATAAAC",
|
| 488 |
+
"CGCCGT",
|
| 489 |
+
"AGCTGG",
|
| 490 |
+
"CTGGCA",
|
| 491 |
+
"TAATGA",
|
| 492 |
+
"CCTGAA",
|
| 493 |
+
"CTGATG",
|
| 494 |
+
"TAATTG",
|
| 495 |
+
"CAGCGA",
|
| 496 |
+
"GTTGTT",
|
| 497 |
+
"AAATCT",
|
| 498 |
+
"GCGGTG",
|
| 499 |
+
"GCCACC",
|
| 500 |
+
"GTTTAA",
|
| 501 |
+
"CTTTCA",
|
| 502 |
+
"ACTGGC"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"Acinetobacter baumannii",
|
| 506 |
+
"Enterococcus faecalis",
|
| 507 |
+
"Enterococcus faecium",
|
| 508 |
+
"Escherichia coli",
|
| 509 |
+
"Klebsiella pneumoniae",
|
| 510 |
+
"Pseudomonas aeruginosa",
|
| 511 |
+
"Salmonella enterica",
|
| 512 |
+
"Staphylococcus aureus"
|
| 513 |
+
],
|
| 514 |
+
"task_type": "multiclass",
|
| 515 |
+
"target": "organism",
|
| 516 |
+
"k": 6,
|
| 517 |
+
"max_features": 500,
|
| 518 |
+
"n_samples": 862,
|
| 519 |
+
"n_features": 500,
|
| 520 |
+
"n_classes": 8
|
| 521 |
+
}
|
data_processed/ncbi/ncbi_organism_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6b15343b098e58dcf7910efddf82428330af16e7ce2034c49432cc319a7c0905
|
| 3 |
+
size 1512
|
data_processed/ncbi/ncbi_organism_y_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f148b0e42811eb9625f603512a27a83a5a8d45d295f53653557e131c887353e5
|
| 3 |
+
size 4944
|
data_processed/ncbi/ncbi_organism_y_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dc91c81d6df415c92791afd9b70aa6187bd8d6d1fab3cc706b21392519acf053
|
| 3 |
+
size 824
|
data_processed/patric/patric_cefoxitin_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8454ba894ae919d8e0e2fab310826aeac8925abbec1e19ed2f6a428373747968
|
| 3 |
+
size 144128
|
data_processed/patric/patric_cefoxitin_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d22d6ec277e2c4361861b74f8c8117da02f4d1de194c471d33582ee3223b04bd
|
| 3 |
+
size 492128
|
data_processed/patric/patric_cefoxitin_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6d30e5536b9b743982ba94c0a88938b086fef20931403583581411507645c45b
|
| 3 |
+
size 72128
|
data_processed/patric/patric_cefoxitin_metadata.json
ADDED
|
@@ -0,0 +1,515 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"ATTTTT",
|
| 4 |
+
"AAAAAT",
|
| 5 |
+
"TTTAAA",
|
| 6 |
+
"TTTTTA",
|
| 7 |
+
"TAAAAA",
|
| 8 |
+
"TATTTT",
|
| 9 |
+
"TTAAAA",
|
| 10 |
+
"TTTTAA",
|
| 11 |
+
"TTTATT",
|
| 12 |
+
"TTTAAT",
|
| 13 |
+
"AAAATA",
|
| 14 |
+
"ATTAAA",
|
| 15 |
+
"AATAAA",
|
| 16 |
+
"AAAAAA",
|
| 17 |
+
"TTATTT",
|
| 18 |
+
"TTTTTT",
|
| 19 |
+
"TTTTAT",
|
| 20 |
+
"ATTATT",
|
| 21 |
+
"AAATAA",
|
| 22 |
+
"ATAAAA",
|
| 23 |
+
"ATTAAT",
|
| 24 |
+
"AATAAT",
|
| 25 |
+
"ATATTT",
|
| 26 |
+
"TAAAAT",
|
| 27 |
+
"ATTTTA",
|
| 28 |
+
"ATTTAA",
|
| 29 |
+
"AAAATT",
|
| 30 |
+
"AAATAT",
|
| 31 |
+
"TTAAAT",
|
| 32 |
+
"TTAATT",
|
| 33 |
+
"AATATT",
|
| 34 |
+
"AATTTT",
|
| 35 |
+
"AATTAA",
|
| 36 |
+
"ATTTAT",
|
| 37 |
+
"GAAAAA",
|
| 38 |
+
"TGAAAA",
|
| 39 |
+
"TTTTCA",
|
| 40 |
+
"TTTTTC",
|
| 41 |
+
"ATAAAT",
|
| 42 |
+
"TTATTA",
|
| 43 |
+
"AAATTA",
|
| 44 |
+
"TAATTT",
|
| 45 |
+
"TAATAA",
|
| 46 |
+
"CATTTT",
|
| 47 |
+
"AAAATG",
|
| 48 |
+
"AAAAAG",
|
| 49 |
+
"CTTTTT",
|
| 50 |
+
"AAATTT",
|
| 51 |
+
"TATTTA",
|
| 52 |
+
"TATTAA",
|
| 53 |
+
"TTAATA",
|
| 54 |
+
"TGATTT",
|
| 55 |
+
"TAAATA",
|
| 56 |
+
"AAATCA",
|
| 57 |
+
"AATTTA",
|
| 58 |
+
"TAAATT",
|
| 59 |
+
"TTCTTT",
|
| 60 |
+
"TTTATC",
|
| 61 |
+
"AAAGAA",
|
| 62 |
+
"AAATGA",
|
| 63 |
+
"GATAAA",
|
| 64 |
+
"AATGAT",
|
| 65 |
+
"TCATTT",
|
| 66 |
+
"ATCATT",
|
| 67 |
+
"GAAAAT",
|
| 68 |
+
"AATTAT",
|
| 69 |
+
"TTTCTT",
|
| 70 |
+
"ATTGAT",
|
| 71 |
+
"TTTCAA",
|
| 72 |
+
"TTGAAA",
|
| 73 |
+
"TTTCAT",
|
| 74 |
+
"ATTTTC",
|
| 75 |
+
"ATCAAT",
|
| 76 |
+
"AAGAAA",
|
| 77 |
+
"ATGAAA",
|
| 78 |
+
"ATAATT",
|
| 79 |
+
"TTCATT",
|
| 80 |
+
"TTCAAT",
|
| 81 |
+
"TTTATA",
|
| 82 |
+
"ATTGAA",
|
| 83 |
+
"TATAAA",
|
| 84 |
+
"TTTGAT",
|
| 85 |
+
"TATTAT",
|
| 86 |
+
"AATCAT",
|
| 87 |
+
"TGAAAT",
|
| 88 |
+
"ATAATA",
|
| 89 |
+
"TTTGTT",
|
| 90 |
+
"ATGATT",
|
| 91 |
+
"AAAATC",
|
| 92 |
+
"TCATCA",
|
| 93 |
+
"TAATAT",
|
| 94 |
+
"TGATGA",
|
| 95 |
+
"TGATAA",
|
| 96 |
+
"TTGATT",
|
| 97 |
+
"CAAAAA",
|
| 98 |
+
"TTTTTG",
|
| 99 |
+
"CATTAA",
|
| 100 |
+
"TTAATG",
|
| 101 |
+
"AATGAA",
|
| 102 |
+
"TTATCA",
|
| 103 |
+
"TCTTTT",
|
| 104 |
+
"ATTTCA",
|
| 105 |
+
"GATTTT",
|
| 106 |
+
"ATATTA",
|
| 107 |
+
"AGAAAA",
|
| 108 |
+
"TTTTCT",
|
| 109 |
+
"TGTTTT",
|
| 110 |
+
"TTTGAA",
|
| 111 |
+
"ATCAAA",
|
| 112 |
+
"GTTTTT",
|
| 113 |
+
"TTCAAA",
|
| 114 |
+
"TCTTTA",
|
| 115 |
+
"TTGTTT",
|
| 116 |
+
"CGCCAG",
|
| 117 |
+
"AAATTG",
|
| 118 |
+
"AAAAGA",
|
| 119 |
+
"AATATA",
|
| 120 |
+
"TATATT",
|
| 121 |
+
"AAAAAC",
|
| 122 |
+
"CTGGCG",
|
| 123 |
+
"CAAAAT",
|
| 124 |
+
"TAAAGA",
|
| 125 |
+
"ATTTTG",
|
| 126 |
+
"ATTGTT",
|
| 127 |
+
"AAAACA",
|
| 128 |
+
"AACAAA",
|
| 129 |
+
"AATCAA",
|
| 130 |
+
"CTTTAT",
|
| 131 |
+
"CAATTT",
|
| 132 |
+
"TTTTGA",
|
| 133 |
+
"ATCTTT",
|
| 134 |
+
"CTTTAA",
|
| 135 |
+
"TCATTA",
|
| 136 |
+
"AATATC",
|
| 137 |
+
"AACAAT",
|
| 138 |
+
"AAAGAT",
|
| 139 |
+
"TTGATA",
|
| 140 |
+
"ATAAAG",
|
| 141 |
+
"ATATAA",
|
| 142 |
+
"GATATT",
|
| 143 |
+
"TCAAAA",
|
| 144 |
+
"TTATAT",
|
| 145 |
+
"AAACAA",
|
| 146 |
+
"TAATGA",
|
| 147 |
+
"TATCAA",
|
| 148 |
+
"CATCAT",
|
| 149 |
+
"GCCAGC",
|
| 150 |
+
"GATGAA",
|
| 151 |
+
"TTAAAG",
|
| 152 |
+
"CAATAT",
|
| 153 |
+
"TTGATG",
|
| 154 |
+
"TTGTTG",
|
| 155 |
+
"TTCATC",
|
| 156 |
+
"GCTGGC",
|
| 157 |
+
"CATCAA",
|
| 158 |
+
"AATTGA",
|
| 159 |
+
"ATGATG",
|
| 160 |
+
"TTTTGT",
|
| 161 |
+
"GATTTA",
|
| 162 |
+
"TAAATC",
|
| 163 |
+
"TTTAAC",
|
| 164 |
+
"GAAATT",
|
| 165 |
+
"TCAATT",
|
| 166 |
+
"ATATTG",
|
| 167 |
+
"ACAAAA",
|
| 168 |
+
"AATTTC",
|
| 169 |
+
"TCAATA",
|
| 170 |
+
"ATCATC",
|
| 171 |
+
"GTTAAA",
|
| 172 |
+
"TAATTA",
|
| 173 |
+
"TTCTTC",
|
| 174 |
+
"ATTTCT",
|
| 175 |
+
"ATTATC",
|
| 176 |
+
"CATTAT",
|
| 177 |
+
"ATAATG",
|
| 178 |
+
"GATGAT",
|
| 179 |
+
"AATGTT",
|
| 180 |
+
"TATTGA",
|
| 181 |
+
"CCAGCA",
|
| 182 |
+
"CAACAA",
|
| 183 |
+
"CAGCAG",
|
| 184 |
+
"AACATT",
|
| 185 |
+
"TTATTG",
|
| 186 |
+
"TTATAA",
|
| 187 |
+
"CAATAA",
|
| 188 |
+
"CTGCTG",
|
| 189 |
+
"GATAAT",
|
| 190 |
+
"GCAAAA",
|
| 191 |
+
"TGATAT",
|
| 192 |
+
"TTTTAC",
|
| 193 |
+
"GAAGAA",
|
| 194 |
+
"CCAGCG",
|
| 195 |
+
"GTAAAA",
|
| 196 |
+
"CATTTA",
|
| 197 |
+
"TTTTGC",
|
| 198 |
+
"AGAAAT",
|
| 199 |
+
"ATTTGA",
|
| 200 |
+
"AATTTG",
|
| 201 |
+
"TAAAAC",
|
| 202 |
+
"ATTCAT",
|
| 203 |
+
"ATATCA",
|
| 204 |
+
"TAAATG",
|
| 205 |
+
"ATGAAT",
|
| 206 |
+
"CGCTGG",
|
| 207 |
+
"ATTTGT",
|
| 208 |
+
"TTGCTG",
|
| 209 |
+
"TTGAAT",
|
| 210 |
+
"TGAATT",
|
| 211 |
+
"TGCTGG",
|
| 212 |
+
"CAGCAA",
|
| 213 |
+
"CAAATT",
|
| 214 |
+
"GTTTTA",
|
| 215 |
+
"AATTCA",
|
| 216 |
+
"TAATTG",
|
| 217 |
+
"ATTCAA",
|
| 218 |
+
"TGATTA",
|
| 219 |
+
"CTGAAA",
|
| 220 |
+
"ACATTT",
|
| 221 |
+
"TCAAAT",
|
| 222 |
+
"TTAACA",
|
| 223 |
+
"CAATTA",
|
| 224 |
+
"ATGTTT",
|
| 225 |
+
"ATATAT",
|
| 226 |
+
"TGTTAA",
|
| 227 |
+
"AAATGT",
|
| 228 |
+
"TTTCAG",
|
| 229 |
+
"TATCAT",
|
| 230 |
+
"ATGATA",
|
| 231 |
+
"TATTCA",
|
| 232 |
+
"TTATCT",
|
| 233 |
+
"AATTGT",
|
| 234 |
+
"TAATCA",
|
| 235 |
+
"GCTTTT",
|
| 236 |
+
"AATATG",
|
| 237 |
+
"AAACAT",
|
| 238 |
+
"TCTTCA",
|
| 239 |
+
"CATATT",
|
| 240 |
+
"TGTTGA",
|
| 241 |
+
"TGAAGA",
|
| 242 |
+
"GTTGAT",
|
| 243 |
+
"CAGCGC",
|
| 244 |
+
"ACTTTT",
|
| 245 |
+
"ATCAAC",
|
| 246 |
+
"TGAATA",
|
| 247 |
+
"AAAAGT",
|
| 248 |
+
"TCAGCA",
|
| 249 |
+
"TGCTTT",
|
| 250 |
+
"GTTAAT",
|
| 251 |
+
"AAAAGC",
|
| 252 |
+
"AACTTT",
|
| 253 |
+
"ACAAAT",
|
| 254 |
+
"ACAATT",
|
| 255 |
+
"TATAAT",
|
| 256 |
+
"TCAACA",
|
| 257 |
+
"GCATTT",
|
| 258 |
+
"CATAAA",
|
| 259 |
+
"CCATTT",
|
| 260 |
+
"ATATTC",
|
| 261 |
+
"GAATTT",
|
| 262 |
+
"TTTACC",
|
| 263 |
+
"GGTAAA",
|
| 264 |
+
"ATAATC",
|
| 265 |
+
"TGCTGA",
|
| 266 |
+
"GTATTT",
|
| 267 |
+
"AAAGTT",
|
| 268 |
+
"GTTATT",
|
| 269 |
+
"GATTAT",
|
| 270 |
+
"AGATAA",
|
| 271 |
+
"ATTAAC",
|
| 272 |
+
"TTCAGC",
|
| 273 |
+
"GAATAT",
|
| 274 |
+
"AAATTC",
|
| 275 |
+
"GCTGAA",
|
| 276 |
+
"GCGCTG",
|
| 277 |
+
"ATAAAC",
|
| 278 |
+
"CTTCTT",
|
| 279 |
+
"TATCTT",
|
| 280 |
+
"AATAAC",
|
| 281 |
+
"GTTTAT",
|
| 282 |
+
"AAGAAG",
|
| 283 |
+
"AAATAC",
|
| 284 |
+
"TTTATG",
|
| 285 |
+
"AATTGC",
|
| 286 |
+
"AAAGCA",
|
| 287 |
+
"GCAGCA",
|
| 288 |
+
"CTTTTA",
|
| 289 |
+
"TGCTGC",
|
| 290 |
+
"ATTGCT",
|
| 291 |
+
"CGTTTT",
|
| 292 |
+
"ATTATA",
|
| 293 |
+
"TATTTC",
|
| 294 |
+
"GCAATT",
|
| 295 |
+
"CTAAAA",
|
| 296 |
+
"TGGTTT",
|
| 297 |
+
"TATTTG",
|
| 298 |
+
"AAATGC",
|
| 299 |
+
"TTAAAC",
|
| 300 |
+
"CTTCAA",
|
| 301 |
+
"TTGTTA",
|
| 302 |
+
"TTTTAG",
|
| 303 |
+
"GATTAA",
|
| 304 |
+
"CGCCGC",
|
| 305 |
+
"TAAAAG",
|
| 306 |
+
"AAACCA",
|
| 307 |
+
"AAGATA",
|
| 308 |
+
"TCGCCA",
|
| 309 |
+
"TGATTG",
|
| 310 |
+
"TTGAAG",
|
| 311 |
+
"GTTGAA",
|
| 312 |
+
"AAATGG",
|
| 313 |
+
"GTTTAA",
|
| 314 |
+
"TTAATC",
|
| 315 |
+
"CAATCA",
|
| 316 |
+
"ATCAGC",
|
| 317 |
+
"GCGGCG",
|
| 318 |
+
"ATCGCC",
|
| 319 |
+
"GTAAAT",
|
| 320 |
+
"GAAATA",
|
| 321 |
+
"TGTTTA",
|
| 322 |
+
"GCTGAT",
|
| 323 |
+
"AAAACG",
|
| 324 |
+
"ATTTAC",
|
| 325 |
+
"CTTCAT",
|
| 326 |
+
"TAACAA",
|
| 327 |
+
"GCATTA",
|
| 328 |
+
"GCATCA",
|
| 329 |
+
"TACTTT",
|
| 330 |
+
"GTTGTT",
|
| 331 |
+
"TTGTAA",
|
| 332 |
+
"TTCAAC",
|
| 333 |
+
"GCTTTA",
|
| 334 |
+
"TAATGC",
|
| 335 |
+
"AGCAAT",
|
| 336 |
+
"TGGCGA",
|
| 337 |
+
"ACCATT",
|
| 338 |
+
"TAAACA",
|
| 339 |
+
"CAACAT",
|
| 340 |
+
"TGATGC",
|
| 341 |
+
"ACCAGC",
|
| 342 |
+
"TAATGT",
|
| 343 |
+
"TGATGT",
|
| 344 |
+
"GGCGAT",
|
| 345 |
+
"CATCTT",
|
| 346 |
+
"TTCATA",
|
| 347 |
+
"CACCAG",
|
| 348 |
+
"ACATCA",
|
| 349 |
+
"CAAATA",
|
| 350 |
+
"ACTTTA",
|
| 351 |
+
"ATCTTC",
|
| 352 |
+
"CATTTG",
|
| 353 |
+
"TCATAT",
|
| 354 |
+
"TTTGTA",
|
| 355 |
+
"CTGTTT",
|
| 356 |
+
"AAGATG",
|
| 357 |
+
"CCGCCA",
|
| 358 |
+
"AACAAC",
|
| 359 |
+
"CAATTG",
|
| 360 |
+
"GCTGGT",
|
| 361 |
+
"TTTCAC",
|
| 362 |
+
"GCGCCA",
|
| 363 |
+
"TTACAA",
|
| 364 |
+
"ATGAAG",
|
| 365 |
+
"TCCAGC",
|
| 366 |
+
"TACAAA",
|
| 367 |
+
"AAAACT",
|
| 368 |
+
"TGTTGT",
|
| 369 |
+
"TAATTC",
|
| 370 |
+
"TAAAGC",
|
| 371 |
+
"GAAGAT",
|
| 372 |
+
"ATGTTG",
|
| 373 |
+
"ATATGA",
|
| 374 |
+
"TATGAA",
|
| 375 |
+
"ACATTA",
|
| 376 |
+
"TGGCGG",
|
| 377 |
+
"AATGGT",
|
| 378 |
+
"AAATCT",
|
| 379 |
+
"AAAGTA",
|
| 380 |
+
"TTTGCT",
|
| 381 |
+
"ATTATG",
|
| 382 |
+
"CTGGTG",
|
| 383 |
+
"TGGCGC",
|
| 384 |
+
"AGATTT",
|
| 385 |
+
"AGTTTT",
|
| 386 |
+
"TGTAAT",
|
| 387 |
+
"GAATTA",
|
| 388 |
+
"AGCAAA",
|
| 389 |
+
"TTATCG",
|
| 390 |
+
"CGATAA",
|
| 391 |
+
"CAGAAA",
|
| 392 |
+
"TTATTC",
|
| 393 |
+
"TGGTGA",
|
| 394 |
+
"TTGCTT",
|
| 395 |
+
"TTTCTG",
|
| 396 |
+
"CATAAT",
|
| 397 |
+
"GTGAAA",
|
| 398 |
+
"CAGCAT",
|
| 399 |
+
"GCCATT",
|
| 400 |
+
"TAAAGT",
|
| 401 |
+
"AAACAG",
|
| 402 |
+
"TCACCA",
|
| 403 |
+
"GCTGGA",
|
| 404 |
+
"TGCAAT",
|
| 405 |
+
"CAAATG",
|
| 406 |
+
"ATTTGC",
|
| 407 |
+
"TTTTCC",
|
| 408 |
+
"ATCACC",
|
| 409 |
+
"GGAAAA",
|
| 410 |
+
"TCATAA",
|
| 411 |
+
"ATCATA",
|
| 412 |
+
"ATTACA",
|
| 413 |
+
"ACAACA",
|
| 414 |
+
"ATGCTG",
|
| 415 |
+
"CCATCA",
|
| 416 |
+
"AATACA",
|
| 417 |
+
"TATTGC",
|
| 418 |
+
"TTACTT",
|
| 419 |
+
"CATCAG",
|
| 420 |
+
"TGCCAG",
|
| 421 |
+
"TTTGCC",
|
| 422 |
+
"GCAAAT",
|
| 423 |
+
"TAATAC",
|
| 424 |
+
"CTGGCA",
|
| 425 |
+
"GCAATA",
|
| 426 |
+
"GTTTCA",
|
| 427 |
+
"GGCAAA",
|
| 428 |
+
"TTGTTC",
|
| 429 |
+
"AACATC",
|
| 430 |
+
"CGCTTT",
|
| 431 |
+
"TGTATT",
|
| 432 |
+
"AATGGC",
|
| 433 |
+
"TGTTGC",
|
| 434 |
+
"AATTAC",
|
| 435 |
+
"GATGTT",
|
| 436 |
+
"GAATAA",
|
| 437 |
+
"TATTGT",
|
| 438 |
+
"ACAATA",
|
| 439 |
+
"AATACC",
|
| 440 |
+
"GCTGTT",
|
| 441 |
+
"TATGAT",
|
| 442 |
+
"TTATGA",
|
| 443 |
+
"CTGATG",
|
| 444 |
+
"TGGCAA",
|
| 445 |
+
"TGCATT",
|
| 446 |
+
"CTTTCA",
|
| 447 |
+
"TTTCCA",
|
| 448 |
+
"CGCCAT",
|
| 449 |
+
"ATTGCA",
|
| 450 |
+
"TTGCCA",
|
| 451 |
+
"TGATGG",
|
| 452 |
+
"AATCTT",
|
| 453 |
+
"GGTATT",
|
| 454 |
+
"AAAGCG",
|
| 455 |
+
"AAAACC",
|
| 456 |
+
"ATTCTT",
|
| 457 |
+
"GTAATT",
|
| 458 |
+
"TTAACT",
|
| 459 |
+
"GTAATA",
|
| 460 |
+
"TGTAAA",
|
| 461 |
+
"GGTGAA",
|
| 462 |
+
"GGTGAT",
|
| 463 |
+
"GTATTA",
|
| 464 |
+
"TTTACT",
|
| 465 |
+
"ATGTTA",
|
| 466 |
+
"TGAAAG",
|
| 467 |
+
"ATGGCG",
|
| 468 |
+
"TTTACA",
|
| 469 |
+
"TATTAC",
|
| 470 |
+
"AATGCA",
|
| 471 |
+
"AGCATT",
|
| 472 |
+
"CATTGA",
|
| 473 |
+
"TAACTT",
|
| 474 |
+
"AGTTAA",
|
| 475 |
+
"TGTTCA",
|
| 476 |
+
"AAGTAA",
|
| 477 |
+
"CATTTC",
|
| 478 |
+
"GCAACA",
|
| 479 |
+
"TCATCT",
|
| 480 |
+
"AGATGA",
|
| 481 |
+
"CTGATT",
|
| 482 |
+
"TTCACC",
|
| 483 |
+
"TTCCAG",
|
| 484 |
+
"TTGCCG",
|
| 485 |
+
"GGTTTT",
|
| 486 |
+
"GGCGGC",
|
| 487 |
+
"GCCGCC",
|
| 488 |
+
"AATCAG",
|
| 489 |
+
"ATTACT",
|
| 490 |
+
"AAGATT",
|
| 491 |
+
"AGATAT",
|
| 492 |
+
"AAACTT",
|
| 493 |
+
"TGTTAT",
|
| 494 |
+
"AAACTG",
|
| 495 |
+
"AACAGC",
|
| 496 |
+
"AAGTTT",
|
| 497 |
+
"TAACAT",
|
| 498 |
+
"AAGCAA",
|
| 499 |
+
"CACCAT",
|
| 500 |
+
"TCTTCT",
|
| 501 |
+
"CCACCA",
|
| 502 |
+
"CTGGAA"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"Resistant",
|
| 506 |
+
"Susceptible"
|
| 507 |
+
],
|
| 508 |
+
"task_type": "binary",
|
| 509 |
+
"antibiotic": "cefoxitin",
|
| 510 |
+
"k": 6,
|
| 511 |
+
"max_features": 500,
|
| 512 |
+
"n_samples": 177,
|
| 513 |
+
"n_features": 500,
|
| 514 |
+
"n_classes": 2
|
| 515 |
+
}
|
data_processed/patric/patric_cefoxitin_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e71a2e93bfd80dd8cf7121ac173ef50b39cad4880636996a71ed2e88365cba44
|
| 3 |
+
size 416
|
data_processed/patric/patric_cefoxitin_y_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:67f5fb06d9c46cc9d6bcb85b2cf0c4f996ead893676c840af84703b3ee5b1f10
|
| 3 |
+
size 1112
|
data_processed/patric/patric_cefoxitin_y_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:483ffcdec578185306133f00629561f277b95ac930e534f47de7d3489851d221
|
| 3 |
+
size 272
|
data_processed/patric/patric_ciprofloxacin_X_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1c5ffb232d90b9318edac49958a520054b5a2071183a6229a1fc2338675faa80
|
| 3 |
+
size 204128
|
data_processed/patric/patric_ciprofloxacin_X_train.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:34673e4d5674a53ef5e5e789ab581bdfda6f2d67880ff77fd99cbb2cf4f03b7b
|
| 3 |
+
size 700128
|
data_processed/patric/patric_ciprofloxacin_X_val.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:47299679a94433999a9b166c9547fdd76cc2b670293e909a92c769e2db6a4b6d
|
| 3 |
+
size 104128
|
data_processed/patric/patric_ciprofloxacin_metadata.json
ADDED
|
@@ -0,0 +1,515 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"feature_names": [
|
| 3 |
+
"AAAAAT",
|
| 4 |
+
"ATTTTT",
|
| 5 |
+
"TTTAAA",
|
| 6 |
+
"TTTTTA",
|
| 7 |
+
"TAAAAA",
|
| 8 |
+
"TTAAAA",
|
| 9 |
+
"TTTTAA",
|
| 10 |
+
"TATTTT",
|
| 11 |
+
"AAAAAA",
|
| 12 |
+
"TTTATT",
|
| 13 |
+
"AAAATA",
|
| 14 |
+
"TTTAAT",
|
| 15 |
+
"ATTAAA",
|
| 16 |
+
"TTTTTT",
|
| 17 |
+
"AATAAA",
|
| 18 |
+
"TTATTT",
|
| 19 |
+
"AAATAA",
|
| 20 |
+
"TTTTAT",
|
| 21 |
+
"ATAAAA",
|
| 22 |
+
"AAAATT",
|
| 23 |
+
"ATTTAA",
|
| 24 |
+
"TTAAAT",
|
| 25 |
+
"TAAAAT",
|
| 26 |
+
"ATTATT",
|
| 27 |
+
"ATTTTA",
|
| 28 |
+
"AATTTT",
|
| 29 |
+
"ATATTT",
|
| 30 |
+
"AATAAT",
|
| 31 |
+
"AATATT",
|
| 32 |
+
"AAATAT",
|
| 33 |
+
"TTAATT",
|
| 34 |
+
"ATTAAT",
|
| 35 |
+
"AATTAA",
|
| 36 |
+
"ATTTAT",
|
| 37 |
+
"GAAAAA",
|
| 38 |
+
"TGAAAA",
|
| 39 |
+
"AAATTA",
|
| 40 |
+
"TTTTCA",
|
| 41 |
+
"TTTTTC",
|
| 42 |
+
"AAATTT",
|
| 43 |
+
"ATAAAT",
|
| 44 |
+
"TAATTT",
|
| 45 |
+
"AAAAAG",
|
| 46 |
+
"CTTTTT",
|
| 47 |
+
"TTATTA",
|
| 48 |
+
"CATTTT",
|
| 49 |
+
"TATTTA",
|
| 50 |
+
"AAAATG",
|
| 51 |
+
"TAATAA",
|
| 52 |
+
"TAAATA",
|
| 53 |
+
"TGATTT",
|
| 54 |
+
"TATTAA",
|
| 55 |
+
"AAATCA",
|
| 56 |
+
"TTAATA",
|
| 57 |
+
"AATTTA",
|
| 58 |
+
"TAAATT",
|
| 59 |
+
"TTCTTT",
|
| 60 |
+
"AAAGAA",
|
| 61 |
+
"AAATGA",
|
| 62 |
+
"TCATTT",
|
| 63 |
+
"TTTCTT",
|
| 64 |
+
"TTGAAA",
|
| 65 |
+
"TTTCAA",
|
| 66 |
+
"AAGAAA",
|
| 67 |
+
"CAAAAA",
|
| 68 |
+
"TTTATC",
|
| 69 |
+
"TTTTTG",
|
| 70 |
+
"ATTGAT",
|
| 71 |
+
"GAAAAT",
|
| 72 |
+
"GATAAA",
|
| 73 |
+
"AATTAT",
|
| 74 |
+
"ATCAAT",
|
| 75 |
+
"TTTATA",
|
| 76 |
+
"AATGAT",
|
| 77 |
+
"TTCAAT",
|
| 78 |
+
"TTTCAT",
|
| 79 |
+
"ATTTTC",
|
| 80 |
+
"TATAAA",
|
| 81 |
+
"ATGAAA",
|
| 82 |
+
"ATTGAA",
|
| 83 |
+
"TTTGAT",
|
| 84 |
+
"ATCATT",
|
| 85 |
+
"ATAATT",
|
| 86 |
+
"TTCATT",
|
| 87 |
+
"TGAAAT",
|
| 88 |
+
"GTTTTT",
|
| 89 |
+
"AAAATC",
|
| 90 |
+
"AAAAAC",
|
| 91 |
+
"AAATTG",
|
| 92 |
+
"TCTTTT",
|
| 93 |
+
"TTTGAA",
|
| 94 |
+
"TTTGTT",
|
| 95 |
+
"TTTTGA",
|
| 96 |
+
"TGTTTT",
|
| 97 |
+
"CAAAAT",
|
| 98 |
+
"ATTTTG",
|
| 99 |
+
"AGAAAA",
|
| 100 |
+
"TTCAAA",
|
| 101 |
+
"AAAAGA",
|
| 102 |
+
"GATTTT",
|
| 103 |
+
"TTTTCT",
|
| 104 |
+
"CTTTAA",
|
| 105 |
+
"TTGATT",
|
| 106 |
+
"ATTTCA",
|
| 107 |
+
"TCAAAA",
|
| 108 |
+
"ATCAAA",
|
| 109 |
+
"CAATTT",
|
| 110 |
+
"AATGAA",
|
| 111 |
+
"CATTAA",
|
| 112 |
+
"TTAATG",
|
| 113 |
+
"TCTTTA",
|
| 114 |
+
"AATCAT",
|
| 115 |
+
"AAAACA",
|
| 116 |
+
"ATGATT",
|
| 117 |
+
"TGATGA",
|
| 118 |
+
"TATTAT",
|
| 119 |
+
"TTGTTT",
|
| 120 |
+
"TCATCA",
|
| 121 |
+
"TAAAGA",
|
| 122 |
+
"TTAAAG",
|
| 123 |
+
"ATAATA",
|
| 124 |
+
"TAATAT",
|
| 125 |
+
"TGATAA",
|
| 126 |
+
"ATATTA",
|
| 127 |
+
"AACAAA",
|
| 128 |
+
"CTTTAT",
|
| 129 |
+
"AATCAA",
|
| 130 |
+
"TTATCA",
|
| 131 |
+
"AAACAA",
|
| 132 |
+
"AATATA",
|
| 133 |
+
"ATAAAG",
|
| 134 |
+
"ATCTTT",
|
| 135 |
+
"TATATT",
|
| 136 |
+
"ATTGTT",
|
| 137 |
+
"AAAGAT",
|
| 138 |
+
"AATTGA",
|
| 139 |
+
"CAATAT",
|
| 140 |
+
"AACAAT",
|
| 141 |
+
"AATATC",
|
| 142 |
+
"TTGATA",
|
| 143 |
+
"GATATT",
|
| 144 |
+
"GAAATT",
|
| 145 |
+
"TTGATG",
|
| 146 |
+
"TCAATT",
|
| 147 |
+
"ATATTG",
|
| 148 |
+
"CATCAA",
|
| 149 |
+
"TCATTA",
|
| 150 |
+
"GATTTA",
|
| 151 |
+
"TTTTGT",
|
| 152 |
+
"ATATAA",
|
| 153 |
+
"AATTTC",
|
| 154 |
+
"TTATAT",
|
| 155 |
+
"TATCAA",
|
| 156 |
+
"TTGTTG",
|
| 157 |
+
"TAATGA",
|
| 158 |
+
"TAAAAC",
|
| 159 |
+
"TTTAAC",
|
| 160 |
+
"ACAAAA",
|
| 161 |
+
"TAAATC",
|
| 162 |
+
"GCAAAA",
|
| 163 |
+
"GATGAA",
|
| 164 |
+
"TTTTGC",
|
| 165 |
+
"TCAATA",
|
| 166 |
+
"GTTTTA",
|
| 167 |
+
"TTCATC",
|
| 168 |
+
"GTTAAA",
|
| 169 |
+
"TTATTG",
|
| 170 |
+
"CAATAA",
|
| 171 |
+
"CATCAT",
|
| 172 |
+
"ATGATG",
|
| 173 |
+
"TATTGA",
|
| 174 |
+
"GTAAAA",
|
| 175 |
+
"TTTTAC",
|
| 176 |
+
"AATTTG",
|
| 177 |
+
"ATTTCT",
|
| 178 |
+
"CAACAA",
|
| 179 |
+
"GCTTTT",
|
| 180 |
+
"CATTTA",
|
| 181 |
+
"TAAATG",
|
| 182 |
+
"CAAATT",
|
| 183 |
+
"ATTTGA",
|
| 184 |
+
"TTGCTG",
|
| 185 |
+
"AAAAGC",
|
| 186 |
+
"AGAAAT",
|
| 187 |
+
"TTATAA",
|
| 188 |
+
"TGCTTT",
|
| 189 |
+
"TAATTA",
|
| 190 |
+
"CAGCAA",
|
| 191 |
+
"TTCTTC",
|
| 192 |
+
"TTGAAT",
|
| 193 |
+
"ATTCAA",
|
| 194 |
+
"ATAATG",
|
| 195 |
+
"AATGTT",
|
| 196 |
+
"GAAGAA",
|
| 197 |
+
"CGCCAG",
|
| 198 |
+
"AACTTT",
|
| 199 |
+
"CATTAT",
|
| 200 |
+
"AAAGCA",
|
| 201 |
+
"ATTCAT",
|
| 202 |
+
"TCAAAT",
|
| 203 |
+
"TGAATT",
|
| 204 |
+
"ATGAAT",
|
| 205 |
+
"AACATT",
|
| 206 |
+
"TAATTG",
|
| 207 |
+
"AAAGTT",
|
| 208 |
+
"ATCATC",
|
| 209 |
+
"CTGGCG",
|
| 210 |
+
"ACTTTT",
|
| 211 |
+
"AAAAGT",
|
| 212 |
+
"CAATTA",
|
| 213 |
+
"AATTCA",
|
| 214 |
+
"TATTCA",
|
| 215 |
+
"ATTTGT",
|
| 216 |
+
"ATTATC",
|
| 217 |
+
"GATGAT",
|
| 218 |
+
"CTAAAA",
|
| 219 |
+
"CTTTTA",
|
| 220 |
+
"TGATAT",
|
| 221 |
+
"AATTGC",
|
| 222 |
+
"TTTTAG",
|
| 223 |
+
"TGAATA",
|
| 224 |
+
"CAGCAG",
|
| 225 |
+
"CTGAAA",
|
| 226 |
+
"TGGTTT",
|
| 227 |
+
"GCTTTA",
|
| 228 |
+
"TGAAGA",
|
| 229 |
+
"TAAAAG",
|
| 230 |
+
"CTGCTG",
|
| 231 |
+
"ATATCA",
|
| 232 |
+
"GATAAT",
|
| 233 |
+
"TCTTCA",
|
| 234 |
+
"GCAATT",
|
| 235 |
+
"GCCAGC",
|
| 236 |
+
"AAACCA",
|
| 237 |
+
"ATGTTT",
|
| 238 |
+
"GTATTT",
|
| 239 |
+
"GCTGGC",
|
| 240 |
+
"AATATG",
|
| 241 |
+
"GCATTT",
|
| 242 |
+
"CATATT",
|
| 243 |
+
"CCATTT",
|
| 244 |
+
"TTTCAG",
|
| 245 |
+
"ACATTT",
|
| 246 |
+
"TGTTGA",
|
| 247 |
+
"TGATTA",
|
| 248 |
+
"ATTGCT",
|
| 249 |
+
"GGTAAA",
|
| 250 |
+
"TAAAGC",
|
| 251 |
+
"AAATGT",
|
| 252 |
+
"TTTACC",
|
| 253 |
+
"CATAAA",
|
| 254 |
+
"AAATAC",
|
| 255 |
+
"AAACAT",
|
| 256 |
+
"CCAGCA",
|
| 257 |
+
"AAAACT",
|
| 258 |
+
"TTAAAC",
|
| 259 |
+
"TCAGCA",
|
| 260 |
+
"AATTGT",
|
| 261 |
+
"TTATCT",
|
| 262 |
+
"TTGAAG",
|
| 263 |
+
"ATATTC",
|
| 264 |
+
"GAATTT",
|
| 265 |
+
"CTTCAA",
|
| 266 |
+
"GTTTAA",
|
| 267 |
+
"AGTTTT",
|
| 268 |
+
"TGCTGA",
|
| 269 |
+
"TCAACA",
|
| 270 |
+
"AAATTC",
|
| 271 |
+
"TTTATG",
|
| 272 |
+
"GAATAT",
|
| 273 |
+
"ACAAAT",
|
| 274 |
+
"TTAACA",
|
| 275 |
+
"ATATAT",
|
| 276 |
+
"AAATGC",
|
| 277 |
+
"AAGAAG",
|
| 278 |
+
"ACAATT",
|
| 279 |
+
"AAATGG",
|
| 280 |
+
"TAATCA",
|
| 281 |
+
"CTTCTT",
|
| 282 |
+
"AGCAAT",
|
| 283 |
+
"TGTTAA",
|
| 284 |
+
"TATTTG",
|
| 285 |
+
"GTTGAT",
|
| 286 |
+
"TGCTGG",
|
| 287 |
+
"GTTATT",
|
| 288 |
+
"AGATAA",
|
| 289 |
+
"GTTTAT",
|
| 290 |
+
"GTAAAT",
|
| 291 |
+
"GCTGAA",
|
| 292 |
+
"TTCAGC",
|
| 293 |
+
"ATAAAC",
|
| 294 |
+
"TTTGCT",
|
| 295 |
+
"ATCAAC",
|
| 296 |
+
"TTGCTT",
|
| 297 |
+
"ACTTTA",
|
| 298 |
+
"AATAAC",
|
| 299 |
+
"ATTTAC",
|
| 300 |
+
"CAATTG",
|
| 301 |
+
"GTTGAA",
|
| 302 |
+
"TGCTGC",
|
| 303 |
+
"AGCAAA",
|
| 304 |
+
"GTTAAT",
|
| 305 |
+
"GCAGCA",
|
| 306 |
+
"TACTTT",
|
| 307 |
+
"TATTTC",
|
| 308 |
+
"ATGATA",
|
| 309 |
+
"TATCTT",
|
| 310 |
+
"TATCAT",
|
| 311 |
+
"TGATTG",
|
| 312 |
+
"CAAATA",
|
| 313 |
+
"TTCAAC",
|
| 314 |
+
"GATTAT",
|
| 315 |
+
"ATAATC",
|
| 316 |
+
"TGTTTA",
|
| 317 |
+
"GAAATA",
|
| 318 |
+
"CAATCA",
|
| 319 |
+
"ATTAAC",
|
| 320 |
+
"CGTTTT",
|
| 321 |
+
"AAGATA",
|
| 322 |
+
"ACCATT",
|
| 323 |
+
"TAAAGT",
|
| 324 |
+
"CATTTG",
|
| 325 |
+
"AAATCT",
|
| 326 |
+
"GCATTA",
|
| 327 |
+
"TGCAAT",
|
| 328 |
+
"AAAGTA",
|
| 329 |
+
"AGATTT",
|
| 330 |
+
"TTGTAA",
|
| 331 |
+
"AAGTTT",
|
| 332 |
+
"AAACTT",
|
| 333 |
+
"GATTAA",
|
| 334 |
+
"CTTCAT",
|
| 335 |
+
"TAATGC",
|
| 336 |
+
"GTTGTT",
|
| 337 |
+
"AAAACG",
|
| 338 |
+
"TAAACA",
|
| 339 |
+
"CCAGCG",
|
| 340 |
+
"AATGGT",
|
| 341 |
+
"TATAAT",
|
| 342 |
+
"TTAATC",
|
| 343 |
+
"AAGCAA",
|
| 344 |
+
"ATTGCA",
|
| 345 |
+
"AAGATG",
|
| 346 |
+
"TTTGTA",
|
| 347 |
+
"GCATCA",
|
| 348 |
+
"TTGTTA",
|
| 349 |
+
"CATCTT",
|
| 350 |
+
"TACAAA",
|
| 351 |
+
"AAAACC",
|
| 352 |
+
"TTACAA",
|
| 353 |
+
"CGCTGG",
|
| 354 |
+
"ATTTGC",
|
| 355 |
+
"GCAAAT",
|
| 356 |
+
"CAAATG",
|
| 357 |
+
"TGCAAA",
|
| 358 |
+
"CTGTTT",
|
| 359 |
+
"TGATGC",
|
| 360 |
+
"ATGAAG",
|
| 361 |
+
"TTACTT",
|
| 362 |
+
"CAACAT",
|
| 363 |
+
"TTCATA",
|
| 364 |
+
"AACAAC",
|
| 365 |
+
"GGTTTT",
|
| 366 |
+
"TATTGC",
|
| 367 |
+
"TCTAAA",
|
| 368 |
+
"TATGAA",
|
| 369 |
+
"ACCAAT",
|
| 370 |
+
"TAACAA",
|
| 371 |
+
"TTATTC",
|
| 372 |
+
"GCAATA",
|
| 373 |
+
"ATTATA",
|
| 374 |
+
"TTTGCA",
|
| 375 |
+
"TGATGT",
|
| 376 |
+
"ATTGGT",
|
| 377 |
+
"ATGTTG",
|
| 378 |
+
"TTTAGA",
|
| 379 |
+
"TAATTC",
|
| 380 |
+
"TGTTGT",
|
| 381 |
+
"TTTCAC",
|
| 382 |
+
"AATTAC",
|
| 383 |
+
"TGGCAA",
|
| 384 |
+
"AAGTAA",
|
| 385 |
+
"TGGTGA",
|
| 386 |
+
"TTTGGT",
|
| 387 |
+
"GCTGAT",
|
| 388 |
+
"TGCATT",
|
| 389 |
+
"ATCTTC",
|
| 390 |
+
"GAATTA",
|
| 391 |
+
"ATCAGC",
|
| 392 |
+
"TGTAAT",
|
| 393 |
+
"AAACAG",
|
| 394 |
+
"GAAGAT",
|
| 395 |
+
"TGTAAA",
|
| 396 |
+
"TTGCCA",
|
| 397 |
+
"TTGTTC",
|
| 398 |
+
"TAATGT",
|
| 399 |
+
"ACATCA",
|
| 400 |
+
"TTTACT",
|
| 401 |
+
"GAATAA",
|
| 402 |
+
"CAGCGC",
|
| 403 |
+
"TTTACA",
|
| 404 |
+
"TCATAT",
|
| 405 |
+
"ATTATG",
|
| 406 |
+
"TGTATT",
|
| 407 |
+
"GTGAAA",
|
| 408 |
+
"AATACA",
|
| 409 |
+
"CAGAAA",
|
| 410 |
+
"ATATGA",
|
| 411 |
+
"GTTTTG",
|
| 412 |
+
"AATACC",
|
| 413 |
+
"AATCTT",
|
| 414 |
+
"AATGCA",
|
| 415 |
+
"TTTCTG",
|
| 416 |
+
"ATTCTT",
|
| 417 |
+
"TTAACT",
|
| 418 |
+
"GTAATT",
|
| 419 |
+
"TCATAA",
|
| 420 |
+
"TCACCA",
|
| 421 |
+
"AGTTAA",
|
| 422 |
+
"ATTTAG",
|
| 423 |
+
"ACATTA",
|
| 424 |
+
"CAGCAT",
|
| 425 |
+
"GGTATT",
|
| 426 |
+
"TGTTGC",
|
| 427 |
+
"AGCATT",
|
| 428 |
+
"GCCATT",
|
| 429 |
+
"ATTACA",
|
| 430 |
+
"ACCAGC",
|
| 431 |
+
"TATTGT",
|
| 432 |
+
"GCTAAA",
|
| 433 |
+
"AGTAAA",
|
| 434 |
+
"TTTGCC",
|
| 435 |
+
"ACCAAA",
|
| 436 |
+
"TTTAGC",
|
| 437 |
+
"AATGGC",
|
| 438 |
+
"ATGCTG",
|
| 439 |
+
"AAGATT",
|
| 440 |
+
"GCTGGT",
|
| 441 |
+
"GGCAAA",
|
| 442 |
+
"TTATGA",
|
| 443 |
+
"ACAATA",
|
| 444 |
+
"ACAACA",
|
| 445 |
+
"CTTTTG",
|
| 446 |
+
"CATAAT",
|
| 447 |
+
"TAACTT",
|
| 448 |
+
"CTAAAT",
|
| 449 |
+
"CAAAAC",
|
| 450 |
+
"AAGAAT",
|
| 451 |
+
"TCGCCA",
|
| 452 |
+
"GAACAA",
|
| 453 |
+
"TGTTCA",
|
| 454 |
+
"GGAAAA",
|
| 455 |
+
"GTTTCA",
|
| 456 |
+
"AATGCT",
|
| 457 |
+
"GCGCTG",
|
| 458 |
+
"CACCAG",
|
| 459 |
+
"TTTTCC",
|
| 460 |
+
"CAAAAG",
|
| 461 |
+
"TAATAC",
|
| 462 |
+
"AAACTG",
|
| 463 |
+
"TTTCCA",
|
| 464 |
+
"TTGCAA",
|
| 465 |
+
"GCTGTT",
|
| 466 |
+
"GCAACA",
|
| 467 |
+
"AAGTTA",
|
| 468 |
+
"TGGTAA",
|
| 469 |
+
"TTCTAA",
|
| 470 |
+
"TTAGAA",
|
| 471 |
+
"GGTGAA",
|
| 472 |
+
"CAGTTT",
|
| 473 |
+
"CTGGTG",
|
| 474 |
+
"AGATGA",
|
| 475 |
+
"ATTACT",
|
| 476 |
+
"AGCTTT",
|
| 477 |
+
"GTAATA",
|
| 478 |
+
"TGGCGA",
|
| 479 |
+
"CTATTT",
|
| 480 |
+
"TGAACA",
|
| 481 |
+
"TATTAC",
|
| 482 |
+
"GTATTA",
|
| 483 |
+
"ATCGCC",
|
| 484 |
+
"TCATCT",
|
| 485 |
+
"CTTTCA",
|
| 486 |
+
"CTGCAA",
|
| 487 |
+
"TTTAGT",
|
| 488 |
+
"CCATCA",
|
| 489 |
+
"ATCATA",
|
| 490 |
+
"TGAAAG",
|
| 491 |
+
"TTATCG",
|
| 492 |
+
"TTCACC",
|
| 493 |
+
"GATGTT",
|
| 494 |
+
"TGGAAA",
|
| 495 |
+
"AAAGCT",
|
| 496 |
+
"CGCTTT",
|
| 497 |
+
"AACATC",
|
| 498 |
+
"CGATAA",
|
| 499 |
+
"TTGCAG",
|
| 500 |
+
"TGATGG",
|
| 501 |
+
"TATGAT",
|
| 502 |
+
"CTGATT"
|
| 503 |
+
],
|
| 504 |
+
"class_names": [
|
| 505 |
+
"Resistant",
|
| 506 |
+
"Susceptible"
|
| 507 |
+
],
|
| 508 |
+
"task_type": "binary",
|
| 509 |
+
"antibiotic": "ciprofloxacin",
|
| 510 |
+
"k": 6,
|
| 511 |
+
"max_features": 500,
|
| 512 |
+
"n_samples": 252,
|
| 513 |
+
"n_features": 500,
|
| 514 |
+
"n_classes": 2
|
| 515 |
+
}
|
data_processed/patric/patric_ciprofloxacin_y_test.npy
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9ef7a315e5aa72527355c6c990864def6a8f3414622deeae52a570161339410e
|
| 3 |
+
size 536
|