hossainlab Claude Opus 4.6 commited on
Commit
3255634
·
1 Parent(s): a3107bb

Deploy DeepAMR API backend

Browse files

FastAPI backend with deep learning AMR prediction:
- 11 drug classes, 84.3% Micro F1, 98.6% AUC
- FASTA/FASTQ file upload and prediction
- Bangladesh clinical guidelines (DGHS/IEDCR)
- PDF clinical report generation
- User auth, prediction history, admin dashboard
- Rate limiting and security hardening

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. .gitignore +7 -0
  2. Dockerfile +30 -0
  3. README.md +15 -5
  4. data_processed/card/card_drug_class_X_test.npy +3 -0
  5. data_processed/card/card_drug_class_X_train.npy +3 -0
  6. data_processed/card/card_drug_class_X_val.npy +3 -0
  7. data_processed/card/card_drug_class_metadata.json +551 -0
  8. data_processed/card/card_drug_class_y_test.npy +3 -0
  9. data_processed/card/card_drug_class_y_train.npy +3 -0
  10. data_processed/card/card_drug_class_y_val.npy +3 -0
  11. data_processed/card/card_gene_family_X_test.npy +3 -0
  12. data_processed/card/card_gene_family_X_train.npy +3 -0
  13. data_processed/card/card_gene_family_X_val.npy +3 -0
  14. data_processed/card/card_gene_family_metadata.json +911 -0
  15. data_processed/card/card_gene_family_y_test.npy +3 -0
  16. data_processed/card/card_gene_family_y_train.npy +3 -0
  17. data_processed/card/card_gene_family_y_val.npy +3 -0
  18. data_processed/card/card_mechanism_X_test.npy +3 -0
  19. data_processed/card/card_mechanism_X_train.npy +3 -0
  20. data_processed/card/card_mechanism_X_val.npy +3 -0
  21. data_processed/card/card_mechanism_metadata.json +523 -0
  22. data_processed/card/card_mechanism_y_test.npy +3 -0
  23. data_processed/card/card_mechanism_y_train.npy +3 -0
  24. data_processed/card/card_mechanism_y_val.npy +3 -0
  25. data_processed/ncbi/ncbi_amr_X_test.npy +3 -0
  26. data_processed/ncbi/ncbi_amr_X_train.npy +3 -0
  27. data_processed/ncbi/ncbi_amr_X_val.npy +3 -0
  28. data_processed/ncbi/ncbi_amr_metadata.json +537 -0
  29. data_processed/ncbi/ncbi_amr_y_test.npy +3 -0
  30. data_processed/ncbi/ncbi_amr_y_train.npy +3 -0
  31. data_processed/ncbi/ncbi_amr_y_val.npy +3 -0
  32. data_processed/ncbi/ncbi_organism_X_test.npy +3 -0
  33. data_processed/ncbi/ncbi_organism_X_train.npy +3 -0
  34. data_processed/ncbi/ncbi_organism_X_val.npy +3 -0
  35. data_processed/ncbi/ncbi_organism_metadata.json +521 -0
  36. data_processed/ncbi/ncbi_organism_y_test.npy +3 -0
  37. data_processed/ncbi/ncbi_organism_y_train.npy +3 -0
  38. data_processed/ncbi/ncbi_organism_y_val.npy +3 -0
  39. data_processed/patric/patric_cefoxitin_X_test.npy +3 -0
  40. data_processed/patric/patric_cefoxitin_X_train.npy +3 -0
  41. data_processed/patric/patric_cefoxitin_X_val.npy +3 -0
  42. data_processed/patric/patric_cefoxitin_metadata.json +515 -0
  43. data_processed/patric/patric_cefoxitin_y_test.npy +3 -0
  44. data_processed/patric/patric_cefoxitin_y_train.npy +3 -0
  45. data_processed/patric/patric_cefoxitin_y_val.npy +3 -0
  46. data_processed/patric/patric_ciprofloxacin_X_test.npy +3 -0
  47. data_processed/patric/patric_ciprofloxacin_X_train.npy +3 -0
  48. data_processed/patric/patric_ciprofloxacin_X_val.npy +3 -0
  49. data_processed/patric/patric_ciprofloxacin_metadata.json +515 -0
  50. data_processed/patric/patric_ciprofloxacin_y_test.npy +3 -0
.gitignore ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ __pycache__/
2
+ *.py[codz]
3
+ .DS_Store
4
+ deepamr.db
5
+ deepamr.db-wal
6
+ deepamr.db-shm
7
+ .env
Dockerfile ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ FROM python:3.11-slim
2
+
3
+ # HF Spaces runs as uid 1000
4
+ RUN useradd -m -u 1000 user
5
+
6
+ WORKDIR /app
7
+
8
+ # Install system dependencies
9
+ RUN apt-get update && apt-get install -y --no-install-recommends \
10
+ gcc g++ && \
11
+ rm -rf /var/lib/apt/lists/*
12
+
13
+ # Install Python dependencies
14
+ COPY requirements.txt .
15
+ RUN pip install --no-cache-dir -r requirements.txt
16
+
17
+ # Copy application code
18
+ COPY src/ src/
19
+ COPY models/ models/
20
+ COPY data_processed/ data/processed/
21
+ COPY demo/ demo/
22
+
23
+ # Make everything writable for the app user (needed for SQLite DB)
24
+ RUN chown -R user:user /app
25
+
26
+ USER user
27
+
28
+ EXPOSE 7860
29
+
30
+ CMD ["uvicorn", "src.api.main:app", "--host", "0.0.0.0", "--port", "7860"]
README.md CHANGED
@@ -1,12 +1,22 @@
1
  ---
2
- title: Deepamr Api
3
- emoji: 📊
4
  colorFrom: blue
5
- colorTo: pink
6
  sdk: docker
 
7
  pinned: false
8
  license: mit
9
- short_description: 'DeepAMR: Deep Learning for Antimicrobial Resistance Predicti'
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: DeepAMR API
3
+ emoji: 🧬
4
  colorFrom: blue
5
+ colorTo: green
6
  sdk: docker
7
+ app_port: 7860
8
  pinned: false
9
  license: mit
10
+ short_description: 'Deep Learning for AMR Prediction'
11
  ---
12
 
13
+ # DeepAMR - Antimicrobial Resistance Prediction API
14
+
15
+ Deep Learning API for predicting antibiotic resistance from bacterial genomic sequences.
16
+
17
+ - **11 drug classes** supported
18
+ - **84.3% Micro F1**, **98.6% AUC**
19
+ - Bangladesh-specific clinical guidelines
20
+ - PDF report generation
21
+
22
+ API docs available at `/docs`
data_processed/card/card_drug_class_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:37754e2ffccec906083a21935e46f862471a3945ffa65b46fea2c87ae191bb5d
3
+ size 4844128
data_processed/card/card_drug_class_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7053e2e0734de9ae8c46483fad2fb11c8d181f371c26d250cbfe5eea2abe3355
3
+ size 16948128
data_processed/card/card_drug_class_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:562364532d2014e72213a80cd2191590b9963132f29c09843abf6b6a1fc6df1b
3
+ size 2424128
data_processed/card/card_drug_class_metadata.json ADDED
@@ -0,0 +1,551 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "AAA",
4
+ "IGL",
5
+ "ALA",
6
+ "KTG",
7
+ "LLL",
8
+ "AAL",
9
+ "LAA",
10
+ "ISL",
11
+ "RLD",
12
+ "GLE",
13
+ "TGS",
14
+ "LEQ",
15
+ "ALG",
16
+ "AST",
17
+ "GPL",
18
+ "DLA",
19
+ "ALQ",
20
+ "SIG",
21
+ "AAV",
22
+ "RDT",
23
+ "LDA",
24
+ "PAS",
25
+ "LAT",
26
+ "LGL",
27
+ "ALL",
28
+ "STF",
29
+ "LAR",
30
+ "LDR",
31
+ "SAA",
32
+ "VDA",
33
+ "IPG",
34
+ "TFK",
35
+ "LPA",
36
+ "GDA",
37
+ "PLK",
38
+ "AVL",
39
+ "LAQ",
40
+ "LAV",
41
+ "GYG",
42
+ "VAL",
43
+ "ATL",
44
+ "LFG",
45
+ "ANL",
46
+ "ALI",
47
+ "LAS",
48
+ "QAL",
49
+ "GLG",
50
+ "TLL",
51
+ "GLP",
52
+ "LPL",
53
+ "VAF",
54
+ "GFG",
55
+ "SLL",
56
+ "RVG",
57
+ "AVA",
58
+ "LSA",
59
+ "VLA",
60
+ "LAN",
61
+ "SDN",
62
+ "LTA",
63
+ "PAL",
64
+ "SLK",
65
+ "TTG",
66
+ "AGG",
67
+ "LDL",
68
+ "YGN",
69
+ "DDR",
70
+ "TLA",
71
+ "LLA",
72
+ "AIP",
73
+ "GLA",
74
+ "QRL",
75
+ "GWE",
76
+ "GVK",
77
+ "ATY",
78
+ "LLD",
79
+ "GLF",
80
+ "LLR",
81
+ "IAA",
82
+ "LGW",
83
+ "GSV",
84
+ "RIG",
85
+ "ERL",
86
+ "SKE",
87
+ "YGV",
88
+ "TAG",
89
+ "EQQ",
90
+ "NLL",
91
+ "ELG",
92
+ "RFP",
93
+ "LEK",
94
+ "LLT",
95
+ "LKR",
96
+ "GST",
97
+ "AGL",
98
+ "GGL",
99
+ "SVS",
100
+ "LVD",
101
+ "FGA",
102
+ "LSG",
103
+ "KAL",
104
+ "DER",
105
+ "KRL",
106
+ "TLG",
107
+ "LAG",
108
+ "VGD",
109
+ "TPA",
110
+ "NAL",
111
+ "TYT",
112
+ "SAI",
113
+ "TLF",
114
+ "AIA",
115
+ "GGP",
116
+ "LAL",
117
+ "LGD",
118
+ "LGS",
119
+ "ALV",
120
+ "ERF",
121
+ "LTG",
122
+ "PVT",
123
+ "TAT",
124
+ "AER",
125
+ "LVI",
126
+ "LGI",
127
+ "LKG",
128
+ "DTT",
129
+ "AAN",
130
+ "LTL",
131
+ "GVL",
132
+ "EIG",
133
+ "ARS",
134
+ "NTA",
135
+ "LFE",
136
+ "TTP",
137
+ "LAE",
138
+ "VPA",
139
+ "QTL",
140
+ "VGP",
141
+ "VSK",
142
+ "AGN",
143
+ "ASA",
144
+ "EAA",
145
+ "ASK",
146
+ "RAS",
147
+ "RLY",
148
+ "VTP",
149
+ "QLG",
150
+ "GIA",
151
+ "GLV",
152
+ "VTA",
153
+ "ADI",
154
+ "KSL",
155
+ "AAR",
156
+ "SQR",
157
+ "EQL",
158
+ "MKA",
159
+ "APA",
160
+ "ASL",
161
+ "TTT",
162
+ "TSA",
163
+ "YRQ",
164
+ "QGL",
165
+ "SYG",
166
+ "TAF",
167
+ "ATT",
168
+ "LLS",
169
+ "ADL",
170
+ "PLQ",
171
+ "AAS",
172
+ "TLP",
173
+ "SKT",
174
+ "GKA",
175
+ "IGD",
176
+ "PLL",
177
+ "KAS",
178
+ "ALE",
179
+ "LLF",
180
+ "KTF",
181
+ "RRI",
182
+ "PAP",
183
+ "AQA",
184
+ "VLV",
185
+ "ALP",
186
+ "PAD",
187
+ "AVI",
188
+ "AIS",
189
+ "AYA",
190
+ "LGG",
191
+ "AIL",
192
+ "LIG",
193
+ "DAE",
194
+ "YVA",
195
+ "LVG",
196
+ "GAA",
197
+ "SLG",
198
+ "LNA",
199
+ "SAL",
200
+ "STL",
201
+ "DRP",
202
+ "IAR",
203
+ "PAG",
204
+ "ATA",
205
+ "DMT",
206
+ "MKK",
207
+ "IVA",
208
+ "DEV",
209
+ "DLL",
210
+ "QPQ",
211
+ "QLA",
212
+ "PGD",
213
+ "DGK",
214
+ "LLN",
215
+ "NDI",
216
+ "QDK",
217
+ "IAD",
218
+ "NKT",
219
+ "DKT",
220
+ "KLA",
221
+ "TGA",
222
+ "MLN",
223
+ "EAY",
224
+ "PGM",
225
+ "YSN",
226
+ "ARL",
227
+ "WQP",
228
+ "YTA",
229
+ "LCG",
230
+ "FTA",
231
+ "GAV",
232
+ "ANK",
233
+ "LEG",
234
+ "FPD",
235
+ "YPN",
236
+ "VQP",
237
+ "VGW",
238
+ "AVQ",
239
+ "KTL",
240
+ "LKI",
241
+ "LKA",
242
+ "DLV",
243
+ "ILS",
244
+ "ISA",
245
+ "GNT",
246
+ "FSY",
247
+ "ALD",
248
+ "YGL",
249
+ "SNP",
250
+ "QAG",
251
+ "PSI",
252
+ "QYS",
253
+ "GNA",
254
+ "LGV",
255
+ "IGS",
256
+ "GDK",
257
+ "KIS",
258
+ "KAE",
259
+ "SVQ",
260
+ "FWL",
261
+ "PGP",
262
+ "LLG",
263
+ "AEL",
264
+ "AYG",
265
+ "ETL",
266
+ "PLA",
267
+ "QQG",
268
+ "KSG",
269
+ "ARR",
270
+ "LVT",
271
+ "MTL",
272
+ "DAA",
273
+ "VLL",
274
+ "GYA",
275
+ "MAV",
276
+ "RLL",
277
+ "GKP",
278
+ "GAL",
279
+ "KLL",
280
+ "VKT",
281
+ "APL",
282
+ "FGY",
283
+ "NEA",
284
+ "TLR",
285
+ "LQF",
286
+ "ITP",
287
+ "NPS",
288
+ "IKK",
289
+ "VAI",
290
+ "YAK",
291
+ "LNK",
292
+ "GMT",
293
+ "EIK",
294
+ "QWQ",
295
+ "ALS",
296
+ "GWV",
297
+ "DTP",
298
+ "SEK",
299
+ "VPG",
300
+ "LLI",
301
+ "LDD",
302
+ "KKS",
303
+ "FPA",
304
+ "LRF",
305
+ "KEL",
306
+ "LLK",
307
+ "ILA",
308
+ "HKT",
309
+ "AGE",
310
+ "GPG",
311
+ "YGK",
312
+ "IAG",
313
+ "VMK",
314
+ "GIV",
315
+ "APQ",
316
+ "GSR",
317
+ "VPL",
318
+ "GIS",
319
+ "ARA",
320
+ "FAA",
321
+ "GDE",
322
+ "VIY",
323
+ "IAL",
324
+ "ADK",
325
+ "AND",
326
+ "DRA",
327
+ "QQV",
328
+ "STN",
329
+ "NAE",
330
+ "GVA",
331
+ "PLD",
332
+ "LPF",
333
+ "QFP",
334
+ "DIA",
335
+ "VPE",
336
+ "KDQ",
337
+ "TYA",
338
+ "TFT",
339
+ "DVP",
340
+ "STS",
341
+ "ADE",
342
+ "VNP",
343
+ "PVY",
344
+ "KDD",
345
+ "VYQ",
346
+ "ELA",
347
+ "GEA",
348
+ "LRK",
349
+ "TGW",
350
+ "NDL",
351
+ "ARV",
352
+ "GPA",
353
+ "EKH",
354
+ "QIA",
355
+ "AIT",
356
+ "AMA",
357
+ "AAD",
358
+ "IYA",
359
+ "TRL",
360
+ "YAQ",
361
+ "EQT",
362
+ "LEL",
363
+ "TNG",
364
+ "GMA",
365
+ "LIA",
366
+ "TGV",
367
+ "GKV",
368
+ "DLG",
369
+ "AQT",
370
+ "KLS",
371
+ "FVP",
372
+ "AQG",
373
+ "LNE",
374
+ "PIS",
375
+ "IVM",
376
+ "GAY",
377
+ "PET",
378
+ "QDL",
379
+ "AYV",
380
+ "ATV",
381
+ "LVL",
382
+ "VML",
383
+ "NGF",
384
+ "LLQ",
385
+ "VFK",
386
+ "ERI",
387
+ "GDM",
388
+ "EKN",
389
+ "AFV",
390
+ "QVL",
391
+ "FVD",
392
+ "VQD",
393
+ "MTV",
394
+ "FEL",
395
+ "EVK",
396
+ "AKS",
397
+ "GSQ",
398
+ "LFA",
399
+ "IKA",
400
+ "AAT",
401
+ "ELN",
402
+ "LGA",
403
+ "VAE",
404
+ "VKA",
405
+ "KIA",
406
+ "PEL",
407
+ "DNT",
408
+ "KAA",
409
+ "LYA",
410
+ "RKL",
411
+ "ADR",
412
+ "LPV",
413
+ "AEA",
414
+ "EER",
415
+ "KVA",
416
+ "KAN",
417
+ "ASI",
418
+ "LQA",
419
+ "WLV",
420
+ "VIL",
421
+ "PLR",
422
+ "AFS",
423
+ "PHY",
424
+ "HRI",
425
+ "VTE",
426
+ "DRL",
427
+ "AVK",
428
+ "FIP",
429
+ "VVA",
430
+ "RIA",
431
+ "AAI",
432
+ "YQG",
433
+ "LTN",
434
+ "YLA",
435
+ "AFL",
436
+ "ERV",
437
+ "LQP",
438
+ "QVG",
439
+ "GQP",
440
+ "GRR",
441
+ "HPE",
442
+ "LQG",
443
+ "LNL",
444
+ "SGG",
445
+ "TPQ",
446
+ "SYV",
447
+ "WVV",
448
+ "QAQ",
449
+ "DSV",
450
+ "ARG",
451
+ "NST",
452
+ "TPE",
453
+ "RPL",
454
+ "HYF",
455
+ "EQI",
456
+ "LVA",
457
+ "RSL",
458
+ "QQL",
459
+ "YFT",
460
+ "APG",
461
+ "GEL",
462
+ "FDG",
463
+ "SGL",
464
+ "SGA",
465
+ "AQI",
466
+ "QPV",
467
+ "FSL",
468
+ "GYL",
469
+ "PNA",
470
+ "RAP",
471
+ "QRA",
472
+ "LFI",
473
+ "ANR",
474
+ "GNL",
475
+ "VLG",
476
+ "LFP",
477
+ "QKD",
478
+ "GIL",
479
+ "EKI",
480
+ "SPV",
481
+ "DQA",
482
+ "LER",
483
+ "RLG",
484
+ "DAI",
485
+ "TAA",
486
+ "PDS",
487
+ "RDL",
488
+ "VTR",
489
+ "DAG",
490
+ "AEG",
491
+ "SLI",
492
+ "FKW",
493
+ "VAG",
494
+ "VAV",
495
+ "RFV",
496
+ "GAG",
497
+ "GLT",
498
+ "VKR",
499
+ "RQQ",
500
+ "RVE",
501
+ "KGE",
502
+ "TRF"
503
+ ],
504
+ "class_names": [
505
+ "aminocoumarin antibiotic",
506
+ "aminoglycoside antibiotic",
507
+ "antibacterial free fatty acids",
508
+ "antibiotic without defined classification",
509
+ "bicyclomycin-like antibiotic",
510
+ "carbapenem",
511
+ "cephalosporin",
512
+ "diaminopyrimidine antibiotic",
513
+ "disinfecting agents and antiseptics",
514
+ "elfamycin antibiotic",
515
+ "fluoroquinolone antibiotic",
516
+ "fusidane antibiotic",
517
+ "glycopeptide antibiotic",
518
+ "glycylcycline",
519
+ "isoniazid-like antibiotic",
520
+ "lincosamide antibiotic",
521
+ "macrolide antibiotic",
522
+ "moenomycin antibiotic",
523
+ "monobactam",
524
+ "mupirocin-like antibiotic",
525
+ "nitrofuran antibiotic",
526
+ "nitroimidazole antibiotic",
527
+ "nucleoside antibiotic",
528
+ "orthosomycin antibiotic",
529
+ "oxazolidinone antibiotic",
530
+ "penicillin beta-lactam",
531
+ "peptide antibiotic",
532
+ "phenicol antibiotic",
533
+ "phosphonic acid antibiotic",
534
+ "pleuromutilin antibiotic",
535
+ "polyamine antibiotic",
536
+ "rifamycin antibiotic",
537
+ "streptogramin A antibiotic",
538
+ "streptogramin B antibiotic",
539
+ "streptogramin antibiotic",
540
+ "sulfonamide antibiotic",
541
+ "sulfone antibiotic",
542
+ "tetracycline antibiotic"
543
+ ],
544
+ "task_type": "multilabel",
545
+ "target": "drug_class",
546
+ "k": 3,
547
+ "max_features": 500,
548
+ "n_samples": 6054,
549
+ "n_features": 500,
550
+ "n_classes": 38
551
+ }
data_processed/card/card_drug_class_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c9bcbd653442796426386f60035cffc438caedc24f07e4465b0cd180dac83ab7
3
+ size 368272
data_processed/card/card_drug_class_y_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c97d499ebb542381bac702a8d55e11bf3f4eb8aef16a495023397b0027292a6b
3
+ size 1288176
data_processed/card/card_drug_class_y_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b7237e52cd44e3252b7ab582a21374313aad6d5b3231d8a50b2b9cd06964764b
3
+ size 184352
data_processed/card/card_gene_family_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:37754e2ffccec906083a21935e46f862471a3945ffa65b46fea2c87ae191bb5d
3
+ size 4844128
data_processed/card/card_gene_family_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7053e2e0734de9ae8c46483fad2fb11c8d181f371c26d250cbfe5eea2abe3355
3
+ size 16948128
data_processed/card/card_gene_family_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:562364532d2014e72213a80cd2191590b9963132f29c09843abf6b6a1fc6df1b
3
+ size 2424128
data_processed/card/card_gene_family_metadata.json ADDED
@@ -0,0 +1,911 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "AAA",
4
+ "IGL",
5
+ "ALA",
6
+ "KTG",
7
+ "LLL",
8
+ "AAL",
9
+ "LAA",
10
+ "ISL",
11
+ "RLD",
12
+ "GLE",
13
+ "TGS",
14
+ "LEQ",
15
+ "ALG",
16
+ "AST",
17
+ "GPL",
18
+ "DLA",
19
+ "ALQ",
20
+ "SIG",
21
+ "AAV",
22
+ "RDT",
23
+ "LDA",
24
+ "PAS",
25
+ "LAT",
26
+ "LGL",
27
+ "ALL",
28
+ "STF",
29
+ "LAR",
30
+ "LDR",
31
+ "SAA",
32
+ "VDA",
33
+ "IPG",
34
+ "TFK",
35
+ "LPA",
36
+ "GDA",
37
+ "PLK",
38
+ "AVL",
39
+ "LAQ",
40
+ "LAV",
41
+ "GYG",
42
+ "VAL",
43
+ "ATL",
44
+ "LFG",
45
+ "ANL",
46
+ "ALI",
47
+ "LAS",
48
+ "QAL",
49
+ "GLG",
50
+ "TLL",
51
+ "GLP",
52
+ "LPL",
53
+ "VAF",
54
+ "GFG",
55
+ "SLL",
56
+ "RVG",
57
+ "AVA",
58
+ "LSA",
59
+ "VLA",
60
+ "LAN",
61
+ "SDN",
62
+ "LTA",
63
+ "PAL",
64
+ "SLK",
65
+ "TTG",
66
+ "AGG",
67
+ "LDL",
68
+ "YGN",
69
+ "DDR",
70
+ "TLA",
71
+ "LLA",
72
+ "AIP",
73
+ "GLA",
74
+ "QRL",
75
+ "GWE",
76
+ "GVK",
77
+ "ATY",
78
+ "LLD",
79
+ "GLF",
80
+ "LLR",
81
+ "IAA",
82
+ "LGW",
83
+ "GSV",
84
+ "RIG",
85
+ "ERL",
86
+ "SKE",
87
+ "YGV",
88
+ "TAG",
89
+ "EQQ",
90
+ "NLL",
91
+ "ELG",
92
+ "RFP",
93
+ "LEK",
94
+ "LLT",
95
+ "LKR",
96
+ "GST",
97
+ "AGL",
98
+ "GGL",
99
+ "SVS",
100
+ "LVD",
101
+ "FGA",
102
+ "LSG",
103
+ "KAL",
104
+ "DER",
105
+ "KRL",
106
+ "TLG",
107
+ "LAG",
108
+ "VGD",
109
+ "TPA",
110
+ "NAL",
111
+ "TYT",
112
+ "SAI",
113
+ "TLF",
114
+ "AIA",
115
+ "GGP",
116
+ "LAL",
117
+ "LGD",
118
+ "LGS",
119
+ "ALV",
120
+ "ERF",
121
+ "LTG",
122
+ "PVT",
123
+ "TAT",
124
+ "AER",
125
+ "LVI",
126
+ "LGI",
127
+ "LKG",
128
+ "DTT",
129
+ "AAN",
130
+ "LTL",
131
+ "GVL",
132
+ "EIG",
133
+ "ARS",
134
+ "NTA",
135
+ "LFE",
136
+ "TTP",
137
+ "LAE",
138
+ "VPA",
139
+ "QTL",
140
+ "VGP",
141
+ "VSK",
142
+ "AGN",
143
+ "ASA",
144
+ "EAA",
145
+ "ASK",
146
+ "RAS",
147
+ "RLY",
148
+ "VTP",
149
+ "QLG",
150
+ "GIA",
151
+ "GLV",
152
+ "VTA",
153
+ "ADI",
154
+ "KSL",
155
+ "AAR",
156
+ "SQR",
157
+ "EQL",
158
+ "MKA",
159
+ "APA",
160
+ "ASL",
161
+ "TTT",
162
+ "TSA",
163
+ "YRQ",
164
+ "QGL",
165
+ "SYG",
166
+ "TAF",
167
+ "ATT",
168
+ "LLS",
169
+ "ADL",
170
+ "PLQ",
171
+ "AAS",
172
+ "TLP",
173
+ "SKT",
174
+ "GKA",
175
+ "IGD",
176
+ "PLL",
177
+ "KAS",
178
+ "ALE",
179
+ "LLF",
180
+ "KTF",
181
+ "RRI",
182
+ "PAP",
183
+ "AQA",
184
+ "VLV",
185
+ "ALP",
186
+ "PAD",
187
+ "AVI",
188
+ "AIS",
189
+ "AYA",
190
+ "LGG",
191
+ "AIL",
192
+ "LIG",
193
+ "DAE",
194
+ "YVA",
195
+ "LVG",
196
+ "GAA",
197
+ "SLG",
198
+ "LNA",
199
+ "SAL",
200
+ "STL",
201
+ "DRP",
202
+ "IAR",
203
+ "PAG",
204
+ "ATA",
205
+ "DMT",
206
+ "MKK",
207
+ "IVA",
208
+ "DEV",
209
+ "DLL",
210
+ "QPQ",
211
+ "QLA",
212
+ "PGD",
213
+ "DGK",
214
+ "LLN",
215
+ "NDI",
216
+ "QDK",
217
+ "IAD",
218
+ "NKT",
219
+ "DKT",
220
+ "KLA",
221
+ "TGA",
222
+ "MLN",
223
+ "EAY",
224
+ "PGM",
225
+ "YSN",
226
+ "ARL",
227
+ "WQP",
228
+ "YTA",
229
+ "LCG",
230
+ "FTA",
231
+ "GAV",
232
+ "ANK",
233
+ "LEG",
234
+ "FPD",
235
+ "YPN",
236
+ "VQP",
237
+ "VGW",
238
+ "AVQ",
239
+ "KTL",
240
+ "LKI",
241
+ "LKA",
242
+ "DLV",
243
+ "ILS",
244
+ "ISA",
245
+ "GNT",
246
+ "FSY",
247
+ "ALD",
248
+ "YGL",
249
+ "SNP",
250
+ "QAG",
251
+ "PSI",
252
+ "QYS",
253
+ "GNA",
254
+ "LGV",
255
+ "IGS",
256
+ "GDK",
257
+ "KIS",
258
+ "KAE",
259
+ "SVQ",
260
+ "FWL",
261
+ "PGP",
262
+ "LLG",
263
+ "AEL",
264
+ "AYG",
265
+ "ETL",
266
+ "PLA",
267
+ "QQG",
268
+ "KSG",
269
+ "ARR",
270
+ "LVT",
271
+ "MTL",
272
+ "DAA",
273
+ "VLL",
274
+ "GYA",
275
+ "MAV",
276
+ "RLL",
277
+ "GKP",
278
+ "GAL",
279
+ "KLL",
280
+ "VKT",
281
+ "APL",
282
+ "FGY",
283
+ "NEA",
284
+ "TLR",
285
+ "LQF",
286
+ "ITP",
287
+ "NPS",
288
+ "IKK",
289
+ "VAI",
290
+ "YAK",
291
+ "LNK",
292
+ "GMT",
293
+ "EIK",
294
+ "QWQ",
295
+ "ALS",
296
+ "GWV",
297
+ "DTP",
298
+ "SEK",
299
+ "VPG",
300
+ "LLI",
301
+ "LDD",
302
+ "KKS",
303
+ "FPA",
304
+ "LRF",
305
+ "KEL",
306
+ "LLK",
307
+ "ILA",
308
+ "HKT",
309
+ "AGE",
310
+ "GPG",
311
+ "YGK",
312
+ "IAG",
313
+ "VMK",
314
+ "GIV",
315
+ "APQ",
316
+ "GSR",
317
+ "VPL",
318
+ "GIS",
319
+ "ARA",
320
+ "FAA",
321
+ "GDE",
322
+ "VIY",
323
+ "IAL",
324
+ "ADK",
325
+ "AND",
326
+ "DRA",
327
+ "QQV",
328
+ "STN",
329
+ "NAE",
330
+ "GVA",
331
+ "PLD",
332
+ "LPF",
333
+ "QFP",
334
+ "DIA",
335
+ "VPE",
336
+ "KDQ",
337
+ "TYA",
338
+ "TFT",
339
+ "DVP",
340
+ "STS",
341
+ "ADE",
342
+ "VNP",
343
+ "PVY",
344
+ "KDD",
345
+ "VYQ",
346
+ "ELA",
347
+ "GEA",
348
+ "LRK",
349
+ "TGW",
350
+ "NDL",
351
+ "ARV",
352
+ "GPA",
353
+ "EKH",
354
+ "QIA",
355
+ "AIT",
356
+ "AMA",
357
+ "AAD",
358
+ "IYA",
359
+ "TRL",
360
+ "YAQ",
361
+ "EQT",
362
+ "LEL",
363
+ "TNG",
364
+ "GMA",
365
+ "LIA",
366
+ "TGV",
367
+ "GKV",
368
+ "DLG",
369
+ "AQT",
370
+ "KLS",
371
+ "FVP",
372
+ "AQG",
373
+ "LNE",
374
+ "PIS",
375
+ "IVM",
376
+ "GAY",
377
+ "PET",
378
+ "QDL",
379
+ "AYV",
380
+ "ATV",
381
+ "LVL",
382
+ "VML",
383
+ "NGF",
384
+ "LLQ",
385
+ "VFK",
386
+ "ERI",
387
+ "GDM",
388
+ "EKN",
389
+ "AFV",
390
+ "QVL",
391
+ "FVD",
392
+ "VQD",
393
+ "MTV",
394
+ "FEL",
395
+ "EVK",
396
+ "AKS",
397
+ "GSQ",
398
+ "LFA",
399
+ "IKA",
400
+ "AAT",
401
+ "ELN",
402
+ "LGA",
403
+ "VAE",
404
+ "VKA",
405
+ "KIA",
406
+ "PEL",
407
+ "DNT",
408
+ "KAA",
409
+ "LYA",
410
+ "RKL",
411
+ "ADR",
412
+ "LPV",
413
+ "AEA",
414
+ "EER",
415
+ "KVA",
416
+ "KAN",
417
+ "ASI",
418
+ "LQA",
419
+ "WLV",
420
+ "VIL",
421
+ "PLR",
422
+ "AFS",
423
+ "PHY",
424
+ "HRI",
425
+ "VTE",
426
+ "DRL",
427
+ "AVK",
428
+ "FIP",
429
+ "VVA",
430
+ "RIA",
431
+ "AAI",
432
+ "YQG",
433
+ "LTN",
434
+ "YLA",
435
+ "AFL",
436
+ "ERV",
437
+ "LQP",
438
+ "QVG",
439
+ "GQP",
440
+ "GRR",
441
+ "HPE",
442
+ "LQG",
443
+ "LNL",
444
+ "SGG",
445
+ "TPQ",
446
+ "SYV",
447
+ "WVV",
448
+ "QAQ",
449
+ "DSV",
450
+ "ARG",
451
+ "NST",
452
+ "TPE",
453
+ "RPL",
454
+ "HYF",
455
+ "EQI",
456
+ "LVA",
457
+ "RSL",
458
+ "QQL",
459
+ "YFT",
460
+ "APG",
461
+ "GEL",
462
+ "FDG",
463
+ "SGL",
464
+ "SGA",
465
+ "AQI",
466
+ "QPV",
467
+ "FSL",
468
+ "GYL",
469
+ "PNA",
470
+ "RAP",
471
+ "QRA",
472
+ "LFI",
473
+ "ANR",
474
+ "GNL",
475
+ "VLG",
476
+ "LFP",
477
+ "QKD",
478
+ "GIL",
479
+ "EKI",
480
+ "SPV",
481
+ "DQA",
482
+ "LER",
483
+ "RLG",
484
+ "DAI",
485
+ "TAA",
486
+ "PDS",
487
+ "RDL",
488
+ "VTR",
489
+ "DAG",
490
+ "AEG",
491
+ "SLI",
492
+ "FKW",
493
+ "VAG",
494
+ "VAV",
495
+ "RFV",
496
+ "GAG",
497
+ "GLT",
498
+ "VKR",
499
+ "RQQ",
500
+ "RVE",
501
+ "KGE",
502
+ "TRF"
503
+ ],
504
+ "class_names": [
505
+ "16S rRNA methyltransferase (A1408)",
506
+ "16S rRNA methyltransferase (G1405)",
507
+ "AAC(2')",
508
+ "AAC(3)",
509
+ "AAC(6')",
510
+ "AAC(6');AAC(6')-Ib-cr",
511
+ "AAK beta-lactamase",
512
+ "ACC beta-lactamase",
513
+ "ACI beta-lactamase",
514
+ "ACT beta-lactamase",
515
+ "ACT beta-lactamase;CMY beta-lactamase;CTX-M beta-lactamase;IMP beta-lactamase;KPC beta-lactamase;MOX beta-lactamase;OXA beta-lactamase;OXA-1-like beta-lactamase;SHV beta-lactamase;TEM beta-lactamase;class A Mycobacterium abscessus beta-lactamase",
516
+ "ADC beta-lactamase with carbapenemase activity",
517
+ "ADC beta-lactamase without carbapenemase activity",
518
+ "ADC beta-lactamases pending classification for carbapenemase activity",
519
+ "AER beta-lactamase",
520
+ "AFM beta-lactamase",
521
+ "AIM beta-lactamase",
522
+ "ALG11 beta-lactamase",
523
+ "ALG6 beta-lactamases",
524
+ "ALI beta-lactamase",
525
+ "AMZ beta-lactamase",
526
+ "ANA beta-lactamase",
527
+ "ANT(2'')",
528
+ "ANT(3'')",
529
+ "ANT(4')",
530
+ "ANT(6)",
531
+ "ANT(9)",
532
+ "APH(2'')",
533
+ "APH(3'')",
534
+ "APH(3')",
535
+ "APH(4)",
536
+ "APH(6)",
537
+ "APH(7'')",
538
+ "APH(9)",
539
+ "AQU beta-lactamase",
540
+ "ARL Beta-lactamase",
541
+ "AST Beta-lactamase",
542
+ "ASU1 beta-lactamase",
543
+ "ATP-binding cassette (ABC) antibiotic efflux pump",
544
+ "ATP-binding cassette (ABC) antibiotic efflux pump;major facilitator superfamily (MFS) antibiotic efflux pump",
545
+ "ATP-binding cassette (ABC) antibiotic efflux pump;major facilitator superfamily (MFS) antibiotic efflux pump;resistance-nodulation-cell division (RND) antibiotic efflux pump",
546
+ "AXC beta-lactamase",
547
+ "B3SU1 beta-lactamase",
548
+ "B3SU2 beta-lactamase",
549
+ "BAT Beta-lactamase",
550
+ "BCL Beta-lactamase",
551
+ "BEL beta-lactamase",
552
+ "BES Beta-lactamase",
553
+ "BIC Beta-lactamase",
554
+ "BIL Beta-lactamase",
555
+ "BIM beta-lactamase",
556
+ "BJP beta-lactamase",
557
+ "BKC Beta-lactamase",
558
+ "BMHC beta-lactamase",
559
+ "BOR beta-lactamase",
560
+ "BPU Beta-lactamase",
561
+ "BRO Beta-lactamase",
562
+ "BSU beta-lactamase",
563
+ "BUT beta-lactamase",
564
+ "Bah amidohydrolase",
565
+ "BlaA beta-lactamase",
566
+ "BlaB beta-lactamase",
567
+ "BlaZ beta-lactamase",
568
+ "Bleomycin resistant protein",
569
+ "CAE beta-lactamase",
570
+ "CAM beta-lactamase",
571
+ "CAR beta-lactamase",
572
+ "CARB beta-lactamase",
573
+ "CAU beta-lactamase",
574
+ "CBP beta-lactamase",
575
+ "CDA beta-lactamase;CTX-M beta-lactamase;SHV beta-lactamase;TEM beta-lactamase",
576
+ "CDD beta-lactamase",
577
+ "CGA beta-lactamase",
578
+ "CGB beta-lactamase",
579
+ "CHM beta-lactamase",
580
+ "CIA beta-lactamase",
581
+ "CIM beta-lactamase",
582
+ "CKO beta-lactamase",
583
+ "CMA beta-lactamase",
584
+ "CME beta-lactamase",
585
+ "CMH beta-lactamase",
586
+ "CMY beta-lactamase",
587
+ "CMY beta-lactamase;CTX-M beta-lactamase;IMP beta-lactamase;KPC beta-lactamase;NDM beta-lactamase;OXA beta-lactamase;OXA-1-like beta-lactamase;OXA-48-like beta-lactamase;SHV beta-lactamase;VIM beta-lactamase",
588
+ "CPS beta-lactamase",
589
+ "CRD3 beta-lactamase",
590
+ "CRH beta-lactamase",
591
+ "CRP beta-lactamase",
592
+ "CSA beta-lactamase",
593
+ "CSP beta-lactamase",
594
+ "CTX-M beta-lactamase",
595
+ "CVI beta-lactamase",
596
+ "CblA beta-lactamase",
597
+ "CepA beta-lactamase",
598
+ "CepS beta-lactamase",
599
+ "CfiA beta-lactamase",
600
+ "Cfr 23S ribosomal RNA methyltransferase",
601
+ "CfxA beta-lactamase",
602
+ "CphA beta-lactamase",
603
+ "DES beta-lactamase",
604
+ "DHA beta-lactamase",
605
+ "DHT2 beta-lactamase",
606
+ "DIM beta-lactamase",
607
+ "DYB beta-lactamase",
608
+ "EAM beta-lactamase",
609
+ "EBR beta-lactamase",
610
+ "EC beta-lactamase",
611
+ "ECF transporter S component",
612
+ "ECM beta-lactamase",
613
+ "ECV beta-lactamase",
614
+ "EFM beta-lactamase",
615
+ "ELM beta-lactamase",
616
+ "ERP beta-lactamase",
617
+ "ESP beta-lactamase",
618
+ "EVM beta-lactamase",
619
+ "EXO beta-lactamase",
620
+ "Edeine acetyltransferase",
621
+ "Erm 23S ribosomal RNA methyltransferase",
622
+ "FAR beta-lactamase",
623
+ "FEZ beta-lactamase",
624
+ "FIA beta-lactamase",
625
+ "FIM beta-lactamase",
626
+ "FONA beta-lactamase",
627
+ "FOX beta-lactamase",
628
+ "FPH beta-lactamase",
629
+ "FRI beta-lactamase",
630
+ "FTU beta-lactamase",
631
+ "Fom phosphotransferase family",
632
+ "GES beta-lactamase",
633
+ "GIL beta-lactamase",
634
+ "GIM beta-lactamase",
635
+ "GMA beta-lactamase",
636
+ "GMB beta-lactamase",
637
+ "GOB beta-lactamase",
638
+ "GPC beta-lactamase",
639
+ "GRD23 beta-lactamase",
640
+ "GRD33 beta-lactamase",
641
+ "General Bacterial Porin with reduced permeability to beta-lactams",
642
+ "General Bacterial Porin with reduced permeability to beta-lactams;resistance-nodulation-cell division (RND) antibiotic efflux pump",
643
+ "General Bacterial Porin with reduced permeability to peptide antibiotics",
644
+ "HBL beta-lactamase",
645
+ "HER beta-lactamase",
646
+ "HMB beta-lactamase",
647
+ "IDC beta-lactamase",
648
+ "IMI beta-lactamase",
649
+ "IMP beta-lactamase",
650
+ "IND beta-lactamase",
651
+ "Intrinsic peptide antibiotic resistant Lps",
652
+ "JOHN beta-lactamase",
653
+ "KBL beta-lactamase",
654
+ "KHM beta-lactamase",
655
+ "KLUC beta-lactamase",
656
+ "KPC beta-lactamase",
657
+ "L1 family beta-lactamase",
658
+ "LAP beta-lactamase",
659
+ "LAQ beta lactamase",
660
+ "LCR beta-lactamase",
661
+ "LEN beta-lactamase",
662
+ "LHK beta-lactamase",
663
+ "LMB beta-lactamase",
664
+ "LRG beta-lactamase",
665
+ "LUS beta-lactamase",
666
+ "LUT beta-lactamase",
667
+ "Llm 23S ribosomal RNA methyltransferase",
668
+ "MAL beta-lactamase",
669
+ "MBL beta-lactamase",
670
+ "MCR phosphoethanolamine transferase",
671
+ "MIR beta-lactamase",
672
+ "MOC beta-lactamase",
673
+ "MOR beta-lactamase",
674
+ "MOX beta-lactamase",
675
+ "MSI beta-lactamase",
676
+ "MSI-OXA family beta-lactamase",
677
+ "MUN beta-lactamase",
678
+ "MUS beta-lactamase",
679
+ "MYO beta-lactamase",
680
+ "MYX beta-lactamase",
681
+ "Miscellaneous ABC-F subfamily ATP-binding cassette ribosomal protection proteins",
682
+ "NDM beta-lactamase",
683
+ "NPS beta-lactamase",
684
+ "NWM beta-lactamase",
685
+ "OCH beta-lactamase",
686
+ "OHIO beta-lactamase",
687
+ "OKP beta-lactamase",
688
+ "ORN beta-lactamase",
689
+ "ORR beta-lactamase",
690
+ "OXA beta-lactamase",
691
+ "OXA beta-lactamase;OXA-1-like beta-lactamase",
692
+ "OXA beta-lactamase;OXA-10-like beta-lactamase",
693
+ "OXA beta-lactamase;OXA-114-like beta-lactamase",
694
+ "OXA beta-lactamase;OXA-12-like beta-lactamase",
695
+ "OXA beta-lactamase;OXA-134-like beta-lactamase",
696
+ "OXA beta-lactamase;OXA-143-like beta-lactamase",
697
+ "OXA beta-lactamase;OXA-184-like beta-lactamase",
698
+ "OXA beta-lactamase;OXA-198-like beta-lactamase",
699
+ "OXA beta-lactamase;OXA-2-like beta-lactamase",
700
+ "OXA beta-lactamase;OXA-211-like beta-lactamase",
701
+ "OXA beta-lactamase;OXA-213-like beta-lactamase",
702
+ "OXA beta-lactamase;OXA-214-like beta-lactamase",
703
+ "OXA beta-lactamase;OXA-22-like beta-lactamase",
704
+ "OXA beta-lactamase;OXA-229-like beta-lactamase",
705
+ "OXA beta-lactamase;OXA-23-like beta-lactamase",
706
+ "OXA beta-lactamase;OXA-24-like beta-lactamase",
707
+ "OXA beta-lactamase;OXA-266-like beta-lactamase",
708
+ "OXA beta-lactamase;OXA-274-like beta-lactamase",
709
+ "OXA beta-lactamase;OXA-286-like beta-lactamase",
710
+ "OXA beta-lactamase;OXA-294-like beta-lactamase",
711
+ "OXA beta-lactamase;OXA-364-like beta-lactamase",
712
+ "OXA beta-lactamase;OXA-372-like beta-lactamase",
713
+ "OXA beta-lactamase;OXA-42-like beta-lactamase",
714
+ "OXA beta-lactamase;OXA-427-like beta-lactamase",
715
+ "OXA beta-lactamase;OXA-46-like beta-lactamase",
716
+ "OXA beta-lactamase;OXA-48-like beta-lactamase",
717
+ "OXA beta-lactamase;OXA-493-like beta-lactamase",
718
+ "OXA beta-lactamase;OXA-5-like beta-lactamase",
719
+ "OXA beta-lactamase;OXA-50-like beta-lactamase",
720
+ "OXA beta-lactamase;OXA-51-like beta-lactamase",
721
+ "OXA beta-lactamase;OXA-548-like beta-lactamase",
722
+ "OXA beta-lactamase;OXA-55-like beta-lactamase",
723
+ "OXA beta-lactamase;OXA-58-like beta-lactamase",
724
+ "OXA beta-lactamase;OXA-60-like beta-lactamase",
725
+ "OXA beta-lactamase;OXA-61-like beta-lactamase",
726
+ "OXA beta-lactamase;OXA-62-like beta-lactamase",
727
+ "OXA beta-lactamase;OXA-63-like beta-lactamase",
728
+ "OXA beta-lactamase;OXA-679-like beta-lactamase",
729
+ "OXA beta-lactamase;OXA-727-like beta-lactamase",
730
+ "OXA beta-lactamase;OXA-9-like beta-lactamase",
731
+ "OXY beta-lactamase",
732
+ "Outer Membrane Porin (Opr)",
733
+ "Outer Membrane Porin (Opr);resistance-nodulation-cell division (RND) antibiotic efflux pump",
734
+ "PAC beta-lactamase",
735
+ "PAD beta-lactamase",
736
+ "PAM beta-lactamase",
737
+ "PAU beta-lactamase",
738
+ "PDC beta-lactamase",
739
+ "PEN-A beta-lactamase",
740
+ "PEN-B beta-lactamase",
741
+ "PER beta-lactamase",
742
+ "PFM beta-lactamase",
743
+ "PJM beta-lactamase",
744
+ "PLA beta-lactamase",
745
+ "PLN beta-lactamase",
746
+ "PME beta-lactamase",
747
+ "PNC beta-lactamase",
748
+ "PNGM beta-lactamase",
749
+ "POM beta-lactamase",
750
+ "PRC beta-lactamase",
751
+ "PST beta-lactamase",
752
+ "PSV beta-lactamase",
753
+ "PSZ beta-lactamase",
754
+ "R39 beta-lactamase",
755
+ "RAA beta-lactamase",
756
+ "RAD beta-lactamase",
757
+ "RAHN beta-lactamase",
758
+ "RASA beta-lactamase",
759
+ "RATA beta-lactamase",
760
+ "RCP beta-lactamase",
761
+ "ROB beta-lactamase",
762
+ "RSA beta-lactamase",
763
+ "RSA2 beta-lactamase",
764
+ "RSC1 beta-lactamase",
765
+ "RSD1",
766
+ "RSD2 beta-lactamase",
767
+ "RUB beta-lactamase",
768
+ "RbpA bacterial RNA polymerase-binding protein",
769
+ "Rm3 family beta-lactamase",
770
+ "SCO beta-lactamase",
771
+ "SED beta-lactamase",
772
+ "SFC beta-lactamase",
773
+ "SFDC beta-lactamase",
774
+ "SFH beta-lactamase",
775
+ "SFO beta-lactamase",
776
+ "SGM beta-lactamase",
777
+ "SHD beta-lactamase",
778
+ "SHN beta-lactamase",
779
+ "SHV beta-lactamase",
780
+ "SHW beta-lactamase",
781
+ "SIE beta-lactamase",
782
+ "SIM beta-lactamase",
783
+ "SMB beta-lactamase",
784
+ "SME beta-lactamase",
785
+ "SPG beta-lactamase",
786
+ "SPM beta-lactamase",
787
+ "SPN79 beta-lactamase",
788
+ "SPR beta-lactamase",
789
+ "SPS beta-lactamase",
790
+ "SPU beta-lactamase",
791
+ "SRT beta-lactamase",
792
+ "SSA beta-lactamase",
793
+ "SST beta-lactamase",
794
+ "STA beta-lactamase",
795
+ "Serine/threonine kinases",
796
+ "Subclass B1 Vibrio cholerae varG beta-lactamase",
797
+ "TEM beta-lactamase",
798
+ "TER beta-lactamase",
799
+ "THIN-B beta-lactamase",
800
+ "TLA beta-lactamase",
801
+ "TMB beta-lactamase",
802
+ "TRU beta-lactamase",
803
+ "TTU beta-lactamase",
804
+ "TUS beta-lactamase",
805
+ "Target protecting FusB-type protein conferring resistance to Fusidic acid",
806
+ "VAM beta-lactamase",
807
+ "VCC beta-lactamase",
808
+ "VEB beta-lactamase",
809
+ "VHH beta-lactamase",
810
+ "VHW beta-lactamase",
811
+ "VIM beta-lactamase",
812
+ "VMB beta-lactamase",
813
+ "Van ligase;glycopeptide resistance gene cluster",
814
+ "WUS beta-lactamase",
815
+ "YEM beta-lactamase",
816
+ "YOC beta-lactamase",
817
+ "YRC beta-lactamase",
818
+ "ZOG beta-lactamase",
819
+ "alm glycyl carrier protein;polymyxin resistance operon",
820
+ "alm glycyltransferase;polymyxin resistance operon",
821
+ "aminoglycoside bifunctional resistance protein",
822
+ "ampC-type beta-lactamase",
823
+ "antibiotic-resistant isoleucyl-tRNA synthetase (ileS)",
824
+ "antibiotic-resistant murA transferase",
825
+ "blaF family beta-lactamase",
826
+ "blaS",
827
+ "capreomycin phosphotransferase",
828
+ "chloramphenicol acetyltransferase (CAT)",
829
+ "chloramphenicol phosphotransferase",
830
+ "class A Bacillus anthracis Bla beta-lactamase",
831
+ "class A Bacillus cereus Bc beta-lactamase",
832
+ "class A LRA beta-lactamase",
833
+ "class A Mycobacterium abscessus beta-lactamase",
834
+ "class A Mycobacterium tuberculosis bla beta-lactamase",
835
+ "class C LRA beta-lactamase",
836
+ "class C LRA beta-lactamase;class D LRA beta-lactamase",
837
+ "cpa acetyltransferase",
838
+ "defensin resistant mprF",
839
+ "fosC phosphotransferase family",
840
+ "fosfomycin thiol transferase",
841
+ "fusidic acid inactivation enzyme",
842
+ "gimA family macrolide glycosyltransferase",
843
+ "glycopeptide resistance gene cluster;vanH",
844
+ "glycopeptide resistance gene cluster;vanK",
845
+ "glycopeptide resistance gene cluster;vanR",
846
+ "glycopeptide resistance gene cluster;vanS",
847
+ "glycopeptide resistance gene cluster;vanT",
848
+ "glycopeptide resistance gene cluster;vanU",
849
+ "glycopeptide resistance gene cluster;vanV",
850
+ "glycopeptide resistance gene cluster;vanW",
851
+ "glycopeptide resistance gene cluster;vanX",
852
+ "glycopeptide resistance gene cluster;vanXY",
853
+ "glycopeptide resistance gene cluster;vanY",
854
+ "glycopeptide resistance gene cluster;vanZ",
855
+ "helicase-like RNA polymerase protection protein",
856
+ "intrinsic colistin resistant phosphoethanolamine transferase",
857
+ "kdpDE",
858
+ "lincosamide nucleotidyltransferase (LNU)",
859
+ "lipid A acyltransferase;polymyxin resistance operon",
860
+ "lipid A phosphatase",
861
+ "lsa-type ABC-F protein",
862
+ "macrolide esterase",
863
+ "macrolide phosphotransferase (MPH)",
864
+ "major facilitator superfamily (MFS) antibiotic efflux pump",
865
+ "major facilitator superfamily (MFS) antibiotic efflux pump;resistance-nodulation-cell division (RND) antibiotic efflux pump",
866
+ "metal transporters with antibiotic efflux",
867
+ "methicillin resistant PBP2",
868
+ "mgt macrolide glycotransferase",
869
+ "msr-type ABC-F protein",
870
+ "multidrug and toxic compound extrusion (MATE) transporter",
871
+ "nitroimidazole reductase",
872
+ "non-erm 23S ribosomal RNA methyltransferase (A1067)",
873
+ "non-erm 23S ribosomal RNA methyltransferase (G748)",
874
+ "ole glycosyltransferase",
875
+ "pmr phosphoethanolamine transferase",
876
+ "quinolone resistance protein (qnr)",
877
+ "resistance-nodulation-cell division (RND) antibiotic efflux pump",
878
+ "rifampin ADP-ribosyltransferase (Arr)",
879
+ "rifampin glycosyltransferase",
880
+ "rifampin monooxygenase",
881
+ "rifampin phosphotransferase",
882
+ "rifamycin-resistant beta-subunit of RNA polymerase (rpoB)",
883
+ "sal-type ABC-F protein",
884
+ "small multidrug resistance (SMR) antibiotic efflux pump",
885
+ "streptogramin vat acetyltransferase",
886
+ "streptogramin vgb lyase",
887
+ "streptothricin acetyltransferase (SAT)",
888
+ "subclass B1 Bacillus anthracis Bla beta-lactamase",
889
+ "subclass B1 Bacillus cereus Bc beta-lactamase",
890
+ "subclass B1 Bacteroides xylanisolvens crx beta-lactamase",
891
+ "subclass B1 PEDO beta-lactamase",
892
+ "subclass B3 LRA beta-lactamase",
893
+ "subclass B3 PEDO beta-lactamase",
894
+ "sulfonamide resistant sul",
895
+ "tetracycline inactivation enzyme",
896
+ "tetracycline-resistant ribosomal protection protein",
897
+ "trimethoprim resistant dihydrofolate reductase dfr",
898
+ "tunicamycin resistance protein",
899
+ "undecaprenyl pyrophosphate related proteins",
900
+ "vanJ membrane protein",
901
+ "vga-type ABC-F protein",
902
+ "viomycin phosphotransferase"
903
+ ],
904
+ "task_type": "multiclass",
905
+ "target": "gene_family",
906
+ "k": 3,
907
+ "max_features": 500,
908
+ "n_samples": 6054,
909
+ "n_features": 500,
910
+ "n_classes": 398
911
+ }
data_processed/card/card_gene_family_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:81f1c621f72580ad146f53f64427bcdd7a1341a413a9e18524d1a3c429e2e58a
3
+ size 9816
data_processed/card/card_gene_family_y_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c9e2e28392fef8a81d8cee68b38bc2abbf7d6ab8794a8eb60a0479b7e71b849c
3
+ size 34024
data_processed/card/card_gene_family_y_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6ce18e6526a29f25156ee6e8de04ed9dde10ca095006621e48074c37724addab
3
+ size 4976
data_processed/card/card_mechanism_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:37754e2ffccec906083a21935e46f862471a3945ffa65b46fea2c87ae191bb5d
3
+ size 4844128
data_processed/card/card_mechanism_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7053e2e0734de9ae8c46483fad2fb11c8d181f371c26d250cbfe5eea2abe3355
3
+ size 16948128
data_processed/card/card_mechanism_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:562364532d2014e72213a80cd2191590b9963132f29c09843abf6b6a1fc6df1b
3
+ size 2424128
data_processed/card/card_mechanism_metadata.json ADDED
@@ -0,0 +1,523 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "AAA",
4
+ "IGL",
5
+ "ALA",
6
+ "KTG",
7
+ "LLL",
8
+ "AAL",
9
+ "LAA",
10
+ "ISL",
11
+ "RLD",
12
+ "GLE",
13
+ "TGS",
14
+ "LEQ",
15
+ "ALG",
16
+ "AST",
17
+ "GPL",
18
+ "DLA",
19
+ "ALQ",
20
+ "SIG",
21
+ "AAV",
22
+ "RDT",
23
+ "LDA",
24
+ "PAS",
25
+ "LAT",
26
+ "LGL",
27
+ "ALL",
28
+ "STF",
29
+ "LAR",
30
+ "LDR",
31
+ "SAA",
32
+ "VDA",
33
+ "IPG",
34
+ "TFK",
35
+ "LPA",
36
+ "GDA",
37
+ "PLK",
38
+ "AVL",
39
+ "LAQ",
40
+ "LAV",
41
+ "GYG",
42
+ "VAL",
43
+ "ATL",
44
+ "LFG",
45
+ "ANL",
46
+ "ALI",
47
+ "LAS",
48
+ "QAL",
49
+ "GLG",
50
+ "TLL",
51
+ "GLP",
52
+ "LPL",
53
+ "VAF",
54
+ "GFG",
55
+ "SLL",
56
+ "RVG",
57
+ "AVA",
58
+ "LSA",
59
+ "VLA",
60
+ "LAN",
61
+ "SDN",
62
+ "LTA",
63
+ "PAL",
64
+ "SLK",
65
+ "TTG",
66
+ "AGG",
67
+ "LDL",
68
+ "YGN",
69
+ "DDR",
70
+ "TLA",
71
+ "LLA",
72
+ "AIP",
73
+ "GLA",
74
+ "QRL",
75
+ "GWE",
76
+ "GVK",
77
+ "ATY",
78
+ "LLD",
79
+ "GLF",
80
+ "LLR",
81
+ "IAA",
82
+ "LGW",
83
+ "GSV",
84
+ "RIG",
85
+ "ERL",
86
+ "SKE",
87
+ "YGV",
88
+ "TAG",
89
+ "EQQ",
90
+ "NLL",
91
+ "ELG",
92
+ "RFP",
93
+ "LEK",
94
+ "LLT",
95
+ "LKR",
96
+ "GST",
97
+ "AGL",
98
+ "GGL",
99
+ "SVS",
100
+ "LVD",
101
+ "FGA",
102
+ "LSG",
103
+ "KAL",
104
+ "DER",
105
+ "KRL",
106
+ "TLG",
107
+ "LAG",
108
+ "VGD",
109
+ "TPA",
110
+ "NAL",
111
+ "TYT",
112
+ "SAI",
113
+ "TLF",
114
+ "AIA",
115
+ "GGP",
116
+ "LAL",
117
+ "LGD",
118
+ "LGS",
119
+ "ALV",
120
+ "ERF",
121
+ "LTG",
122
+ "PVT",
123
+ "TAT",
124
+ "AER",
125
+ "LVI",
126
+ "LGI",
127
+ "LKG",
128
+ "DTT",
129
+ "AAN",
130
+ "LTL",
131
+ "GVL",
132
+ "EIG",
133
+ "ARS",
134
+ "NTA",
135
+ "LFE",
136
+ "TTP",
137
+ "LAE",
138
+ "VPA",
139
+ "QTL",
140
+ "VGP",
141
+ "VSK",
142
+ "AGN",
143
+ "ASA",
144
+ "EAA",
145
+ "ASK",
146
+ "RAS",
147
+ "RLY",
148
+ "VTP",
149
+ "QLG",
150
+ "GIA",
151
+ "GLV",
152
+ "VTA",
153
+ "ADI",
154
+ "KSL",
155
+ "AAR",
156
+ "SQR",
157
+ "EQL",
158
+ "MKA",
159
+ "APA",
160
+ "ASL",
161
+ "TTT",
162
+ "TSA",
163
+ "YRQ",
164
+ "QGL",
165
+ "SYG",
166
+ "TAF",
167
+ "ATT",
168
+ "LLS",
169
+ "ADL",
170
+ "PLQ",
171
+ "AAS",
172
+ "TLP",
173
+ "SKT",
174
+ "GKA",
175
+ "IGD",
176
+ "PLL",
177
+ "KAS",
178
+ "ALE",
179
+ "LLF",
180
+ "KTF",
181
+ "RRI",
182
+ "PAP",
183
+ "AQA",
184
+ "VLV",
185
+ "ALP",
186
+ "PAD",
187
+ "AVI",
188
+ "AIS",
189
+ "AYA",
190
+ "LGG",
191
+ "AIL",
192
+ "LIG",
193
+ "DAE",
194
+ "YVA",
195
+ "LVG",
196
+ "GAA",
197
+ "SLG",
198
+ "LNA",
199
+ "SAL",
200
+ "STL",
201
+ "DRP",
202
+ "IAR",
203
+ "PAG",
204
+ "ATA",
205
+ "DMT",
206
+ "MKK",
207
+ "IVA",
208
+ "DEV",
209
+ "DLL",
210
+ "QPQ",
211
+ "QLA",
212
+ "PGD",
213
+ "DGK",
214
+ "LLN",
215
+ "NDI",
216
+ "QDK",
217
+ "IAD",
218
+ "NKT",
219
+ "DKT",
220
+ "KLA",
221
+ "TGA",
222
+ "MLN",
223
+ "EAY",
224
+ "PGM",
225
+ "YSN",
226
+ "ARL",
227
+ "WQP",
228
+ "YTA",
229
+ "LCG",
230
+ "FTA",
231
+ "GAV",
232
+ "ANK",
233
+ "LEG",
234
+ "FPD",
235
+ "YPN",
236
+ "VQP",
237
+ "VGW",
238
+ "AVQ",
239
+ "KTL",
240
+ "LKI",
241
+ "LKA",
242
+ "DLV",
243
+ "ILS",
244
+ "ISA",
245
+ "GNT",
246
+ "FSY",
247
+ "ALD",
248
+ "YGL",
249
+ "SNP",
250
+ "QAG",
251
+ "PSI",
252
+ "QYS",
253
+ "GNA",
254
+ "LGV",
255
+ "IGS",
256
+ "GDK",
257
+ "KIS",
258
+ "KAE",
259
+ "SVQ",
260
+ "FWL",
261
+ "PGP",
262
+ "LLG",
263
+ "AEL",
264
+ "AYG",
265
+ "ETL",
266
+ "PLA",
267
+ "QQG",
268
+ "KSG",
269
+ "ARR",
270
+ "LVT",
271
+ "MTL",
272
+ "DAA",
273
+ "VLL",
274
+ "GYA",
275
+ "MAV",
276
+ "RLL",
277
+ "GKP",
278
+ "GAL",
279
+ "KLL",
280
+ "VKT",
281
+ "APL",
282
+ "FGY",
283
+ "NEA",
284
+ "TLR",
285
+ "LQF",
286
+ "ITP",
287
+ "NPS",
288
+ "IKK",
289
+ "VAI",
290
+ "YAK",
291
+ "LNK",
292
+ "GMT",
293
+ "EIK",
294
+ "QWQ",
295
+ "ALS",
296
+ "GWV",
297
+ "DTP",
298
+ "SEK",
299
+ "VPG",
300
+ "LLI",
301
+ "LDD",
302
+ "KKS",
303
+ "FPA",
304
+ "LRF",
305
+ "KEL",
306
+ "LLK",
307
+ "ILA",
308
+ "HKT",
309
+ "AGE",
310
+ "GPG",
311
+ "YGK",
312
+ "IAG",
313
+ "VMK",
314
+ "GIV",
315
+ "APQ",
316
+ "GSR",
317
+ "VPL",
318
+ "GIS",
319
+ "ARA",
320
+ "FAA",
321
+ "GDE",
322
+ "VIY",
323
+ "IAL",
324
+ "ADK",
325
+ "AND",
326
+ "DRA",
327
+ "QQV",
328
+ "STN",
329
+ "NAE",
330
+ "GVA",
331
+ "PLD",
332
+ "LPF",
333
+ "QFP",
334
+ "DIA",
335
+ "VPE",
336
+ "KDQ",
337
+ "TYA",
338
+ "TFT",
339
+ "DVP",
340
+ "STS",
341
+ "ADE",
342
+ "VNP",
343
+ "PVY",
344
+ "KDD",
345
+ "VYQ",
346
+ "ELA",
347
+ "GEA",
348
+ "LRK",
349
+ "TGW",
350
+ "NDL",
351
+ "ARV",
352
+ "GPA",
353
+ "EKH",
354
+ "QIA",
355
+ "AIT",
356
+ "AMA",
357
+ "AAD",
358
+ "IYA",
359
+ "TRL",
360
+ "YAQ",
361
+ "EQT",
362
+ "LEL",
363
+ "TNG",
364
+ "GMA",
365
+ "LIA",
366
+ "TGV",
367
+ "GKV",
368
+ "DLG",
369
+ "AQT",
370
+ "KLS",
371
+ "FVP",
372
+ "AQG",
373
+ "LNE",
374
+ "PIS",
375
+ "IVM",
376
+ "GAY",
377
+ "PET",
378
+ "QDL",
379
+ "AYV",
380
+ "ATV",
381
+ "LVL",
382
+ "VML",
383
+ "NGF",
384
+ "LLQ",
385
+ "VFK",
386
+ "ERI",
387
+ "GDM",
388
+ "EKN",
389
+ "AFV",
390
+ "QVL",
391
+ "FVD",
392
+ "VQD",
393
+ "MTV",
394
+ "FEL",
395
+ "EVK",
396
+ "AKS",
397
+ "GSQ",
398
+ "LFA",
399
+ "IKA",
400
+ "AAT",
401
+ "ELN",
402
+ "LGA",
403
+ "VAE",
404
+ "VKA",
405
+ "KIA",
406
+ "PEL",
407
+ "DNT",
408
+ "KAA",
409
+ "LYA",
410
+ "RKL",
411
+ "ADR",
412
+ "LPV",
413
+ "AEA",
414
+ "EER",
415
+ "KVA",
416
+ "KAN",
417
+ "ASI",
418
+ "LQA",
419
+ "WLV",
420
+ "VIL",
421
+ "PLR",
422
+ "AFS",
423
+ "PHY",
424
+ "HRI",
425
+ "VTE",
426
+ "DRL",
427
+ "AVK",
428
+ "FIP",
429
+ "VVA",
430
+ "RIA",
431
+ "AAI",
432
+ "YQG",
433
+ "LTN",
434
+ "YLA",
435
+ "AFL",
436
+ "ERV",
437
+ "LQP",
438
+ "QVG",
439
+ "GQP",
440
+ "GRR",
441
+ "HPE",
442
+ "LQG",
443
+ "LNL",
444
+ "SGG",
445
+ "TPQ",
446
+ "SYV",
447
+ "WVV",
448
+ "QAQ",
449
+ "DSV",
450
+ "ARG",
451
+ "NST",
452
+ "TPE",
453
+ "RPL",
454
+ "HYF",
455
+ "EQI",
456
+ "LVA",
457
+ "RSL",
458
+ "QQL",
459
+ "YFT",
460
+ "APG",
461
+ "GEL",
462
+ "FDG",
463
+ "SGL",
464
+ "SGA",
465
+ "AQI",
466
+ "QPV",
467
+ "FSL",
468
+ "GYL",
469
+ "PNA",
470
+ "RAP",
471
+ "QRA",
472
+ "LFI",
473
+ "ANR",
474
+ "GNL",
475
+ "VLG",
476
+ "LFP",
477
+ "QKD",
478
+ "GIL",
479
+ "EKI",
480
+ "SPV",
481
+ "DQA",
482
+ "LER",
483
+ "RLG",
484
+ "DAI",
485
+ "TAA",
486
+ "PDS",
487
+ "RDL",
488
+ "VTR",
489
+ "DAG",
490
+ "AEG",
491
+ "SLI",
492
+ "FKW",
493
+ "VAG",
494
+ "VAV",
495
+ "RFV",
496
+ "GAG",
497
+ "GLT",
498
+ "VKR",
499
+ "RQQ",
500
+ "RVE",
501
+ "KGE",
502
+ "TRF"
503
+ ],
504
+ "class_names": [
505
+ "antibiotic efflux",
506
+ "antibiotic efflux;antibiotic target alteration",
507
+ "antibiotic efflux;reduced permeability to antibiotic",
508
+ "antibiotic inactivation",
509
+ "antibiotic target alteration",
510
+ "antibiotic target alteration;antibiotic target replacement",
511
+ "antibiotic target protection",
512
+ "antibiotic target replacement",
513
+ "reduced permeability to antibiotic",
514
+ "resistance by host-dependent nutrient acquisition"
515
+ ],
516
+ "task_type": "multiclass",
517
+ "target": "mechanism",
518
+ "k": 3,
519
+ "max_features": 500,
520
+ "n_samples": 6054,
521
+ "n_features": 500,
522
+ "n_classes": 10
523
+ }
data_processed/card/card_mechanism_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1b9d681610c6a31460738f88eb79edb53a953e31db719fc2a9eae01a25a511c6
3
+ size 9816
data_processed/card/card_mechanism_y_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ccb1d3c6f9de05eab30cc90f7a1cac15a717f3757b177d632b1f4fc543278c01
3
+ size 34024
data_processed/card/card_mechanism_y_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a8bf2328cff6f19f393b6faab709ba383972f3125d0626107a6599e90248fc07
3
+ size 4976
data_processed/ncbi/ncbi_amr_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8b27b9af79f60246b39456a5c338cd16f9279cdf57952f701c18729e94891191
3
+ size 692128
data_processed/ncbi/ncbi_amr_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5cb410aa23501e07ab67ecfe4dffd2004ce8db235ee4595ff22efba0f3664193
3
+ size 2408128
data_processed/ncbi/ncbi_amr_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:524db56ddde448beb99d4da060ad6ac0aaee3828565b9900f5202655e9f64514
3
+ size 348128
data_processed/ncbi/ncbi_amr_metadata.json ADDED
@@ -0,0 +1,537 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "AAAAAA",
4
+ "TTTTTT",
5
+ "AAAAAT",
6
+ "ATTTTT",
7
+ "TAAAAA",
8
+ "TTTTTA",
9
+ "TTTTTC",
10
+ "TTTAAA",
11
+ "GAAAAA",
12
+ "AAAATA",
13
+ "TATTTT",
14
+ "TTAAAA",
15
+ "TTTTAA",
16
+ "AATAAA",
17
+ "CTTTTT",
18
+ "AAAATT",
19
+ "AAAAAG",
20
+ "TTTATT",
21
+ "AATTTT",
22
+ "AAATAA",
23
+ "ATAAAA",
24
+ "TTTTAT",
25
+ "TTATTT",
26
+ "CAAAAA",
27
+ "GCCAGC",
28
+ "CGCCAG",
29
+ "TTTTTG",
30
+ "TTTTCA",
31
+ "CGCCGC",
32
+ "GCTGGC",
33
+ "CAGCAG",
34
+ "CTGCTG",
35
+ "TGAAAA",
36
+ "GCGGCG",
37
+ "CTGGCG",
38
+ "AAATTT",
39
+ "TTCTTT",
40
+ "AAAAAC",
41
+ "AAAGAA",
42
+ "TTTCTT",
43
+ "GTTTTT",
44
+ "AAGAAA",
45
+ "ATTAAA",
46
+ "GCCGCC",
47
+ "TTTAAT",
48
+ "TTTTCT",
49
+ "CAGCGC",
50
+ "TAAAAT",
51
+ "AGAAAA",
52
+ "AATAAT",
53
+ "ATTTTA",
54
+ "TCTTTT",
55
+ "GGCGGC",
56
+ "CCAGCG",
57
+ "ATTATT",
58
+ "GCGCTG",
59
+ "CCAGCA",
60
+ "AATATT",
61
+ "AAAAGA",
62
+ "GCGCCG",
63
+ "CATTTT",
64
+ "CGGCGC",
65
+ "AAAATC",
66
+ "AAATCA",
67
+ "GCAAAA",
68
+ "AAAATG",
69
+ "TGCTGG",
70
+ "CGCTGG",
71
+ "TCAAAA",
72
+ "TTAAAT",
73
+ "ATATTT",
74
+ "AAATAT",
75
+ "AAATTA",
76
+ "ATTTAA",
77
+ "TTTTGA",
78
+ "GATTTT",
79
+ "TTTTGC",
80
+ "TGATTT",
81
+ "AAAACA",
82
+ "TAATTT",
83
+ "TTAATT",
84
+ "TGCTGC",
85
+ "CCGCCG",
86
+ "GCAGCA",
87
+ "CAGCAA",
88
+ "AATTAA",
89
+ "ATTTTC",
90
+ "TCGCCG",
91
+ "TGTTTT",
92
+ "TTCTTC",
93
+ "AAAAGC",
94
+ "CAAAAT",
95
+ "CGGCGA",
96
+ "TTGCTG",
97
+ "CGGCGG",
98
+ "GCTTTT",
99
+ "GAAAAT",
100
+ "ATTTTG",
101
+ "TTTCAA",
102
+ "CGCCGG",
103
+ "GAAGAA",
104
+ "ATAAAT",
105
+ "CCGGCG",
106
+ "GCGCCA",
107
+ "TTTCAT",
108
+ "ATCAAA",
109
+ "TAAATA",
110
+ "ATTTAT",
111
+ "ATCGCC",
112
+ "TATTTA",
113
+ "TTTGAT",
114
+ "ATGAAA",
115
+ "TTGAAA",
116
+ "ATCAAT",
117
+ "TTGTTT",
118
+ "TTTATC",
119
+ "AAACAA",
120
+ "TCATCA",
121
+ "GATAAA",
122
+ "CAATTT",
123
+ "AACAAA",
124
+ "TCAGCA",
125
+ "CGCCGA",
126
+ "CGCCTG",
127
+ "TGGCGC",
128
+ "TTCAAT",
129
+ "ATTAAT",
130
+ "GGCGAT",
131
+ "ACAAAA",
132
+ "AGCAGC",
133
+ "TCGGCG",
134
+ "TTTGTT",
135
+ "ACCAGC",
136
+ "CCGCCA",
137
+ "TTCAAA",
138
+ "AAATTG",
139
+ "TTCATC",
140
+ "GCTGCT",
141
+ "ATTGAT",
142
+ "TGCTGA",
143
+ "ATTGAA",
144
+ "CGCGCC",
145
+ "CATCAA",
146
+ "CAGGCG",
147
+ "TTTGAA",
148
+ "TAAATT",
149
+ "GCTGAA",
150
+ "GCGCGC",
151
+ "TGATGA",
152
+ "TTCAGC",
153
+ "GCTGGT",
154
+ "TTTTGT",
155
+ "AAACCA",
156
+ "AAAGCA",
157
+ "TCATTT",
158
+ "AATCAA",
159
+ "AAATGA",
160
+ "GATGAA",
161
+ "TGCTTT",
162
+ "TAATAA",
163
+ "TAAAAC",
164
+ "AATTTC",
165
+ "TTCATT",
166
+ "TCCAGC",
167
+ "GGCGCG",
168
+ "TGGTTT",
169
+ "GCCTGC",
170
+ "TTATTA",
171
+ "AGCGCC",
172
+ "GCCGGC",
173
+ "ATTTCA",
174
+ "TCGCCA",
175
+ "TGGCGG",
176
+ "CTTCTT",
177
+ "CTGCGC",
178
+ "AATTTA",
179
+ "TTGATT",
180
+ "AATGAA",
181
+ "GTTTTA",
182
+ "GAAATT",
183
+ "TGAAAT",
184
+ "TTGATG",
185
+ "AGCAAA",
186
+ "GCTGGA",
187
+ "GCAGCG",
188
+ "CGCTGC",
189
+ "GCAGGC",
190
+ "TTTTAC",
191
+ "CAATAA",
192
+ "GTAAAA",
193
+ "ATCAGC",
194
+ "TGGCGA",
195
+ "CACCAG",
196
+ "GGCGCT",
197
+ "CTTTAA",
198
+ "GCGCAG",
199
+ "TCTTCA",
200
+ "AAAACT",
201
+ "AAGAAG",
202
+ "TTTGCT",
203
+ "TTAAAG",
204
+ "AAAACC",
205
+ "GCTGAT",
206
+ "TCTTTA",
207
+ "CAACAA",
208
+ "TTTTCC",
209
+ "ACTTTT",
210
+ "TAAAGA",
211
+ "TGCCGC",
212
+ "CCTGCT",
213
+ "CCGCGC",
214
+ "CTTTAT",
215
+ "AGTTTT",
216
+ "GCCGCG",
217
+ "TTATTG",
218
+ "GGAAAA",
219
+ "GCGCGG",
220
+ "TGAAGA",
221
+ "TCAATT",
222
+ "ATAAAG",
223
+ "CTGAAA",
224
+ "GGTTTT",
225
+ "CTGGTG",
226
+ "CGCGGC",
227
+ "AACAGC",
228
+ "AAAAGT",
229
+ "ATCTTT",
230
+ "GCGGCA",
231
+ "AAAGAT",
232
+ "TTGTTG",
233
+ "ATTTCT",
234
+ "AATTGA",
235
+ "CAGCGG",
236
+ "AGAAAT",
237
+ "CGGCAA",
238
+ "TTTCAG",
239
+ "TAAAGC",
240
+ "CCGCTG",
241
+ "CTTCAA",
242
+ "CCACCA",
243
+ "GCTTTA",
244
+ "CTAAAA",
245
+ "GCTGTT",
246
+ "AAGCAA",
247
+ "AATTGC",
248
+ "AACAAT",
249
+ "TTCGCC",
250
+ "TTGAAG",
251
+ "GCAATT",
252
+ "TTGCTT",
253
+ "CTTTTA",
254
+ "TAAAAG",
255
+ "AGCAGG",
256
+ "AGCAAT",
257
+ "TTGCCG",
258
+ "TTTTAG",
259
+ "ATTGCT",
260
+ "CAGCCA",
261
+ "GATATT",
262
+ "CTGCCG",
263
+ "AATATC",
264
+ "CGGCAG",
265
+ "CAATAT",
266
+ "AACTTT",
267
+ "ATAATT",
268
+ "CCATTT",
269
+ "CAGCTT",
270
+ "ATCATC",
271
+ "TTGCCA",
272
+ "CGCCAT",
273
+ "AATTAT",
274
+ "GGCGAA",
275
+ "TCAATA",
276
+ "AAGCTG",
277
+ "TTTCTG",
278
+ "CTGTTT",
279
+ "CCAGGC",
280
+ "TGCGCC",
281
+ "CAAATT",
282
+ "TGGCAA",
283
+ "ATATTG",
284
+ "TTTGCC",
285
+ "AAAACG",
286
+ "AATCAT",
287
+ "GGCAAA",
288
+ "ATCATT",
289
+ "CATTAA",
290
+ "GCCTGG",
291
+ "GGTAAA",
292
+ "CAGAAA",
293
+ "AAACAG",
294
+ "CGTTTT",
295
+ "TGGCTG",
296
+ "CCAGCC",
297
+ "ATTGTT",
298
+ "AATTTG",
299
+ "AAATGG",
300
+ "CATCAT",
301
+ "TTTACC",
302
+ "TATAAA",
303
+ "TATCAA",
304
+ "TTTATA",
305
+ "TATTGA",
306
+ "ACCGCC",
307
+ "AAATTC",
308
+ "TCACCA",
309
+ "TTTCCA",
310
+ "AAAGTT",
311
+ "CTGGAA",
312
+ "GCCAGG",
313
+ "CCGGCA",
314
+ "TTCCAG",
315
+ "GGCGCA",
316
+ "ATGGCG",
317
+ "TTGATA",
318
+ "CCTGGC",
319
+ "TGGTGG",
320
+ "ATGATT",
321
+ "TTAATG",
322
+ "CAGCAT",
323
+ "CAGCAC",
324
+ "ATGATG",
325
+ "GGCTGG",
326
+ "CGCAGC",
327
+ "CTTTTG",
328
+ "AATGAT",
329
+ "ATGCTG",
330
+ "GAATTT",
331
+ "TGGAAA",
332
+ "TTAATA",
333
+ "TTATCA",
334
+ "GATGAT",
335
+ "ATCACC",
336
+ "TTTAAC",
337
+ "CAAAAC",
338
+ "TATTAA",
339
+ "TGCCGG",
340
+ "ACCAAA",
341
+ "TGGTGA",
342
+ "GCATCA",
343
+ "GCTGCG",
344
+ "CAAAAG",
345
+ "TGAATA",
346
+ "GCCGCT",
347
+ "GTTAAA",
348
+ "AAACTT",
349
+ "TGATAA",
350
+ "CAATCA",
351
+ "CTGGCC",
352
+ "AGCGGC",
353
+ "TGCAAA",
354
+ "GGCCAG",
355
+ "GCCATC",
356
+ "GCATTT",
357
+ "TATTCA",
358
+ "TTCACC",
359
+ "TAAATC",
360
+ "AGCTTT",
361
+ "AAATGC",
362
+ "AAAGCT",
363
+ "GGTGAT",
364
+ "GGCGGT",
365
+ "GCTTCA",
366
+ "TGATGC",
367
+ "TCAGCG",
368
+ "CTTCAT",
369
+ "GTTTTG",
370
+ "GTGCTG",
371
+ "CCAGTT",
372
+ "AACTGG",
373
+ "ATCGGC",
374
+ "GCCATT",
375
+ "TGCAGC",
376
+ "CGCTGA",
377
+ "CAATTG",
378
+ "GGTGAA",
379
+ "TTTGCA",
380
+ "GCTGCC",
381
+ "CTGCAA",
382
+ "GCCGAT",
383
+ "CCATCA",
384
+ "GCTGCA",
385
+ "GGCAGC",
386
+ "TCTTCT",
387
+ "TTGCAG",
388
+ "AATGGC",
389
+ "CCAAAA",
390
+ "ACGCCG",
391
+ "CGGCGT",
392
+ "ATTCAA",
393
+ "AACCAA",
394
+ "TTTTGG",
395
+ "TCAACA",
396
+ "TTTGGT",
397
+ "CAGGCC",
398
+ "TTGAAT",
399
+ "ATATTC",
400
+ "ACCAAT",
401
+ "CATAAA",
402
+ "TTCAAC",
403
+ "TGATTG",
404
+ "GATTTA",
405
+ "TTGTTC",
406
+ "TGAAGC",
407
+ "ATCAAC",
408
+ "TTCTGC",
409
+ "GGCCTG",
410
+ "TTGGTT",
411
+ "CATCGC",
412
+ "GATGGC",
413
+ "ACCACC",
414
+ "ATGAAG",
415
+ "AAAGCC",
416
+ "GAACAA",
417
+ "AAATAC",
418
+ "GAATAT",
419
+ "TCGGCA",
420
+ "ATTGGT",
421
+ "GGCTTT",
422
+ "GCAAAT",
423
+ "AAGTTT",
424
+ "ACCTGC",
425
+ "ATTTGC",
426
+ "CACCGC",
427
+ "GTTGAA",
428
+ "TGTTGA",
429
+ "GTTGAT",
430
+ "ATTCAT",
431
+ "ACAGCA",
432
+ "GCACCA",
433
+ "CGCTTT",
434
+ "TGATCA",
435
+ "AATTCA",
436
+ "ATCTTC",
437
+ "CCTGCA",
438
+ "AACAAC",
439
+ "GCAGAA",
440
+ "AACGCC",
441
+ "TGCCGA",
442
+ "ATTCTT",
443
+ "CGCCAC",
444
+ "GTATTT",
445
+ "GGCAAT",
446
+ "TCAAAT",
447
+ "ATTGCC",
448
+ "TGCTGT",
449
+ "AAACTG",
450
+ "AGAAGA",
451
+ "TTTATG",
452
+ "CGCCAA",
453
+ "CTGCTT",
454
+ "GCAGGT",
455
+ "TGATGG",
456
+ "TGTTCA",
457
+ "GCCAAA",
458
+ "GCAACA",
459
+ "GGCGCC",
460
+ "TGGTGC",
461
+ "ATGCCG",
462
+ "CAGTTT",
463
+ "TGTTGC",
464
+ "ATGAAT",
465
+ "GCGATG",
466
+ "CACCAC",
467
+ "TGAATT",
468
+ "AAAGCG",
469
+ "TGAACA",
470
+ "CATCAG",
471
+ "GGTGGT",
472
+ "GAAAAC",
473
+ "TTAAAC",
474
+ "GCGCCT",
475
+ "GTTTTC",
476
+ "CGGCAT",
477
+ "GAAGAT",
478
+ "AGCAAC",
479
+ "TTTCAC",
480
+ "TGCAGG",
481
+ "ATAAAC",
482
+ "CTGGCA",
483
+ "CGCCGT",
484
+ "GCGGTG",
485
+ "CCAGCT",
486
+ "TTTGGC",
487
+ "TAATGA",
488
+ "CAGCGA",
489
+ "TCATTA",
490
+ "CTGATT",
491
+ "ATTTGA",
492
+ "AAGAAT",
493
+ "ACCATT",
494
+ "ACGGCG",
495
+ "GTTGTT",
496
+ "CGGCCA",
497
+ "AGCTGG",
498
+ "AAATCT",
499
+ "AGGCGC",
500
+ "CTGTTC",
501
+ "GCCACC",
502
+ "AAGCAG"
503
+ ],
504
+ "class_names": [
505
+ "aminoglycoside",
506
+ "beta-lactam",
507
+ "fosfomycin",
508
+ "glycopeptide",
509
+ "macrolide",
510
+ "phenicol",
511
+ "quinolone",
512
+ "rifampicin",
513
+ "sulfonamide",
514
+ "tetracycline",
515
+ "trimethoprim"
516
+ ],
517
+ "task_type": "multilabel",
518
+ "target": "amr_drug_class",
519
+ "k": 6,
520
+ "max_features": 500,
521
+ "n_samples": 862,
522
+ "n_features": 500,
523
+ "n_classes": 11,
524
+ "drug_classes": [
525
+ "aminoglycoside",
526
+ "beta-lactam",
527
+ "fosfomycin",
528
+ "glycopeptide",
529
+ "macrolide",
530
+ "phenicol",
531
+ "quinolone",
532
+ "rifampicin",
533
+ "sulfonamide",
534
+ "tetracycline",
535
+ "trimethoprim"
536
+ ]
537
+ }
data_processed/ncbi/ncbi_amr_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e9cfd43cdd5225068480adc4532ae92590f8562cae1b3d7e6dd6b7b2ec41f97
3
+ size 15352
data_processed/ncbi/ncbi_amr_y_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14bf49fa3126d571873acfc454ca684d26aeef410c58f64df82132c326b905d1
3
+ size 53104
data_processed/ncbi/ncbi_amr_y_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c6344c6b50da9f524d488188950a1441175619e2edaee2e953295bbe02c1bc0
3
+ size 7784
data_processed/ncbi/ncbi_organism_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4df7f95f1e87a45419e5da13dd13d80f9423f5ed1c2a72bf9ef5b2f2e6eb493c
3
+ size 692128
data_processed/ncbi/ncbi_organism_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:03a1d3b21f2a2bda4a4008d3fd7c00dedaa661ecb8513a2b1c67c17fb338b5c5
3
+ size 2408128
data_processed/ncbi/ncbi_organism_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a312735852dca85167dd7ac425542f9dc70f91aa7266df0a196a5ad2594e775
3
+ size 348128
data_processed/ncbi/ncbi_organism_metadata.json ADDED
@@ -0,0 +1,521 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "AAAAAA",
4
+ "TTTTTT",
5
+ "ATTTTT",
6
+ "AAAAAT",
7
+ "TAAAAA",
8
+ "TTTTTA",
9
+ "TTTTTC",
10
+ "TTTAAA",
11
+ "GAAAAA",
12
+ "AAAATA",
13
+ "TTAAAA",
14
+ "TATTTT",
15
+ "TTTTAA",
16
+ "AATAAA",
17
+ "CTTTTT",
18
+ "AATTTT",
19
+ "AAAAAG",
20
+ "TTTATT",
21
+ "AAAATT",
22
+ "AAATAA",
23
+ "ATAAAA",
24
+ "TTTTAT",
25
+ "TTATTT",
26
+ "CAAAAA",
27
+ "GCCAGC",
28
+ "CGCCAG",
29
+ "TTTTCA",
30
+ "TTTTTG",
31
+ "CGCCGC",
32
+ "GCTGGC",
33
+ "TGAAAA",
34
+ "CAGCAG",
35
+ "GCGGCG",
36
+ "CTGGCG",
37
+ "CTGCTG",
38
+ "TTCTTT",
39
+ "AAATTT",
40
+ "AAAAAC",
41
+ "TTTCTT",
42
+ "AAAGAA",
43
+ "ATTAAA",
44
+ "AAGAAA",
45
+ "GTTTTT",
46
+ "GCCGCC",
47
+ "TTTTCT",
48
+ "TTTAAT",
49
+ "CAGCGC",
50
+ "TAAAAT",
51
+ "AATAAT",
52
+ "AGAAAA",
53
+ "GGCGGC",
54
+ "CCAGCG",
55
+ "TCTTTT",
56
+ "ATTTTA",
57
+ "CCAGCA",
58
+ "ATTATT",
59
+ "CGGCGC",
60
+ "GCGCTG",
61
+ "AAAAGA",
62
+ "GCAAAA",
63
+ "CATTTT",
64
+ "AATATT",
65
+ "GCGCCG",
66
+ "AAAATG",
67
+ "CGCTGG",
68
+ "AAAATC",
69
+ "TTAAAT",
70
+ "AAATCA",
71
+ "TGCTGG",
72
+ "ATATTT",
73
+ "TCAAAA",
74
+ "AAATAT",
75
+ "ATTTAA",
76
+ "TTTTGC",
77
+ "GATTTT",
78
+ "TTTTGA",
79
+ "AAATTA",
80
+ "TGATTT",
81
+ "AAAACA",
82
+ "TAATTT",
83
+ "AATTAA",
84
+ "TTAATT",
85
+ "CAGCAA",
86
+ "CCGCCG",
87
+ "ATTTTC",
88
+ "GCAGCA",
89
+ "TCGCCG",
90
+ "TGCTGC",
91
+ "TGTTTT",
92
+ "CAAAAT",
93
+ "AAAAGC",
94
+ "TTCTTC",
95
+ "CGGCGA",
96
+ "GCTTTT",
97
+ "TTGCTG",
98
+ "CGGCGG",
99
+ "ATTTTG",
100
+ "GAAAAT",
101
+ "CGCCGG",
102
+ "ATAAAT",
103
+ "TTTCAA",
104
+ "GAAGAA",
105
+ "CCGGCG",
106
+ "GCGCCA",
107
+ "TTTCAT",
108
+ "ATCAAA",
109
+ "ATTTAT",
110
+ "TAAATA",
111
+ "ATGAAA",
112
+ "ATCGCC",
113
+ "TTGTTT",
114
+ "TTTGAT",
115
+ "TTGAAA",
116
+ "AAACAA",
117
+ "TATTTA",
118
+ "ATCAAT",
119
+ "AACAAA",
120
+ "TTTATC",
121
+ "CGCCTG",
122
+ "TCAGCA",
123
+ "TCATCA",
124
+ "GATAAA",
125
+ "TGGCGC",
126
+ "CGCCGA",
127
+ "CCGCCA",
128
+ "TTCAAA",
129
+ "AAATTG",
130
+ "TTTGTT",
131
+ "TTCATC",
132
+ "ACAAAA",
133
+ "ATTGAT",
134
+ "AGCAGC",
135
+ "TTCAAT",
136
+ "ACCAGC",
137
+ "TCGGCG",
138
+ "TGCTGA",
139
+ "CAATTT",
140
+ "ATTAAT",
141
+ "GCTGCT",
142
+ "TTTGAA",
143
+ "GCGCGC",
144
+ "CATCAA",
145
+ "CGCGCC",
146
+ "GGCGAT",
147
+ "TTTTGT",
148
+ "TTCAGC",
149
+ "AAAGCA",
150
+ "ATTGAA",
151
+ "AAACCA",
152
+ "TAAATT",
153
+ "CAGGCG",
154
+ "TGCTTT",
155
+ "TAAAAC",
156
+ "TGATGA",
157
+ "GCTGGT",
158
+ "TCATTT",
159
+ "AAATGA",
160
+ "GCTGAA",
161
+ "TTCATT",
162
+ "GATGAA",
163
+ "CTTCTT",
164
+ "TCGCCA",
165
+ "AATCAA",
166
+ "TAATAA",
167
+ "TCCAGC",
168
+ "AGCGCC",
169
+ "TGGTTT",
170
+ "ATTTCA",
171
+ "GGCGCG",
172
+ "TTGATG",
173
+ "GTTTTA",
174
+ "TTATTA",
175
+ "TTGATT",
176
+ "AATGAA",
177
+ "TGGCGG",
178
+ "GCCTGC",
179
+ "AATTTC",
180
+ "TGAAAT",
181
+ "AATTTA",
182
+ "GCCGGC",
183
+ "GAAATT",
184
+ "GCAGCG",
185
+ "CACCAG",
186
+ "CTGCGC",
187
+ "GCTGGA",
188
+ "CAATAA",
189
+ "CGCTGC",
190
+ "GCGCAG",
191
+ "TTTTAC",
192
+ "AGCAAA",
193
+ "ATCAGC",
194
+ "GTAAAA",
195
+ "TCTTCA",
196
+ "AAAACT",
197
+ "CTTTAA",
198
+ "GGCGCT",
199
+ "AAGAAG",
200
+ "TGGCGA",
201
+ "TTAAAG",
202
+ "GCAGGC",
203
+ "GCTGAT",
204
+ "TTTGCT",
205
+ "TTTTCC",
206
+ "CAACAA",
207
+ "TTATTG",
208
+ "AAAACC",
209
+ "CTTTAT",
210
+ "TAAAGA",
211
+ "ACTTTT",
212
+ "TCTTTA",
213
+ "CCTGCT",
214
+ "CCGCGC",
215
+ "ATAAAG",
216
+ "TGCCGC",
217
+ "GGAAAA",
218
+ "AGTTTT",
219
+ "CGGCAA",
220
+ "TCAATT",
221
+ "GCGCGG",
222
+ "GCGGCA",
223
+ "ATCTTT",
224
+ "ATTTCT",
225
+ "GCCGCG",
226
+ "CTGGTG",
227
+ "GGTTTT",
228
+ "CTTCAA",
229
+ "TGAAGA",
230
+ "TTTCAG",
231
+ "AACAGC",
232
+ "AAAGAT",
233
+ "CGCGGC",
234
+ "AATTGA",
235
+ "AAAAGT",
236
+ "AGAAAT",
237
+ "CTGAAA",
238
+ "TTGTTG",
239
+ "GCTGTT",
240
+ "CAGCGG",
241
+ "CCACCA",
242
+ "TAAAGC",
243
+ "CTAAAA",
244
+ "AAGCAA",
245
+ "CCGCTG",
246
+ "TTCGCC",
247
+ "CTTTTA",
248
+ "TTGCCG",
249
+ "AACAAT",
250
+ "AATTGC",
251
+ "TTGCTT",
252
+ "CTGCCG",
253
+ "GCTTTA",
254
+ "TTGAAG",
255
+ "GCAATT",
256
+ "GATATT",
257
+ "AGCAAT",
258
+ "AATATC",
259
+ "TTTTAG",
260
+ "ATTGCT",
261
+ "CCATTT",
262
+ "AGCAGG",
263
+ "CGCCAT",
264
+ "TTGCCA",
265
+ "TAAAAG",
266
+ "CAGCCA",
267
+ "CGGCAG",
268
+ "AACTTT",
269
+ "GGCAAA",
270
+ "CAGCTT",
271
+ "CTGTTT",
272
+ "ATAATT",
273
+ "AAGCTG",
274
+ "GGCGAA",
275
+ "ATATTG",
276
+ "CAATAT",
277
+ "CAAATT",
278
+ "TGCGCC",
279
+ "TGGCAA",
280
+ "ATCATC",
281
+ "AAACAG",
282
+ "TGGCTG",
283
+ "AATTAT",
284
+ "TCAATA",
285
+ "CATTAA",
286
+ "TTTCTG",
287
+ "CGTTTT",
288
+ "ATTGTT",
289
+ "CCAGGC",
290
+ "CAGAAA",
291
+ "TTTGCC",
292
+ "GCCTGG",
293
+ "ATCATT",
294
+ "TCACCA",
295
+ "AATTTG",
296
+ "TATAAA",
297
+ "TATCAA",
298
+ "AATCAT",
299
+ "GGTAAA",
300
+ "AAAACG",
301
+ "GCCAGG",
302
+ "TTTATA",
303
+ "CCAGCC",
304
+ "CATCAT",
305
+ "AAATGG",
306
+ "TTGATA",
307
+ "TTTACC",
308
+ "ACCGCC",
309
+ "AAATTC",
310
+ "CCGGCA",
311
+ "CTGGAA",
312
+ "AAAGTT",
313
+ "TTTCCA",
314
+ "GGCGCA",
315
+ "TTCCAG",
316
+ "AATGAT",
317
+ "TATTGA",
318
+ "CAGCAC",
319
+ "ATGATT",
320
+ "TTAATG",
321
+ "ATGGCG",
322
+ "CTTTTG",
323
+ "TGGTGG",
324
+ "CCTGGC",
325
+ "CAGCAT",
326
+ "ATCACC",
327
+ "GAATTT",
328
+ "TGGAAA",
329
+ "GCATCA",
330
+ "TGCCGG",
331
+ "GATGAT",
332
+ "ACCAAA",
333
+ "TGGTGA",
334
+ "TTAATA",
335
+ "GCTGCG",
336
+ "GGCTGG",
337
+ "ATGATG",
338
+ "TTTAAC",
339
+ "GCCGCT",
340
+ "CGCAGC",
341
+ "TTCACC",
342
+ "TATTAA",
343
+ "TTATCA",
344
+ "ATGCTG",
345
+ "GTTAAA",
346
+ "CAAAAC",
347
+ "AGCGGC",
348
+ "AGCTTT",
349
+ "CAATCA",
350
+ "CAAAAG",
351
+ "TGAATA",
352
+ "TGCAAA",
353
+ "GGCCAG",
354
+ "AAAGCT",
355
+ "CTGGCC",
356
+ "TATTCA",
357
+ "TGATAA",
358
+ "GTGCTG",
359
+ "AAATGC",
360
+ "GGTGAT",
361
+ "AAACTT",
362
+ "AACTGG",
363
+ "GCATTT",
364
+ "GCTTCA",
365
+ "TGCAGC",
366
+ "TGATGC",
367
+ "GTTTTG",
368
+ "ACGCCG",
369
+ "TAAATC",
370
+ "GCCATC",
371
+ "AACCAA",
372
+ "ATTCAA",
373
+ "GCTGCC",
374
+ "CGCTGA",
375
+ "GCCATT",
376
+ "GGCGGT",
377
+ "CCAAAA",
378
+ "CAGGCC",
379
+ "CCATCA",
380
+ "CCAGTT",
381
+ "CTTCAT",
382
+ "ATCGGC",
383
+ "TTTTGG",
384
+ "CTGCAA",
385
+ "TCTTCT",
386
+ "TCAGCG",
387
+ "GGCAGC",
388
+ "CAATTG",
389
+ "TCAACA",
390
+ "CATAAA",
391
+ "ATATTC",
392
+ "TTGAAT",
393
+ "ACCACC",
394
+ "GGTGAA",
395
+ "TTCAAC",
396
+ "AATGGC",
397
+ "TTTGCA",
398
+ "ACCAAT",
399
+ "GCTGCA",
400
+ "CATCGC",
401
+ "TGATTG",
402
+ "GATTTA",
403
+ "GCCGAT",
404
+ "TTGCAG",
405
+ "TTTGGT",
406
+ "ATGAAG",
407
+ "CGGCGT",
408
+ "GAACAA",
409
+ "TCGGCA",
410
+ "TTCTGC",
411
+ "GATGGC",
412
+ "CGCTTT",
413
+ "TGAAGC",
414
+ "ATCAAC",
415
+ "AAGTTT",
416
+ "TTGGTT",
417
+ "TTGTTC",
418
+ "AAAGCC",
419
+ "GGCCTG",
420
+ "AAATAC",
421
+ "ATTGGT",
422
+ "GAATAT",
423
+ "ACCTGC",
424
+ "GTTGAT",
425
+ "ATCTTC",
426
+ "GGCTTT",
427
+ "GTTGAA",
428
+ "GCAAAT",
429
+ "GGCAAT",
430
+ "TGTTGA",
431
+ "ATTCAT",
432
+ "CCTGCA",
433
+ "AACGCC",
434
+ "ACAGCA",
435
+ "GCAGAA",
436
+ "GCACCA",
437
+ "AATTCA",
438
+ "TGCCGA",
439
+ "TGATCA",
440
+ "CTGCTT",
441
+ "CACCGC",
442
+ "GCCAAA",
443
+ "ATTTGC",
444
+ "TGCTGT",
445
+ "ATTGCC",
446
+ "TTTATG",
447
+ "GTATTT",
448
+ "AAACTG",
449
+ "GCAGGT",
450
+ "TGTTCA",
451
+ "GCGATG",
452
+ "AACAAC",
453
+ "CGCCAC",
454
+ "CGCCAA",
455
+ "TGATGG",
456
+ "ATTCTT",
457
+ "TCAAAT",
458
+ "CAGTTT",
459
+ "TGGTGC",
460
+ "TGAATT",
461
+ "GCAACA",
462
+ "TGAACA",
463
+ "TTTCAC",
464
+ "ATGAAT",
465
+ "AGAAGA",
466
+ "GTTTTC",
467
+ "GAAAAC",
468
+ "CGGCAT",
469
+ "ATGCCG",
470
+ "GAAGAT",
471
+ "TGCAGG",
472
+ "GCGCCT",
473
+ "AAAGCG",
474
+ "CATCAG",
475
+ "GGTGGT",
476
+ "CACCAC",
477
+ "GGCGCC",
478
+ "TGTTGC",
479
+ "TTTGGC",
480
+ "AGCAAC",
481
+ "CCAGCT",
482
+ "ATTTGA",
483
+ "TGCCAG",
484
+ "TTAAAC",
485
+ "ACCATT",
486
+ "TCATTA",
487
+ "ATAAAC",
488
+ "CGCCGT",
489
+ "AGCTGG",
490
+ "CTGGCA",
491
+ "TAATGA",
492
+ "CCTGAA",
493
+ "CTGATG",
494
+ "TAATTG",
495
+ "CAGCGA",
496
+ "GTTGTT",
497
+ "AAATCT",
498
+ "GCGGTG",
499
+ "GCCACC",
500
+ "GTTTAA",
501
+ "CTTTCA",
502
+ "ACTGGC"
503
+ ],
504
+ "class_names": [
505
+ "Acinetobacter baumannii",
506
+ "Enterococcus faecalis",
507
+ "Enterococcus faecium",
508
+ "Escherichia coli",
509
+ "Klebsiella pneumoniae",
510
+ "Pseudomonas aeruginosa",
511
+ "Salmonella enterica",
512
+ "Staphylococcus aureus"
513
+ ],
514
+ "task_type": "multiclass",
515
+ "target": "organism",
516
+ "k": 6,
517
+ "max_features": 500,
518
+ "n_samples": 862,
519
+ "n_features": 500,
520
+ "n_classes": 8
521
+ }
data_processed/ncbi/ncbi_organism_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b15343b098e58dcf7910efddf82428330af16e7ce2034c49432cc319a7c0905
3
+ size 1512
data_processed/ncbi/ncbi_organism_y_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f148b0e42811eb9625f603512a27a83a5a8d45d295f53653557e131c887353e5
3
+ size 4944
data_processed/ncbi/ncbi_organism_y_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dc91c81d6df415c92791afd9b70aa6187bd8d6d1fab3cc706b21392519acf053
3
+ size 824
data_processed/patric/patric_cefoxitin_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8454ba894ae919d8e0e2fab310826aeac8925abbec1e19ed2f6a428373747968
3
+ size 144128
data_processed/patric/patric_cefoxitin_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d22d6ec277e2c4361861b74f8c8117da02f4d1de194c471d33582ee3223b04bd
3
+ size 492128
data_processed/patric/patric_cefoxitin_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6d30e5536b9b743982ba94c0a88938b086fef20931403583581411507645c45b
3
+ size 72128
data_processed/patric/patric_cefoxitin_metadata.json ADDED
@@ -0,0 +1,515 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "ATTTTT",
4
+ "AAAAAT",
5
+ "TTTAAA",
6
+ "TTTTTA",
7
+ "TAAAAA",
8
+ "TATTTT",
9
+ "TTAAAA",
10
+ "TTTTAA",
11
+ "TTTATT",
12
+ "TTTAAT",
13
+ "AAAATA",
14
+ "ATTAAA",
15
+ "AATAAA",
16
+ "AAAAAA",
17
+ "TTATTT",
18
+ "TTTTTT",
19
+ "TTTTAT",
20
+ "ATTATT",
21
+ "AAATAA",
22
+ "ATAAAA",
23
+ "ATTAAT",
24
+ "AATAAT",
25
+ "ATATTT",
26
+ "TAAAAT",
27
+ "ATTTTA",
28
+ "ATTTAA",
29
+ "AAAATT",
30
+ "AAATAT",
31
+ "TTAAAT",
32
+ "TTAATT",
33
+ "AATATT",
34
+ "AATTTT",
35
+ "AATTAA",
36
+ "ATTTAT",
37
+ "GAAAAA",
38
+ "TGAAAA",
39
+ "TTTTCA",
40
+ "TTTTTC",
41
+ "ATAAAT",
42
+ "TTATTA",
43
+ "AAATTA",
44
+ "TAATTT",
45
+ "TAATAA",
46
+ "CATTTT",
47
+ "AAAATG",
48
+ "AAAAAG",
49
+ "CTTTTT",
50
+ "AAATTT",
51
+ "TATTTA",
52
+ "TATTAA",
53
+ "TTAATA",
54
+ "TGATTT",
55
+ "TAAATA",
56
+ "AAATCA",
57
+ "AATTTA",
58
+ "TAAATT",
59
+ "TTCTTT",
60
+ "TTTATC",
61
+ "AAAGAA",
62
+ "AAATGA",
63
+ "GATAAA",
64
+ "AATGAT",
65
+ "TCATTT",
66
+ "ATCATT",
67
+ "GAAAAT",
68
+ "AATTAT",
69
+ "TTTCTT",
70
+ "ATTGAT",
71
+ "TTTCAA",
72
+ "TTGAAA",
73
+ "TTTCAT",
74
+ "ATTTTC",
75
+ "ATCAAT",
76
+ "AAGAAA",
77
+ "ATGAAA",
78
+ "ATAATT",
79
+ "TTCATT",
80
+ "TTCAAT",
81
+ "TTTATA",
82
+ "ATTGAA",
83
+ "TATAAA",
84
+ "TTTGAT",
85
+ "TATTAT",
86
+ "AATCAT",
87
+ "TGAAAT",
88
+ "ATAATA",
89
+ "TTTGTT",
90
+ "ATGATT",
91
+ "AAAATC",
92
+ "TCATCA",
93
+ "TAATAT",
94
+ "TGATGA",
95
+ "TGATAA",
96
+ "TTGATT",
97
+ "CAAAAA",
98
+ "TTTTTG",
99
+ "CATTAA",
100
+ "TTAATG",
101
+ "AATGAA",
102
+ "TTATCA",
103
+ "TCTTTT",
104
+ "ATTTCA",
105
+ "GATTTT",
106
+ "ATATTA",
107
+ "AGAAAA",
108
+ "TTTTCT",
109
+ "TGTTTT",
110
+ "TTTGAA",
111
+ "ATCAAA",
112
+ "GTTTTT",
113
+ "TTCAAA",
114
+ "TCTTTA",
115
+ "TTGTTT",
116
+ "CGCCAG",
117
+ "AAATTG",
118
+ "AAAAGA",
119
+ "AATATA",
120
+ "TATATT",
121
+ "AAAAAC",
122
+ "CTGGCG",
123
+ "CAAAAT",
124
+ "TAAAGA",
125
+ "ATTTTG",
126
+ "ATTGTT",
127
+ "AAAACA",
128
+ "AACAAA",
129
+ "AATCAA",
130
+ "CTTTAT",
131
+ "CAATTT",
132
+ "TTTTGA",
133
+ "ATCTTT",
134
+ "CTTTAA",
135
+ "TCATTA",
136
+ "AATATC",
137
+ "AACAAT",
138
+ "AAAGAT",
139
+ "TTGATA",
140
+ "ATAAAG",
141
+ "ATATAA",
142
+ "GATATT",
143
+ "TCAAAA",
144
+ "TTATAT",
145
+ "AAACAA",
146
+ "TAATGA",
147
+ "TATCAA",
148
+ "CATCAT",
149
+ "GCCAGC",
150
+ "GATGAA",
151
+ "TTAAAG",
152
+ "CAATAT",
153
+ "TTGATG",
154
+ "TTGTTG",
155
+ "TTCATC",
156
+ "GCTGGC",
157
+ "CATCAA",
158
+ "AATTGA",
159
+ "ATGATG",
160
+ "TTTTGT",
161
+ "GATTTA",
162
+ "TAAATC",
163
+ "TTTAAC",
164
+ "GAAATT",
165
+ "TCAATT",
166
+ "ATATTG",
167
+ "ACAAAA",
168
+ "AATTTC",
169
+ "TCAATA",
170
+ "ATCATC",
171
+ "GTTAAA",
172
+ "TAATTA",
173
+ "TTCTTC",
174
+ "ATTTCT",
175
+ "ATTATC",
176
+ "CATTAT",
177
+ "ATAATG",
178
+ "GATGAT",
179
+ "AATGTT",
180
+ "TATTGA",
181
+ "CCAGCA",
182
+ "CAACAA",
183
+ "CAGCAG",
184
+ "AACATT",
185
+ "TTATTG",
186
+ "TTATAA",
187
+ "CAATAA",
188
+ "CTGCTG",
189
+ "GATAAT",
190
+ "GCAAAA",
191
+ "TGATAT",
192
+ "TTTTAC",
193
+ "GAAGAA",
194
+ "CCAGCG",
195
+ "GTAAAA",
196
+ "CATTTA",
197
+ "TTTTGC",
198
+ "AGAAAT",
199
+ "ATTTGA",
200
+ "AATTTG",
201
+ "TAAAAC",
202
+ "ATTCAT",
203
+ "ATATCA",
204
+ "TAAATG",
205
+ "ATGAAT",
206
+ "CGCTGG",
207
+ "ATTTGT",
208
+ "TTGCTG",
209
+ "TTGAAT",
210
+ "TGAATT",
211
+ "TGCTGG",
212
+ "CAGCAA",
213
+ "CAAATT",
214
+ "GTTTTA",
215
+ "AATTCA",
216
+ "TAATTG",
217
+ "ATTCAA",
218
+ "TGATTA",
219
+ "CTGAAA",
220
+ "ACATTT",
221
+ "TCAAAT",
222
+ "TTAACA",
223
+ "CAATTA",
224
+ "ATGTTT",
225
+ "ATATAT",
226
+ "TGTTAA",
227
+ "AAATGT",
228
+ "TTTCAG",
229
+ "TATCAT",
230
+ "ATGATA",
231
+ "TATTCA",
232
+ "TTATCT",
233
+ "AATTGT",
234
+ "TAATCA",
235
+ "GCTTTT",
236
+ "AATATG",
237
+ "AAACAT",
238
+ "TCTTCA",
239
+ "CATATT",
240
+ "TGTTGA",
241
+ "TGAAGA",
242
+ "GTTGAT",
243
+ "CAGCGC",
244
+ "ACTTTT",
245
+ "ATCAAC",
246
+ "TGAATA",
247
+ "AAAAGT",
248
+ "TCAGCA",
249
+ "TGCTTT",
250
+ "GTTAAT",
251
+ "AAAAGC",
252
+ "AACTTT",
253
+ "ACAAAT",
254
+ "ACAATT",
255
+ "TATAAT",
256
+ "TCAACA",
257
+ "GCATTT",
258
+ "CATAAA",
259
+ "CCATTT",
260
+ "ATATTC",
261
+ "GAATTT",
262
+ "TTTACC",
263
+ "GGTAAA",
264
+ "ATAATC",
265
+ "TGCTGA",
266
+ "GTATTT",
267
+ "AAAGTT",
268
+ "GTTATT",
269
+ "GATTAT",
270
+ "AGATAA",
271
+ "ATTAAC",
272
+ "TTCAGC",
273
+ "GAATAT",
274
+ "AAATTC",
275
+ "GCTGAA",
276
+ "GCGCTG",
277
+ "ATAAAC",
278
+ "CTTCTT",
279
+ "TATCTT",
280
+ "AATAAC",
281
+ "GTTTAT",
282
+ "AAGAAG",
283
+ "AAATAC",
284
+ "TTTATG",
285
+ "AATTGC",
286
+ "AAAGCA",
287
+ "GCAGCA",
288
+ "CTTTTA",
289
+ "TGCTGC",
290
+ "ATTGCT",
291
+ "CGTTTT",
292
+ "ATTATA",
293
+ "TATTTC",
294
+ "GCAATT",
295
+ "CTAAAA",
296
+ "TGGTTT",
297
+ "TATTTG",
298
+ "AAATGC",
299
+ "TTAAAC",
300
+ "CTTCAA",
301
+ "TTGTTA",
302
+ "TTTTAG",
303
+ "GATTAA",
304
+ "CGCCGC",
305
+ "TAAAAG",
306
+ "AAACCA",
307
+ "AAGATA",
308
+ "TCGCCA",
309
+ "TGATTG",
310
+ "TTGAAG",
311
+ "GTTGAA",
312
+ "AAATGG",
313
+ "GTTTAA",
314
+ "TTAATC",
315
+ "CAATCA",
316
+ "ATCAGC",
317
+ "GCGGCG",
318
+ "ATCGCC",
319
+ "GTAAAT",
320
+ "GAAATA",
321
+ "TGTTTA",
322
+ "GCTGAT",
323
+ "AAAACG",
324
+ "ATTTAC",
325
+ "CTTCAT",
326
+ "TAACAA",
327
+ "GCATTA",
328
+ "GCATCA",
329
+ "TACTTT",
330
+ "GTTGTT",
331
+ "TTGTAA",
332
+ "TTCAAC",
333
+ "GCTTTA",
334
+ "TAATGC",
335
+ "AGCAAT",
336
+ "TGGCGA",
337
+ "ACCATT",
338
+ "TAAACA",
339
+ "CAACAT",
340
+ "TGATGC",
341
+ "ACCAGC",
342
+ "TAATGT",
343
+ "TGATGT",
344
+ "GGCGAT",
345
+ "CATCTT",
346
+ "TTCATA",
347
+ "CACCAG",
348
+ "ACATCA",
349
+ "CAAATA",
350
+ "ACTTTA",
351
+ "ATCTTC",
352
+ "CATTTG",
353
+ "TCATAT",
354
+ "TTTGTA",
355
+ "CTGTTT",
356
+ "AAGATG",
357
+ "CCGCCA",
358
+ "AACAAC",
359
+ "CAATTG",
360
+ "GCTGGT",
361
+ "TTTCAC",
362
+ "GCGCCA",
363
+ "TTACAA",
364
+ "ATGAAG",
365
+ "TCCAGC",
366
+ "TACAAA",
367
+ "AAAACT",
368
+ "TGTTGT",
369
+ "TAATTC",
370
+ "TAAAGC",
371
+ "GAAGAT",
372
+ "ATGTTG",
373
+ "ATATGA",
374
+ "TATGAA",
375
+ "ACATTA",
376
+ "TGGCGG",
377
+ "AATGGT",
378
+ "AAATCT",
379
+ "AAAGTA",
380
+ "TTTGCT",
381
+ "ATTATG",
382
+ "CTGGTG",
383
+ "TGGCGC",
384
+ "AGATTT",
385
+ "AGTTTT",
386
+ "TGTAAT",
387
+ "GAATTA",
388
+ "AGCAAA",
389
+ "TTATCG",
390
+ "CGATAA",
391
+ "CAGAAA",
392
+ "TTATTC",
393
+ "TGGTGA",
394
+ "TTGCTT",
395
+ "TTTCTG",
396
+ "CATAAT",
397
+ "GTGAAA",
398
+ "CAGCAT",
399
+ "GCCATT",
400
+ "TAAAGT",
401
+ "AAACAG",
402
+ "TCACCA",
403
+ "GCTGGA",
404
+ "TGCAAT",
405
+ "CAAATG",
406
+ "ATTTGC",
407
+ "TTTTCC",
408
+ "ATCACC",
409
+ "GGAAAA",
410
+ "TCATAA",
411
+ "ATCATA",
412
+ "ATTACA",
413
+ "ACAACA",
414
+ "ATGCTG",
415
+ "CCATCA",
416
+ "AATACA",
417
+ "TATTGC",
418
+ "TTACTT",
419
+ "CATCAG",
420
+ "TGCCAG",
421
+ "TTTGCC",
422
+ "GCAAAT",
423
+ "TAATAC",
424
+ "CTGGCA",
425
+ "GCAATA",
426
+ "GTTTCA",
427
+ "GGCAAA",
428
+ "TTGTTC",
429
+ "AACATC",
430
+ "CGCTTT",
431
+ "TGTATT",
432
+ "AATGGC",
433
+ "TGTTGC",
434
+ "AATTAC",
435
+ "GATGTT",
436
+ "GAATAA",
437
+ "TATTGT",
438
+ "ACAATA",
439
+ "AATACC",
440
+ "GCTGTT",
441
+ "TATGAT",
442
+ "TTATGA",
443
+ "CTGATG",
444
+ "TGGCAA",
445
+ "TGCATT",
446
+ "CTTTCA",
447
+ "TTTCCA",
448
+ "CGCCAT",
449
+ "ATTGCA",
450
+ "TTGCCA",
451
+ "TGATGG",
452
+ "AATCTT",
453
+ "GGTATT",
454
+ "AAAGCG",
455
+ "AAAACC",
456
+ "ATTCTT",
457
+ "GTAATT",
458
+ "TTAACT",
459
+ "GTAATA",
460
+ "TGTAAA",
461
+ "GGTGAA",
462
+ "GGTGAT",
463
+ "GTATTA",
464
+ "TTTACT",
465
+ "ATGTTA",
466
+ "TGAAAG",
467
+ "ATGGCG",
468
+ "TTTACA",
469
+ "TATTAC",
470
+ "AATGCA",
471
+ "AGCATT",
472
+ "CATTGA",
473
+ "TAACTT",
474
+ "AGTTAA",
475
+ "TGTTCA",
476
+ "AAGTAA",
477
+ "CATTTC",
478
+ "GCAACA",
479
+ "TCATCT",
480
+ "AGATGA",
481
+ "CTGATT",
482
+ "TTCACC",
483
+ "TTCCAG",
484
+ "TTGCCG",
485
+ "GGTTTT",
486
+ "GGCGGC",
487
+ "GCCGCC",
488
+ "AATCAG",
489
+ "ATTACT",
490
+ "AAGATT",
491
+ "AGATAT",
492
+ "AAACTT",
493
+ "TGTTAT",
494
+ "AAACTG",
495
+ "AACAGC",
496
+ "AAGTTT",
497
+ "TAACAT",
498
+ "AAGCAA",
499
+ "CACCAT",
500
+ "TCTTCT",
501
+ "CCACCA",
502
+ "CTGGAA"
503
+ ],
504
+ "class_names": [
505
+ "Resistant",
506
+ "Susceptible"
507
+ ],
508
+ "task_type": "binary",
509
+ "antibiotic": "cefoxitin",
510
+ "k": 6,
511
+ "max_features": 500,
512
+ "n_samples": 177,
513
+ "n_features": 500,
514
+ "n_classes": 2
515
+ }
data_processed/patric/patric_cefoxitin_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e71a2e93bfd80dd8cf7121ac173ef50b39cad4880636996a71ed2e88365cba44
3
+ size 416
data_processed/patric/patric_cefoxitin_y_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:67f5fb06d9c46cc9d6bcb85b2cf0c4f996ead893676c840af84703b3ee5b1f10
3
+ size 1112
data_processed/patric/patric_cefoxitin_y_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:483ffcdec578185306133f00629561f277b95ac930e534f47de7d3489851d221
3
+ size 272
data_processed/patric/patric_ciprofloxacin_X_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c5ffb232d90b9318edac49958a520054b5a2071183a6229a1fc2338675faa80
3
+ size 204128
data_processed/patric/patric_ciprofloxacin_X_train.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:34673e4d5674a53ef5e5e789ab581bdfda6f2d67880ff77fd99cbb2cf4f03b7b
3
+ size 700128
data_processed/patric/patric_ciprofloxacin_X_val.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:47299679a94433999a9b166c9547fdd76cc2b670293e909a92c769e2db6a4b6d
3
+ size 104128
data_processed/patric/patric_ciprofloxacin_metadata.json ADDED
@@ -0,0 +1,515 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "feature_names": [
3
+ "AAAAAT",
4
+ "ATTTTT",
5
+ "TTTAAA",
6
+ "TTTTTA",
7
+ "TAAAAA",
8
+ "TTAAAA",
9
+ "TTTTAA",
10
+ "TATTTT",
11
+ "AAAAAA",
12
+ "TTTATT",
13
+ "AAAATA",
14
+ "TTTAAT",
15
+ "ATTAAA",
16
+ "TTTTTT",
17
+ "AATAAA",
18
+ "TTATTT",
19
+ "AAATAA",
20
+ "TTTTAT",
21
+ "ATAAAA",
22
+ "AAAATT",
23
+ "ATTTAA",
24
+ "TTAAAT",
25
+ "TAAAAT",
26
+ "ATTATT",
27
+ "ATTTTA",
28
+ "AATTTT",
29
+ "ATATTT",
30
+ "AATAAT",
31
+ "AATATT",
32
+ "AAATAT",
33
+ "TTAATT",
34
+ "ATTAAT",
35
+ "AATTAA",
36
+ "ATTTAT",
37
+ "GAAAAA",
38
+ "TGAAAA",
39
+ "AAATTA",
40
+ "TTTTCA",
41
+ "TTTTTC",
42
+ "AAATTT",
43
+ "ATAAAT",
44
+ "TAATTT",
45
+ "AAAAAG",
46
+ "CTTTTT",
47
+ "TTATTA",
48
+ "CATTTT",
49
+ "TATTTA",
50
+ "AAAATG",
51
+ "TAATAA",
52
+ "TAAATA",
53
+ "TGATTT",
54
+ "TATTAA",
55
+ "AAATCA",
56
+ "TTAATA",
57
+ "AATTTA",
58
+ "TAAATT",
59
+ "TTCTTT",
60
+ "AAAGAA",
61
+ "AAATGA",
62
+ "TCATTT",
63
+ "TTTCTT",
64
+ "TTGAAA",
65
+ "TTTCAA",
66
+ "AAGAAA",
67
+ "CAAAAA",
68
+ "TTTATC",
69
+ "TTTTTG",
70
+ "ATTGAT",
71
+ "GAAAAT",
72
+ "GATAAA",
73
+ "AATTAT",
74
+ "ATCAAT",
75
+ "TTTATA",
76
+ "AATGAT",
77
+ "TTCAAT",
78
+ "TTTCAT",
79
+ "ATTTTC",
80
+ "TATAAA",
81
+ "ATGAAA",
82
+ "ATTGAA",
83
+ "TTTGAT",
84
+ "ATCATT",
85
+ "ATAATT",
86
+ "TTCATT",
87
+ "TGAAAT",
88
+ "GTTTTT",
89
+ "AAAATC",
90
+ "AAAAAC",
91
+ "AAATTG",
92
+ "TCTTTT",
93
+ "TTTGAA",
94
+ "TTTGTT",
95
+ "TTTTGA",
96
+ "TGTTTT",
97
+ "CAAAAT",
98
+ "ATTTTG",
99
+ "AGAAAA",
100
+ "TTCAAA",
101
+ "AAAAGA",
102
+ "GATTTT",
103
+ "TTTTCT",
104
+ "CTTTAA",
105
+ "TTGATT",
106
+ "ATTTCA",
107
+ "TCAAAA",
108
+ "ATCAAA",
109
+ "CAATTT",
110
+ "AATGAA",
111
+ "CATTAA",
112
+ "TTAATG",
113
+ "TCTTTA",
114
+ "AATCAT",
115
+ "AAAACA",
116
+ "ATGATT",
117
+ "TGATGA",
118
+ "TATTAT",
119
+ "TTGTTT",
120
+ "TCATCA",
121
+ "TAAAGA",
122
+ "TTAAAG",
123
+ "ATAATA",
124
+ "TAATAT",
125
+ "TGATAA",
126
+ "ATATTA",
127
+ "AACAAA",
128
+ "CTTTAT",
129
+ "AATCAA",
130
+ "TTATCA",
131
+ "AAACAA",
132
+ "AATATA",
133
+ "ATAAAG",
134
+ "ATCTTT",
135
+ "TATATT",
136
+ "ATTGTT",
137
+ "AAAGAT",
138
+ "AATTGA",
139
+ "CAATAT",
140
+ "AACAAT",
141
+ "AATATC",
142
+ "TTGATA",
143
+ "GATATT",
144
+ "GAAATT",
145
+ "TTGATG",
146
+ "TCAATT",
147
+ "ATATTG",
148
+ "CATCAA",
149
+ "TCATTA",
150
+ "GATTTA",
151
+ "TTTTGT",
152
+ "ATATAA",
153
+ "AATTTC",
154
+ "TTATAT",
155
+ "TATCAA",
156
+ "TTGTTG",
157
+ "TAATGA",
158
+ "TAAAAC",
159
+ "TTTAAC",
160
+ "ACAAAA",
161
+ "TAAATC",
162
+ "GCAAAA",
163
+ "GATGAA",
164
+ "TTTTGC",
165
+ "TCAATA",
166
+ "GTTTTA",
167
+ "TTCATC",
168
+ "GTTAAA",
169
+ "TTATTG",
170
+ "CAATAA",
171
+ "CATCAT",
172
+ "ATGATG",
173
+ "TATTGA",
174
+ "GTAAAA",
175
+ "TTTTAC",
176
+ "AATTTG",
177
+ "ATTTCT",
178
+ "CAACAA",
179
+ "GCTTTT",
180
+ "CATTTA",
181
+ "TAAATG",
182
+ "CAAATT",
183
+ "ATTTGA",
184
+ "TTGCTG",
185
+ "AAAAGC",
186
+ "AGAAAT",
187
+ "TTATAA",
188
+ "TGCTTT",
189
+ "TAATTA",
190
+ "CAGCAA",
191
+ "TTCTTC",
192
+ "TTGAAT",
193
+ "ATTCAA",
194
+ "ATAATG",
195
+ "AATGTT",
196
+ "GAAGAA",
197
+ "CGCCAG",
198
+ "AACTTT",
199
+ "CATTAT",
200
+ "AAAGCA",
201
+ "ATTCAT",
202
+ "TCAAAT",
203
+ "TGAATT",
204
+ "ATGAAT",
205
+ "AACATT",
206
+ "TAATTG",
207
+ "AAAGTT",
208
+ "ATCATC",
209
+ "CTGGCG",
210
+ "ACTTTT",
211
+ "AAAAGT",
212
+ "CAATTA",
213
+ "AATTCA",
214
+ "TATTCA",
215
+ "ATTTGT",
216
+ "ATTATC",
217
+ "GATGAT",
218
+ "CTAAAA",
219
+ "CTTTTA",
220
+ "TGATAT",
221
+ "AATTGC",
222
+ "TTTTAG",
223
+ "TGAATA",
224
+ "CAGCAG",
225
+ "CTGAAA",
226
+ "TGGTTT",
227
+ "GCTTTA",
228
+ "TGAAGA",
229
+ "TAAAAG",
230
+ "CTGCTG",
231
+ "ATATCA",
232
+ "GATAAT",
233
+ "TCTTCA",
234
+ "GCAATT",
235
+ "GCCAGC",
236
+ "AAACCA",
237
+ "ATGTTT",
238
+ "GTATTT",
239
+ "GCTGGC",
240
+ "AATATG",
241
+ "GCATTT",
242
+ "CATATT",
243
+ "CCATTT",
244
+ "TTTCAG",
245
+ "ACATTT",
246
+ "TGTTGA",
247
+ "TGATTA",
248
+ "ATTGCT",
249
+ "GGTAAA",
250
+ "TAAAGC",
251
+ "AAATGT",
252
+ "TTTACC",
253
+ "CATAAA",
254
+ "AAATAC",
255
+ "AAACAT",
256
+ "CCAGCA",
257
+ "AAAACT",
258
+ "TTAAAC",
259
+ "TCAGCA",
260
+ "AATTGT",
261
+ "TTATCT",
262
+ "TTGAAG",
263
+ "ATATTC",
264
+ "GAATTT",
265
+ "CTTCAA",
266
+ "GTTTAA",
267
+ "AGTTTT",
268
+ "TGCTGA",
269
+ "TCAACA",
270
+ "AAATTC",
271
+ "TTTATG",
272
+ "GAATAT",
273
+ "ACAAAT",
274
+ "TTAACA",
275
+ "ATATAT",
276
+ "AAATGC",
277
+ "AAGAAG",
278
+ "ACAATT",
279
+ "AAATGG",
280
+ "TAATCA",
281
+ "CTTCTT",
282
+ "AGCAAT",
283
+ "TGTTAA",
284
+ "TATTTG",
285
+ "GTTGAT",
286
+ "TGCTGG",
287
+ "GTTATT",
288
+ "AGATAA",
289
+ "GTTTAT",
290
+ "GTAAAT",
291
+ "GCTGAA",
292
+ "TTCAGC",
293
+ "ATAAAC",
294
+ "TTTGCT",
295
+ "ATCAAC",
296
+ "TTGCTT",
297
+ "ACTTTA",
298
+ "AATAAC",
299
+ "ATTTAC",
300
+ "CAATTG",
301
+ "GTTGAA",
302
+ "TGCTGC",
303
+ "AGCAAA",
304
+ "GTTAAT",
305
+ "GCAGCA",
306
+ "TACTTT",
307
+ "TATTTC",
308
+ "ATGATA",
309
+ "TATCTT",
310
+ "TATCAT",
311
+ "TGATTG",
312
+ "CAAATA",
313
+ "TTCAAC",
314
+ "GATTAT",
315
+ "ATAATC",
316
+ "TGTTTA",
317
+ "GAAATA",
318
+ "CAATCA",
319
+ "ATTAAC",
320
+ "CGTTTT",
321
+ "AAGATA",
322
+ "ACCATT",
323
+ "TAAAGT",
324
+ "CATTTG",
325
+ "AAATCT",
326
+ "GCATTA",
327
+ "TGCAAT",
328
+ "AAAGTA",
329
+ "AGATTT",
330
+ "TTGTAA",
331
+ "AAGTTT",
332
+ "AAACTT",
333
+ "GATTAA",
334
+ "CTTCAT",
335
+ "TAATGC",
336
+ "GTTGTT",
337
+ "AAAACG",
338
+ "TAAACA",
339
+ "CCAGCG",
340
+ "AATGGT",
341
+ "TATAAT",
342
+ "TTAATC",
343
+ "AAGCAA",
344
+ "ATTGCA",
345
+ "AAGATG",
346
+ "TTTGTA",
347
+ "GCATCA",
348
+ "TTGTTA",
349
+ "CATCTT",
350
+ "TACAAA",
351
+ "AAAACC",
352
+ "TTACAA",
353
+ "CGCTGG",
354
+ "ATTTGC",
355
+ "GCAAAT",
356
+ "CAAATG",
357
+ "TGCAAA",
358
+ "CTGTTT",
359
+ "TGATGC",
360
+ "ATGAAG",
361
+ "TTACTT",
362
+ "CAACAT",
363
+ "TTCATA",
364
+ "AACAAC",
365
+ "GGTTTT",
366
+ "TATTGC",
367
+ "TCTAAA",
368
+ "TATGAA",
369
+ "ACCAAT",
370
+ "TAACAA",
371
+ "TTATTC",
372
+ "GCAATA",
373
+ "ATTATA",
374
+ "TTTGCA",
375
+ "TGATGT",
376
+ "ATTGGT",
377
+ "ATGTTG",
378
+ "TTTAGA",
379
+ "TAATTC",
380
+ "TGTTGT",
381
+ "TTTCAC",
382
+ "AATTAC",
383
+ "TGGCAA",
384
+ "AAGTAA",
385
+ "TGGTGA",
386
+ "TTTGGT",
387
+ "GCTGAT",
388
+ "TGCATT",
389
+ "ATCTTC",
390
+ "GAATTA",
391
+ "ATCAGC",
392
+ "TGTAAT",
393
+ "AAACAG",
394
+ "GAAGAT",
395
+ "TGTAAA",
396
+ "TTGCCA",
397
+ "TTGTTC",
398
+ "TAATGT",
399
+ "ACATCA",
400
+ "TTTACT",
401
+ "GAATAA",
402
+ "CAGCGC",
403
+ "TTTACA",
404
+ "TCATAT",
405
+ "ATTATG",
406
+ "TGTATT",
407
+ "GTGAAA",
408
+ "AATACA",
409
+ "CAGAAA",
410
+ "ATATGA",
411
+ "GTTTTG",
412
+ "AATACC",
413
+ "AATCTT",
414
+ "AATGCA",
415
+ "TTTCTG",
416
+ "ATTCTT",
417
+ "TTAACT",
418
+ "GTAATT",
419
+ "TCATAA",
420
+ "TCACCA",
421
+ "AGTTAA",
422
+ "ATTTAG",
423
+ "ACATTA",
424
+ "CAGCAT",
425
+ "GGTATT",
426
+ "TGTTGC",
427
+ "AGCATT",
428
+ "GCCATT",
429
+ "ATTACA",
430
+ "ACCAGC",
431
+ "TATTGT",
432
+ "GCTAAA",
433
+ "AGTAAA",
434
+ "TTTGCC",
435
+ "ACCAAA",
436
+ "TTTAGC",
437
+ "AATGGC",
438
+ "ATGCTG",
439
+ "AAGATT",
440
+ "GCTGGT",
441
+ "GGCAAA",
442
+ "TTATGA",
443
+ "ACAATA",
444
+ "ACAACA",
445
+ "CTTTTG",
446
+ "CATAAT",
447
+ "TAACTT",
448
+ "CTAAAT",
449
+ "CAAAAC",
450
+ "AAGAAT",
451
+ "TCGCCA",
452
+ "GAACAA",
453
+ "TGTTCA",
454
+ "GGAAAA",
455
+ "GTTTCA",
456
+ "AATGCT",
457
+ "GCGCTG",
458
+ "CACCAG",
459
+ "TTTTCC",
460
+ "CAAAAG",
461
+ "TAATAC",
462
+ "AAACTG",
463
+ "TTTCCA",
464
+ "TTGCAA",
465
+ "GCTGTT",
466
+ "GCAACA",
467
+ "AAGTTA",
468
+ "TGGTAA",
469
+ "TTCTAA",
470
+ "TTAGAA",
471
+ "GGTGAA",
472
+ "CAGTTT",
473
+ "CTGGTG",
474
+ "AGATGA",
475
+ "ATTACT",
476
+ "AGCTTT",
477
+ "GTAATA",
478
+ "TGGCGA",
479
+ "CTATTT",
480
+ "TGAACA",
481
+ "TATTAC",
482
+ "GTATTA",
483
+ "ATCGCC",
484
+ "TCATCT",
485
+ "CTTTCA",
486
+ "CTGCAA",
487
+ "TTTAGT",
488
+ "CCATCA",
489
+ "ATCATA",
490
+ "TGAAAG",
491
+ "TTATCG",
492
+ "TTCACC",
493
+ "GATGTT",
494
+ "TGGAAA",
495
+ "AAAGCT",
496
+ "CGCTTT",
497
+ "AACATC",
498
+ "CGATAA",
499
+ "TTGCAG",
500
+ "TGATGG",
501
+ "TATGAT",
502
+ "CTGATT"
503
+ ],
504
+ "class_names": [
505
+ "Resistant",
506
+ "Susceptible"
507
+ ],
508
+ "task_type": "binary",
509
+ "antibiotic": "ciprofloxacin",
510
+ "k": 6,
511
+ "max_features": 500,
512
+ "n_samples": 252,
513
+ "n_features": 500,
514
+ "n_classes": 2
515
+ }
data_processed/patric/patric_ciprofloxacin_y_test.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9ef7a315e5aa72527355c6c990864def6a8f3414622deeae52a570161339410e
3
+ size 536