Spaces:
Running
Running
File size: 1,796 Bytes
0ed74db | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 | # External Tool Benchmark Status
This file tracks the apples-to-apples benchmark setup for external tools
on the same held-out BacDive/MediaDive strains used by the dry-lab media
recommender benchmark.
## Held-Out Manifest
- Manifest: `artifacts/external_benchmark_manifest.parquet`
- Rows: 25,728
- Unique genome accessions: 16,154
- Media labels retained: 40
- Fold counts: {"0": 5146, "1": 5146, "2": 5146, "3": 5145, "4": 5145}
Label coverage:
| Target | Labeled rows |
|---|---:|
| Temperature | 25,727 |
| pH | 2,984 |
| Salt | 2,486 |
| Oxygen | 9,283 |
| Medium | 21,050 |
## Local Requirements
- FASTA directory: `data/external_benchmark_fastas`
- FASTAs present: 8 / 16,154 (0.05%)
- FASTA download smoke run: {"attempted": 0, "downloaded": 0, "failed": 0}
| Tool | Local command | Status |
|---|---|---|
| GenomeSPOT | `uv run python -m genome_spot.genome_spot` | available |
| CarveMe | `uv run --with carveme carve` | available |
| gapseq | `` | missing |
## Verdict
External baseline execution is not ready on this machine yet: the full held-out FASTA set and one or more external tool binaries/databases are missing.
## Next Commands
Use the manifest to run each external tool against the same rows and folds.
The medium-feasibility tools should be scored by whether at least one known
MediaDive medium is feasible or closest among the tool's predicted feasible
media/metabolite environments.
```bash
PYTHONPATH=src uv run --python 3.11 python scripts/42_prepare_external_benchmarks.py \
--download-fastas 10
```
For the full benchmark, download the complete FASTA set into the FASTA
directory above, install the external tools plus their databases, then run
tool-specific inference using the `bacdive_id`, `fold`, and
`genome_accession` columns from the manifest.
|