microbe-model / scripts

Commit History

Deploy app from main@a3254bf (no paper/ binaries)
0ed74db
Running

Miyu Horiuchi commited on

Add unified strain catalog (100K rows w/ provenance) + selective weak supervision for pH
4c18dfd

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Add MediaDive-derived features (medium pH, NaCl, n_media) — all 4 targets improve
5df9ef8

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Fix _derive_salt to pick optimum entries — salt MAE 2.52 → 2.17 (-13.7%)
56b0c4e

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Expand training corpus to 46K strains: species-name → NCBI genome + isolation features
f0f1d93

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

UI prep: pre-train phenotype heads + pre-score uncultured media
d23315e

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

v2 results: ESM-2 t6 (8M, 20-protein sample) loses to v1 hand-crafted features on all 4 phenotype targets
8800528

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Tier 1.5: uncertainty quantification via quantile regression in recommend.py
1ebf56f

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

v2 scaffolding: ESM-2 embedding extraction + GPU runner doc
8c28a61

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Phase E #2: scripts/recommend.py — single-genome → ranked media + phenotype CLI
31110fe

Miyu Horiuchi commited on

Phase E modeling: per-medium classifiers + recommender training script
d3cbd87

Miyu Horiuchi commited on

Phase E scaffolding: MediaDive integration + strain↔medium links
3d34be9

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Fix GTDB column names + accession resolution for v226 metadata schema
0fbea89

Miyu Horiuchi commited on

Phase C scaffolding: GTDB candidate selection + uncultured prediction
30e65bc

Miyu Horiuchi commited on

Final cleanup: sync OVERNIGHT_SUMMARY.md + fix size display for small files
6b52ab8

Miyu Horiuchi commited on

Make OVERNIGHT_SUMMARY.md write atomic (avoid race with regen loop)
de9e822

Miyu Horiuchi commited on

Fix predictions parquet type mix + plumb feature_cols through eval
bbbea9d

Miyu Horiuchi commited on

Harden post-featurize chain: each phase runs even if previous fails
65e8e0f

Miyu Horiuchi commited on

Eval report enhancements: TL;DR + per-strain predictions + per-family error
4b79970

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Add eval report generator + training table persistence + group-col override
d082ced

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Streaming fetch+featurize pipeline + 6× pyrodigal speedup + GCA version resolution
383bb62

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Rewrite BacDive client for v2 public API (no auth required)
6c30d74

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on

Scaffold v0: BacDive + NCBI ingestion, genome feature extractor, XGBoost baseline
52cf5ab

Miyu Horiuchi Claude Opus 4.7 (1M context) commited on