stGPT-Xenium L3 CNN v0.1

stGPT-Xenium is a Xenium-first morpho-molecular representation model series for spatial pathology research. This repository contains the first public model artifact in the series: L3-CNN-v0.1, a lightweight CNN-based pilot checkpoint for contour-level XeniumSlide learning.

stGPT learns the morpho-molecular tissue representation; spatho turns it into auditable spatial pathology evidence.

Release Status

  • Public research artifact.
  • Not a diagnostic model and not for clinical decision support.
  • First checkpoint in the stGPT-Xenium series, intended for reproducibility, interface testing, and transparent model-development tracking.
  • This is not a universal foundation model yet; stronger multi-case and slide-holdout models will be released as later versions.

What This Checkpoint Is

  • Model family: stGPT-Xenium
  • Release: L3-CNN-v0.1
  • Backend: lightweight CNN contour H&E encoder
  • Training unit: XeniumSlide contour/region-level image-gene-spatial learning
  • Representative checkpoint case: Xenium_Prime_Human_Prostate_FFPE_outs
  • Selected from a 39-case L3 morphology-ready A100 pilot where all cases completed successfully

A100 Pilot Summary

  • Uploaded/trained L3 cases: 39/39
  • Completed metrics: 39/39
  • Failed cases: 0
  • Checkpoint records: 273
  • Error rows: 0
  • Mean final validation loss across 39 cases: ~0.180
  • Mean final validation gene loss: ~0.00162
  • Mean final validation neighbor loss: ~0.0317

Selected checkpoint metrics:

  • last_val_loss: 0.1554184090346098
  • last_val_gene_loss: 0.0013655137008754
  • last_val_neighbor_loss: 0.0067438320256769
  • last_val_image_gene_loss: 3.0473387241363525

Files

  • checkpoint_best.pt: representative best checkpoint.
  • config.yaml: stGPT config used for this case on A100.
  • metrics.json: per-step training metrics for this checkpoint.
  • stgpt_model_manifest.json: provenance and checksums.
  • training_telemetry_summary.json: pilot-level telemetry summary.
  • case_metric_summary_all_cases.csv: aggregate training metrics for all 39 pilot cases.
  • checkpoint_manifest_all_cases.csv: checkpoint provenance for all pilot cases.

Evidence and Data Policy

A companion private evidence/provenance repository tracks telemetry, evaluation tables, ablations, and model-development evidence. Raw 10x-derived assets are not redistributed here:

  • no xenium_slide.zarr
  • no H&E patch PNGs
  • no OME-TIF or raw image files
  • no raw expression matrices

Users should obtain public 10x Genomics Xenium datasets from the original 10x dataset pages and rebuild local XeniumSlide artifacts under the applicable source terms.

Intended Use

Use this checkpoint for:

  • testing the stGPT model/package interface
  • reproducing the v0.1 checkpoint loading path
  • comparing against future stGPT-Xenium releases
  • research-only spatial transcriptomics and computational pathology method development

Do not use this checkpoint for:

  • diagnosis
  • treatment decisions
  • clinical reporting
  • claims of universal tissue or organ generalization

Limitations

  • Single representative checkpoint, not a multi-case global model.
  • CNN image encoder baseline; Virchow-precomputed and ablation evidence are tracked separately and will inform later releases.
  • Validation is pilot-level and per-case; slide-holdout and cross-case generalization are future release targets.
  • Public 10x source data must be accessed from original sources, not from this model repository.

Version Series

Planned naming convention:

  • stGPT-Xenium-L3-CNN-v0.1: first public pilot checkpoint, this repo.
  • stGPT-Xenium-L3-Virchow-v0.2: planned frozen pathology-encoder checkpoint.
  • stGPT-Xenium-Foundation-v1.0: future multi-case/slide-holdout foundation release.
Downloads last month
8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support