Rename: this is the twin-stack app layer (was gemeo-world-model). World model now at Raras-AI/gemeo-world-model.
Browse files
README.md
CHANGED
|
@@ -43,42 +43,60 @@ extra_gated_fields:
|
|
| 43 |
I agree to non-clinical research use only: checkbox
|
| 44 |
---
|
| 45 |
|
| 46 |
-
# GEMEO β
|
| 47 |
|
| 48 |
-
>
|
| 49 |
-
>
|
| 50 |
-
>
|
| 51 |
-
>
|
|
|
|
| 52 |
|
| 53 |
**Authors:** Raras.ai team Β· **Contact:** dimas@raras.ai
|
| 54 |
**Source:** github companion repo (raras.org)
|
| 55 |
**Paper v1 (Zenodo):** [DOI 10.5281/zenodo.20092131](https://doi.org/10.5281/zenodo.20092131)
|
| 56 |
-
**
|
| 57 |
-
**
|
| 58 |
**License:** CC-BY-NC 4.0 + non-clinical-use rider (see [LICENSE](./LICENSE))
|
| 59 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 60 |
---
|
| 61 |
|
| 62 |
## What this is
|
| 63 |
|
| 64 |
-
|
| 65 |
-
|
| 66 |
-
|
| 67 |
-
|
| 68 |
-
|
|
|
|
|
|
|
| 69 |
|
| 70 |
```
|
| 71 |
-
GEMEO
|
| 72 |
βββ Patient embedding (gemeo/encoder.py) HGT scaffolded; bootstrap = weighted KG embedding
|
| 73 |
βββ Cohort retrieval (gemeo/cohort.py) kNN + Cypher overlap on PrimeKG
|
| 74 |
βββ Subgraph reasoning (gemeo/subgraph.py) 1-hop sparsification (learned variant in train/)
|
| 75 |
-
βββ Trajectory
|
|
|
|
|
|
|
| 76 |
βββ Risk / survival (gemeo/risk.py) NeuralSurv trained on DATASUS SIM (c-index 0.70)
|
| 77 |
βββ Drug repurposing (gemeo/repurpose.py) TxGNN slot + SUS auxiliary head
|
| 78 |
-
βββ Counterfactual (gemeo/whatif.py) do-calculus mutator
|
| 79 |
βββ Active learning (gemeo/ask.py) info-gain over KG annotations
|
| 80 |
βββ SUS grounding (gemeo/ground_sus.py) PCDT/CEAF/UF coverage check
|
| 81 |
βββ FastAPI (gemeo/api.py) production /api/gemeo/* endpoints
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 82 |
```
|
| 83 |
|
| 84 |
---
|
|
@@ -93,7 +111,7 @@ GEMEO World Model
|
|
| 93 |
| | timeout rate | 0.0% | |
|
| 94 |
| **Risk / survival** (NeuralSurv, DATASUS SIM) | val c-index | **0.70** (best @ ep 30) | 4,624 SIM mortality records, 37,494 censoring samples |
|
| 95 |
| | final c-index | 0.694 | after 100 epochs |
|
| 96 |
-
| **
|
| 97 |
| | calibration ICI | 0.0006 | well calibrated |
|
| 98 |
| | training | 5.8 min on 1ΓH100 | $0.48 |
|
| 99 |
|
|
@@ -118,7 +136,7 @@ twin.trajectory.horizons # 6/12/24-month event predictions
|
|
| 118 |
```
|
| 119 |
|
| 120 |
Backbone: bootstrap LLM-over-natural-history today, [Causal Diffusion
|
| 121 |
-
Forcing](https://huggingface.co/Raras-AI/gemeo-
|
| 122 |
checkpoint is mounted.
|
| 123 |
|
| 124 |
### 2 Β· Diagnosis
|
|
@@ -207,9 +225,11 @@ LICENSE
|
|
| 207 |
README.md β this file
|
| 208 |
```
|
| 209 |
|
| 210 |
-
The
|
| 211 |
-
|
| 212 |
-
(
|
|
|
|
|
|
|
| 213 |
|
| 214 |
---
|
| 215 |
|
|
@@ -314,8 +334,9 @@ world model, multimodal substrate.
|
|
| 314 |
|
| 315 |
## Changelog
|
| 316 |
|
| 317 |
-
- **2026-05 (this release)** β Initial public release of the
|
| 318 |
-
- **2026-05** β
|
|
|
|
| 319 |
- **2026-04** β GEMEO v1 paper published on Zenodo (DOI 10.5281/zenodo.20092131).
|
| 320 |
|
| 321 |
---
|
|
|
|
| 43 |
I agree to non-clinical research use only: checkbox
|
| 44 |
---
|
| 45 |
|
| 46 |
+
# GEMEO Twin Stack β Application Layer for the GEMEO Patient World Model
|
| 47 |
|
| 48 |
+
> The graph-native **digital-twin application stack** that runs on top of
|
| 49 |
+
> the [GEMEO World Model](https://huggingface.co/Raras-AI/gemeo-world-model).
|
| 50 |
+
> Six inference modes β trajectory, diagnosis, risk/survival,
|
| 51 |
+
> counterfactual, repurposing, cohort β wired together with auxiliary heads,
|
| 52 |
+
> KG embeddings, and a FastAPI surface. Research preview. **Not a medical device.**
|
| 53 |
|
| 54 |
**Authors:** Raras.ai team Β· **Contact:** dimas@raras.ai
|
| 55 |
**Source:** github companion repo (raras.org)
|
| 56 |
**Paper v1 (Zenodo):** [DOI 10.5281/zenodo.20092131](https://doi.org/10.5281/zenodo.20092131)
|
| 57 |
+
**π World Model (the dynamics core this stack runs on top of):** [Raras-AI/gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model)
|
| 58 |
+
**π± Mobile decision-support sibling:** [Raras-AI/araras-gemma4](https://huggingface.co/Raras-AI/araras-gemma4)
|
| 59 |
**License:** CC-BY-NC 4.0 + non-clinical-use rider (see [LICENSE](./LICENSE))
|
| 60 |
|
| 61 |
+
> **Note on naming.** Previously released as `Raras-AI/gemeo-world-model`
|
| 62 |
+
> (HF auto-redirects). Renamed to `gemeo-twin-stack` because the actual
|
| 63 |
+
> world model β the generative dynamics core β is the Causal Diffusion
|
| 64 |
+
> Forcing transformer now living at the new `gemeo-world-model` slug. This
|
| 65 |
+
> repo is the **application layer** (encoder, cohort, risk, whatif,
|
| 66 |
+
> repurpose, ask, ground_sus, api) that turns the world model into a
|
| 67 |
+
> usable digital twin with six inference modes.
|
| 68 |
+
|
| 69 |
---
|
| 70 |
|
| 71 |
## What this is
|
| 72 |
|
| 73 |
+
This is the **application layer** of GEMEO β a module (β22k LOC) that
|
| 74 |
+
takes the GEMEO World Model (a Causal Diffusion Forcing transformer,
|
| 75 |
+
published separately at [Raras-AI/gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model))
|
| 76 |
+
and wires it together with auxiliary heads, KG embeddings, and tooling
|
| 77 |
+
to produce a complete digital-twin product. Each inference mode has a
|
| 78 |
+
clean Python API, a bootstrap implementation that runs today, and an
|
| 79 |
+
optional learned slot that drops in when a checkpoint exists.
|
| 80 |
|
| 81 |
```
|
| 82 |
+
GEMEO Twin Stack (this repo)
|
| 83 |
βββ Patient embedding (gemeo/encoder.py) HGT scaffolded; bootstrap = weighted KG embedding
|
| 84 |
βββ Cohort retrieval (gemeo/cohort.py) kNN + Cypher overlap on PrimeKG
|
| 85 |
βββ Subgraph reasoning (gemeo/subgraph.py) 1-hop sparsification (learned variant in train/)
|
| 86 |
+
βββ Trajectory mode (gemeo/trajectory.py) β calls into the GEMEO World Model
|
| 87 |
+
βββ Diagnostic mode (deeprare_diagnosis.py + β multi-agent over PrimeKG paths
|
| 88 |
+
β fast_dx.py, ensemble)
|
| 89 |
βββ Risk / survival (gemeo/risk.py) NeuralSurv trained on DATASUS SIM (c-index 0.70)
|
| 90 |
βββ Drug repurposing (gemeo/repurpose.py) TxGNN slot + SUS auxiliary head
|
| 91 |
+
βββ Counterfactual (gemeo/whatif.py) do-calculus mutator; uses world-model rollout
|
| 92 |
βββ Active learning (gemeo/ask.py) info-gain over KG annotations
|
| 93 |
βββ SUS grounding (gemeo/ground_sus.py) PCDT/CEAF/UF coverage check
|
| 94 |
βββ FastAPI (gemeo/api.py) production /api/gemeo/* endpoints
|
| 95 |
+
|
| 96 |
+
GEMEO World Model (separate repo: Raras-AI/gemeo-world-model)
|
| 97 |
+
βββ Causal Diffusion Forcing transformer (19.86M params)
|
| 98 |
+
The generative dynamics core. Trajectory + counterfactual modes
|
| 99 |
+
above delegate to it.
|
| 100 |
```
|
| 101 |
|
| 102 |
---
|
|
|
|
| 111 |
| | timeout rate | 0.0% | |
|
| 112 |
| **Risk / survival** (NeuralSurv, DATASUS SIM) | val c-index | **0.70** (best @ ep 30) | 4,624 SIM mortality records, 37,494 censoring samples |
|
| 113 |
| | final c-index | 0.694 | after 100 epochs |
|
| 114 |
+
| **World Model backbone** ([gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model)) | val cross-entropy | 0.030 | per-token diffusion forcing |
|
| 115 |
| | calibration ICI | 0.0006 | well calibrated |
|
| 116 |
| | training | 5.8 min on 1ΓH100 | $0.48 |
|
| 117 |
|
|
|
|
| 136 |
```
|
| 137 |
|
| 138 |
Backbone: bootstrap LLM-over-natural-history today, [Causal Diffusion
|
| 139 |
+
Forcing](https://huggingface.co/Raras-AI/gemeo-world-model) when the CDF
|
| 140 |
checkpoint is mounted.
|
| 141 |
|
| 142 |
### 2 Β· Diagnosis
|
|
|
|
| 225 |
README.md β this file
|
| 226 |
```
|
| 227 |
|
| 228 |
+
The GEMEO World Model (the Causal Diffusion Forcing dynamics core) is
|
| 229 |
+
published separately at
|
| 230 |
+
[**Raras-AI/gemeo-world-model**](https://huggingface.co/Raras-AI/gemeo-world-model)
|
| 231 |
+
(~80 MB) β keep that repo for the heavy ckpt, this repo for the
|
| 232 |
+
application stack that runs on top of it.
|
| 233 |
|
| 234 |
---
|
| 235 |
|
|
|
|
| 334 |
|
| 335 |
## Changelog
|
| 336 |
|
| 337 |
+
- **2026-05 (this release)** β Initial public release of the twin-stack application layer + NeuralSurv ckpt (c-index 0.70) + DT-FM baseline + PrimeKG-derived embeddings + RareBench v49 results.
|
| 338 |
+
- **2026-05-19** β Renamed from `Raras-AI/gemeo-world-model` β `Raras-AI/gemeo-twin-stack` (HF auto-redirects). The slug `gemeo-world-model` now hosts the actual Causal Diffusion Forcing world model.
|
| 339 |
+
- **2026-05** β GEMEO World Model v2 (CDF backbone) published at [Raras-AI/gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model).
|
| 340 |
- **2026-04** β GEMEO v1 paper published on Zenodo (DOI 10.5281/zenodo.20092131).
|
| 341 |
|
| 342 |
---
|