timmers commited on
Commit
dde49c7
Β·
verified Β·
1 Parent(s): 089d665

Rename: this is the twin-stack app layer (was gemeo-world-model). World model now at Raras-AI/gemeo-world-model.

Browse files
Files changed (1) hide show
  1. README.md +43 -22
README.md CHANGED
@@ -43,42 +43,60 @@ extra_gated_fields:
43
  I agree to non-clinical research use only: checkbox
44
  ---
45
 
46
- # GEMEO β€” Rare-Disease Patient World Model
47
 
48
- > A graph-native, continuously-evolving **digital twin** for rare-disease
49
- > patients. Six inference modes β€” trajectory, diagnosis, risk/survival,
50
- > counterfactual, repurposing, cohort β€” over the same PrimeKG-grounded,
51
- > SUS-anchored backbone. Research preview. **Not a medical device.**
 
52
 
53
  **Authors:** Raras.ai team Β· **Contact:** dimas@raras.ai
54
  **Source:** github companion repo (raras.org)
55
  **Paper v1 (Zenodo):** [DOI 10.5281/zenodo.20092131](https://doi.org/10.5281/zenodo.20092131)
56
- **Companion model (CDF backbone):** [Raras-AI/gemeo-cdf-v2](https://huggingface.co/Raras-AI/gemeo-cdf-v2)
57
- **Companion model (mobile decision-support):** [Raras-AI/araras-gemma4](https://huggingface.co/spaces/Raras-AI/araras-gemma4)
58
  **License:** CC-BY-NC 4.0 + non-clinical-use rider (see [LICENSE](./LICENSE))
59
 
 
 
 
 
 
 
 
 
60
  ---
61
 
62
  ## What this is
63
 
64
- GEMEO is not a single model. It is a **module** (β‰ˆ22k LOC) that wires
65
- together a heterogeneous-graph backbone with six inference modes for
66
- rare-disease patient digital twins. Each mode has a clean Python API, a
67
- bootstrap implementation that runs today, and an optional learned
68
- component that drops in when a checkpoint exists.
 
 
69
 
70
  ```
71
- GEMEO World Model
72
  β”œβ”€β”€ Patient embedding (gemeo/encoder.py) HGT scaffolded; bootstrap = weighted KG embedding
73
  β”œβ”€β”€ Cohort retrieval (gemeo/cohort.py) kNN + Cypher overlap on PrimeKG
74
  β”œβ”€β”€ Subgraph reasoning (gemeo/subgraph.py) 1-hop sparsification (learned variant in train/)
75
- β”œβ”€β”€ Trajectory (gemeo/trajectory.py) bootstrap LLM-over-natural-history; CDF backbone via gemeo-cdf-v2
 
 
76
  β”œβ”€β”€ Risk / survival (gemeo/risk.py) NeuralSurv trained on DATASUS SIM (c-index 0.70)
77
  β”œβ”€β”€ Drug repurposing (gemeo/repurpose.py) TxGNN slot + SUS auxiliary head
78
- β”œβ”€β”€ Counterfactual (gemeo/whatif.py) do-calculus mutator
79
  β”œβ”€β”€ Active learning (gemeo/ask.py) info-gain over KG annotations
80
  β”œβ”€β”€ SUS grounding (gemeo/ground_sus.py) PCDT/CEAF/UF coverage check
81
  └── FastAPI (gemeo/api.py) production /api/gemeo/* endpoints
 
 
 
 
 
82
  ```
83
 
84
  ---
@@ -93,7 +111,7 @@ GEMEO World Model
93
  | | timeout rate | 0.0% | |
94
  | **Risk / survival** (NeuralSurv, DATASUS SIM) | val c-index | **0.70** (best @ ep 30) | 4,624 SIM mortality records, 37,494 censoring samples |
95
  | | final c-index | 0.694 | after 100 epochs |
96
- | **Trajectory backbone** ([gemeo-cdf-v2](https://huggingface.co/Raras-AI/gemeo-cdf-v2)) | val cross-entropy | 0.030 | per-token diffusion forcing |
97
  | | calibration ICI | 0.0006 | well calibrated |
98
  | | training | 5.8 min on 1Γ—H100 | $0.48 |
99
 
@@ -118,7 +136,7 @@ twin.trajectory.horizons # 6/12/24-month event predictions
118
  ```
119
 
120
  Backbone: bootstrap LLM-over-natural-history today, [Causal Diffusion
121
- Forcing](https://huggingface.co/Raras-AI/gemeo-cdf-v2) when the CDF
122
  checkpoint is mounted.
123
 
124
  ### 2 Β· Diagnosis
@@ -207,9 +225,11 @@ LICENSE
207
  README.md ← this file
208
  ```
209
 
210
- The CDF trajectory backbone is published separately at
211
- [**Raras-AI/gemeo-cdf-v2**](https://huggingface.co/Raras-AI/gemeo-cdf-v2)
212
- (~80 MB) β€” keep that repo for the heavy ckpt, this repo for the wiring.
 
 
213
 
214
  ---
215
 
@@ -314,8 +334,9 @@ world model, multimodal substrate.
314
 
315
  ## Changelog
316
 
317
- - **2026-05 (this release)** β€” Initial public release of the world-model module + NeuralSurv ckpt (c-index 0.70) + DT-FM baseline + PrimeKG-derived embeddings + RareBench v49 results.
318
- - **2026-05** β€” CDF v2 trajectory backbone published at [Raras-AI/gemeo-cdf-v2](https://huggingface.co/Raras-AI/gemeo-cdf-v2).
 
319
  - **2026-04** β€” GEMEO v1 paper published on Zenodo (DOI 10.5281/zenodo.20092131).
320
 
321
  ---
 
43
  I agree to non-clinical research use only: checkbox
44
  ---
45
 
46
+ # GEMEO Twin Stack β€” Application Layer for the GEMEO Patient World Model
47
 
48
+ > The graph-native **digital-twin application stack** that runs on top of
49
+ > the [GEMEO World Model](https://huggingface.co/Raras-AI/gemeo-world-model).
50
+ > Six inference modes β€” trajectory, diagnosis, risk/survival,
51
+ > counterfactual, repurposing, cohort β€” wired together with auxiliary heads,
52
+ > KG embeddings, and a FastAPI surface. Research preview. **Not a medical device.**
53
 
54
  **Authors:** Raras.ai team Β· **Contact:** dimas@raras.ai
55
  **Source:** github companion repo (raras.org)
56
  **Paper v1 (Zenodo):** [DOI 10.5281/zenodo.20092131](https://doi.org/10.5281/zenodo.20092131)
57
+ **🌍 World Model (the dynamics core this stack runs on top of):** [Raras-AI/gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model)
58
+ **πŸ“± Mobile decision-support sibling:** [Raras-AI/araras-gemma4](https://huggingface.co/Raras-AI/araras-gemma4)
59
  **License:** CC-BY-NC 4.0 + non-clinical-use rider (see [LICENSE](./LICENSE))
60
 
61
+ > **Note on naming.** Previously released as `Raras-AI/gemeo-world-model`
62
+ > (HF auto-redirects). Renamed to `gemeo-twin-stack` because the actual
63
+ > world model β€” the generative dynamics core β€” is the Causal Diffusion
64
+ > Forcing transformer now living at the new `gemeo-world-model` slug. This
65
+ > repo is the **application layer** (encoder, cohort, risk, whatif,
66
+ > repurpose, ask, ground_sus, api) that turns the world model into a
67
+ > usable digital twin with six inference modes.
68
+
69
  ---
70
 
71
  ## What this is
72
 
73
+ This is the **application layer** of GEMEO β€” a module (β‰ˆ22k LOC) that
74
+ takes the GEMEO World Model (a Causal Diffusion Forcing transformer,
75
+ published separately at [Raras-AI/gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model))
76
+ and wires it together with auxiliary heads, KG embeddings, and tooling
77
+ to produce a complete digital-twin product. Each inference mode has a
78
+ clean Python API, a bootstrap implementation that runs today, and an
79
+ optional learned slot that drops in when a checkpoint exists.
80
 
81
  ```
82
+ GEMEO Twin Stack (this repo)
83
  β”œβ”€β”€ Patient embedding (gemeo/encoder.py) HGT scaffolded; bootstrap = weighted KG embedding
84
  β”œβ”€β”€ Cohort retrieval (gemeo/cohort.py) kNN + Cypher overlap on PrimeKG
85
  β”œβ”€β”€ Subgraph reasoning (gemeo/subgraph.py) 1-hop sparsification (learned variant in train/)
86
+ β”œβ”€β”€ Trajectory mode (gemeo/trajectory.py) ← calls into the GEMEO World Model
87
+ β”œβ”€β”€ Diagnostic mode (deeprare_diagnosis.py + ← multi-agent over PrimeKG paths
88
+ β”‚ fast_dx.py, ensemble)
89
  β”œβ”€β”€ Risk / survival (gemeo/risk.py) NeuralSurv trained on DATASUS SIM (c-index 0.70)
90
  β”œβ”€β”€ Drug repurposing (gemeo/repurpose.py) TxGNN slot + SUS auxiliary head
91
+ β”œβ”€β”€ Counterfactual (gemeo/whatif.py) do-calculus mutator; uses world-model rollout
92
  β”œβ”€β”€ Active learning (gemeo/ask.py) info-gain over KG annotations
93
  β”œβ”€β”€ SUS grounding (gemeo/ground_sus.py) PCDT/CEAF/UF coverage check
94
  └── FastAPI (gemeo/api.py) production /api/gemeo/* endpoints
95
+
96
+ GEMEO World Model (separate repo: Raras-AI/gemeo-world-model)
97
+ └── Causal Diffusion Forcing transformer (19.86M params)
98
+ The generative dynamics core. Trajectory + counterfactual modes
99
+ above delegate to it.
100
  ```
101
 
102
  ---
 
111
  | | timeout rate | 0.0% | |
112
  | **Risk / survival** (NeuralSurv, DATASUS SIM) | val c-index | **0.70** (best @ ep 30) | 4,624 SIM mortality records, 37,494 censoring samples |
113
  | | final c-index | 0.694 | after 100 epochs |
114
+ | **World Model backbone** ([gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model)) | val cross-entropy | 0.030 | per-token diffusion forcing |
115
  | | calibration ICI | 0.0006 | well calibrated |
116
  | | training | 5.8 min on 1Γ—H100 | $0.48 |
117
 
 
136
  ```
137
 
138
  Backbone: bootstrap LLM-over-natural-history today, [Causal Diffusion
139
+ Forcing](https://huggingface.co/Raras-AI/gemeo-world-model) when the CDF
140
  checkpoint is mounted.
141
 
142
  ### 2 Β· Diagnosis
 
225
  README.md ← this file
226
  ```
227
 
228
+ The GEMEO World Model (the Causal Diffusion Forcing dynamics core) is
229
+ published separately at
230
+ [**Raras-AI/gemeo-world-model**](https://huggingface.co/Raras-AI/gemeo-world-model)
231
+ (~80 MB) β€” keep that repo for the heavy ckpt, this repo for the
232
+ application stack that runs on top of it.
233
 
234
  ---
235
 
 
334
 
335
  ## Changelog
336
 
337
+ - **2026-05 (this release)** β€” Initial public release of the twin-stack application layer + NeuralSurv ckpt (c-index 0.70) + DT-FM baseline + PrimeKG-derived embeddings + RareBench v49 results.
338
+ - **2026-05-19** β€” Renamed from `Raras-AI/gemeo-world-model` β†’ `Raras-AI/gemeo-twin-stack` (HF auto-redirects). The slug `gemeo-world-model` now hosts the actual Causal Diffusion Forcing world model.
339
+ - **2026-05** β€” GEMEO World Model v2 (CDF backbone) published at [Raras-AI/gemeo-world-model](https://huggingface.co/Raras-AI/gemeo-world-model).
340
  - **2026-04** β€” GEMEO v1 paper published on Zenodo (DOI 10.5281/zenodo.20092131).
341
 
342
  ---