ProcessDSL + FlowCell-10 Proposal

This proposal outlines a pilot initiative to integrate the "genome as program" concept and cellular process flowcharting into the Virtual Cell project. The goal is to formalize biological processes as executable, interpretable programs that can be learned, simulated, and manipulated by AI.

1. ProcessDSL Specification

ProcessDSL is a domain-specific language for representing cellular processes. It compiles human-readable flowcharts into machine-executable forms such as stochastic rule systems, Petri nets, or hybrid ODE/event simulators.

Key features:

2. FlowCell-10 Pilot Dataset

FlowCell-10 is a curated set of ten well-characterized yeast pathways, each represented as:

Example pathways:

  1. Glycolysis
  2. TOR nutrient sensing pathway
  3. Heat shock response
  4. Autophagy initiation
  5. Unfolded protein response (UPR)
  6. Cell cycle G1/S transition
  7. Mitochondrial respiration control
  8. Amino acid biosynthesis regulation
  9. Gluconeogenesis
  10. Alcoholic fermentation

3. Example ProcessDSL (Glycolysis)

process Glycolysis in Cytosol: state: [Glucose, G6P, F6P, F16BP, G3P, DHAP, PEP, Pyruvate, ATP, ADP, NAD+, NADH] rule Hexokinase: Glucose + ATP -> G6P + ADP [guard: ATP>θ1] rule PFK: F6P + ATP -> F16BP + ADP [guard: ATP<θ2 & AMP>θ3] rule Aldolase: F16BP -> G3P + DHAP rule TPI: DHAP <-> G3P rule PyruvateKinase: PEP + ADP -> Pyruvate + ATP [allosteric: F16BP activates] event GlucosePulse(t=0..T): inflow rate r_in

4. Expanded Glycolysis Flowchart

Below is an example from FlowCell-10 showing Glycolysis in Yeast with branch and loop structure:

Glycolysis Pathway in Yeast

5. Deliverables

6. Benefits to the Virtual Cell Project

This proposal demonstrates how computational biology and artificial intelligence can converge to create interpretable, executable models of biological systems.