Causal inference benchmark with adversarial self-play. An agent identifies hidden interventions in partially observable contamination graphs while an adversary adapts the difficulty.
reset() generates a unique procedural graph. No two episodes share the same topology or contamination pattern.
lot_relabel, mixing_event, record_deletion
inspect_node()trace_lot()cross_reference()request_lab_test()quarantine()finalize()reset() live: new graph, new hidden intervention sampled, unique topology every episode