Before And After
blind trust
trained skepticism
Without SENTINEL
task3 random 0.666
All public slots start near the same trust. The orchestrator delegates with weak evidence.
A high-confidence specialist can slip poisoned output into a high-stakes node.
Downstream subtasks inherit bad state, so the mission drifts before anyone notices.
Detection stays weak and the agent cannot explain which public slot became dangerous.
Judge takeaway: good-looking multi-agent systems still fail if trust is static or role-based.
With SENTINEL
task3 heuristic 0.784
Behavior updates the TrustLedger after every step, so public slots diverge quickly.
When stakes rise and trust is shaky, the orchestrator switches from delegate to verify.
Adversarial attempts are detected before they cascade through the task graph.
Profile swap forces re-learning from evidence, proving skill instead of memorized identity.
Judge takeaway: this environment teaches oversight, recovery, and calibrated delegation under uncertainty.