911 / docs /tasks.md
Abhinav31122006
feat: exploit analysis, architecture docs, observation depth, citation
0b2675d

Task Reference β€” 911 Dispatch Supervisor

Task 1: single_incident β€” Easy

Objective: Dispatch the correct unit to a single cardiac arrest and resolve it before the survival clock expires.

Initial State: 1 incident (cardiac arrest, P1), 3 units available (1 MEDIC, 1 ENGINE, 1 PATROL) Max Steps: 20 | Survival Clock: 600s

What a good agent does: Immediately dispatches MEDIC to the cardiac arrest. Does not dispatch ENGINE or PATROL (triage mismatch penalty).

What a bad agent does: Dispatches ENGINE (wrong unit type), wastes steps, patient survival clock expires β†’ Safety Gate β†’ score capped at 0.2.

Scoring: 50% resolution + 30% correct unit type + 20% response speed


Task 2: multi_incident β€” Medium

Objective: Triage 3 simultaneous incidents with competing priorities.

Initial State: 3 incidents (structure fire P2, cardiac arrest P1, shooting P1), 6 units available Max Steps: 40

What a good agent does: Immediately dispatches MEDIC to cardiac arrest and PATROL to shooting (both P1), then dispatches ENGINE to structure fire (P2).

What a bad agent does: Dispatches to the fire first (visible/dramatic but P2), leaving P1 incidents unattended β†’ Safety Gate.

Scoring: 50% P1 resolution + 30% overall resolution βˆ’ 20% escalation penalty


Task 3: mass_casualty β€” Hard

Objective: Manage a building collapse with surprise incident waves.

Initial State: 1 incident (building collapse P1, survival 480s), 7 units Max Steps: 60 Wave spawns: Step 5 β†’ structure fire; Step 12 β†’ 2Γ— cardiac arrests

What a good agent does: Responds to building collapse immediately, pre-stages units for anticipated waves, adapts when cardiac arrests spawn at step 12.

What a bad agent does: Commits all units to building collapse, has no available units when cardiac arrests spawn β†’ multiple P1 failures β†’ Safety Gate.

Scoring: 60% P1 survival + 30% mean step reward βˆ’ failure penalty


Task 4: shift_surge β€” Hard

Objective: Maintain coverage as units go out of service mid-shift.

Initial State: 5 units, 0 incidents (board starts empty) Max Steps: 60 | Wave spawn: Every 8 steps | Survival clock: 720s OOS events: 3 units go OUT_OF_SERVICE by step 5

What a good agent does: Anticipates resource scarcity, requests mutual aid early, stages remaining units strategically, prioritizes P1 incidents as board fills.

What a bad agent does: Dispatches all units freely in early steps, has no coverage when OOS events hit and new incidents spawn simultaneously.

Scoring: 35% resolution + 25% P1 survival + 15% coverage + 15% backlog + 10% step reward βˆ’ 25% escalation penalty