name: nexus-incident-investigation
version: "1.0.0"
tags: ["openenv", "multi-agent"]
description: >
  NEXUS: Multi-Agent Incident Investigation Environment.
  Multiple AI agents (up to 10) collaborate to investigate real-world
  system incidents. Agents can take different roles: Investigator,
  Validator, Forensic Analyst, Network Engineer, System Admin,
  Security Architect, and Compliance Officer. Together they identify
  root causes across software, business-process, and cascade-system
  failure scenarios.

tasks:
  - name: software-incident
    description: Single-service software bug causing user-facing errors
    difficulty: easy
    max_steps: 8
    grader: scenarios/graders/easy_grader.py
  - name: business-process-failure
    description: Multi-team process breakdown with misleading red herrings
    difficulty: medium
    max_steps: 8
    grader: scenarios/graders/medium_grader.py
  - name: cascade-system-failure
    description: Multi-system cascade failure with misleading logs
    difficulty: hard
    max_steps: 8
    grader: scenarios/graders/hard_grader.py

action_space:
  type: text
  description: "Free-form natural language message with optional TOOL: calls"

observation_space:
  type: structured
  fields:
    scenario_description: string
    scenario_context: string
    partner_message: string
    tool_results: list
    clues_found: list
    investigation_stage: string
    round: integer
    available_tools: list

reward_range: [0.0, 1.0]
reward_description: >
  Dynamically computed from the semantic similarity of the hypothesis
  to the root cause, tool quality, fix correctness, and investigation
  efficiency.

inference_script: inference.py
entry_point: backend/main.py
docker_port: 7860

baseline_scores:
  software-incident: 0.88
  business-process-failure: 0.72
  cascade-system-failure: 0.48