Spaces:

openenv-community
/

Sentinel

Sleeping

nihalaninihal Claude Opus 4.6 commited on Mar 8

Commit

fcf34b9

1 Parent(s): af292c9

Add run_demo_episode wrapper to demo.py for dict-based episode results

The verification test suite expects run_demo_episode(seed, trained) to
return a dict with 'scores' and 'trajectory' keys. The existing
run_episode() returned a plain tuple. Added run_demo_episode() as a thin
wrapper that calls run_episode() and repacks the result into the expected
dict format, enabling tests to access r['scores'] and r['trajectory']
without changing the internals of run_episode or run_comparison.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Files changed (1) hide show

sentinelops_arena/demo.py +25 -0

sentinelops_arena/demo.py CHANGED Viewed

@@ -431,6 +431,31 @@ def run_episode(
     return replay_log, final_scores
 def run_comparison(seed: int = 42, attacker_type: str = "randomized") -> Dict:
     """Run untrained vs trained worker comparison.

     return replay_log, final_scores
+def run_demo_episode(
+    trained: bool = False,
+    seed: int = 42,
+    attacker_type: str = "randomized",
+) -> Dict:
+    """Run a single demo episode and return a dict with ``scores`` and ``trajectory``.
+    This is a convenience wrapper around :func:`run_episode` that returns a
+    dictionary instead of a tuple so callers can use ``r["scores"]`` and
+    ``r["trajectory"]`` directly.
+    Args:
+        trained: Whether the worker agent uses trained (resilient) behaviour.
+        seed: Random seed for the environment and the randomised attacker.
+        attacker_type: ``"randomized"`` (default) or ``"scripted"`` (legacy).
+    Returns:
+        dict with keys:
+          - ``"scores"``    – final per-agent score dict
+          - ``"trajectory"`` – list of step dicts (the replay log)
+    """
+    trajectory, scores = run_episode(trained=trained, seed=seed, attacker_type=attacker_type)
+    return {"scores": scores, "trajectory": trajectory}
 def run_comparison(seed: int = 42, attacker_type: str = "randomized") -> Dict:
     """Run untrained vs trained worker comparison.