AgentnessBench / tests /cli /test_cli.py

Commit History

refactor(scenario): delete predator_evade; template is the canonical scenario
93cd78f

irregular6612 Claude Opus 4.8 (1M context) commited on

refactor: restructure proteus into game/web subpackages
426093b

irregular6612 Claude Opus 4.8 (1M context) commited on

feat(cp6): CLI compare — aggregate human/LLM traces by (model, difficulty)
c8beea5

irregular6612 Claude Sonnet 4.6 commited on

fix(cp5): _cmd_play returns rc=2 on stdin EOF (no traceback, no partial-trace crash)
c817950

irregular6612 Claude Sonnet 4.6 commited on

feat(cp5): CLI replay --visual/--png/--fps (text stays default)
ff9d5a9

irregular6612 Claude Opus 4.8 (1M context) commited on

feat(cp5): CLI 'play' subcommand — human session via stdin
3f6f600

irregular6612 Claude Sonnet 4.6 commited on

feat(cp4): CLI (run / list-scenarios / replay) over fake + real providers
c318527

irregular6612 commited on