Defensive fix: strip agent.cash=0 when starting_cash>0 in tmp YAML b77e43d yxc20098 committed on May 23
Quality drive: schema fix, 5 new/revised packs, 4 engine tests, scenario audit 6d71d3b yxc20098 committed on May 23
Engine-feature integration: 4 commands + 9 scenario packs + 4 test suites 20960c1 yxc20098 committed on May 23
feat(scenario): scout-cycle-keep-info-fresh — re-scout to detect mid-episode reinforcement 0090040 yxc20098 committed on May 21
feat(bench): forbidden_tools + tool_violations_gte — strict-toolban / procedural-compliance primitive (BFCL V4 / τ²-bench / IFBench anchor) e3e91b7 yxc20098 committed on May 20
Default spawn_mcvs:false — stop engine auto-seeding phantom MCVs f1ea367 yxc20098 committed on May 19
playback: append a terminal 'episode end' frame (the resolved win/loss board) 7dbeb97 yxc20098 committed on May 19
Structured-fog text mode, premium routing, codex descriptions, minimap colour-by-difficulty 93ee9dd yxc20098 committed on May 19
playback: save the SAME _minimap_v2 the model receives (fog-accumulating) — viewer now shows exactly what the model saw, not the legacy matplotlib render d7ba62a yxc20098 committed on May 19
Training-parity minimap (real terrain + legend) + viewer (system/thinking/debrief) 39fba02 yxc20098 committed on May 18
Playback: capture model reasoning + per-turn goal tracker + viewer f77eea7 yxc20098 committed on May 18
S7 bench: surrender tool + loss outcome (tool schema 1:1, 15==15) 09ac234 yxc20098 committed on May 18
Bench: economy scenario pack + full-loop integ test + starting_cash constraint dc028b6 yxc20098 committed on May 17
Add Rust-backed eval stack: scenario packs, adapter, spine, integration tests 098c6e0 yxc20098 committed on May 17