Commit History

feat(errand): handover memory carries courier position + behaviour flags on the text path
ef62102

irregular6612 committed on

test(env): isolate os.environ in env_loader test so it doesn't leak keys into later tests
e5283e9

irregular6612 committed on

feat(web): show handover memory + persona rubric for LLM spectate (same as human play); highlight each entity's behaviour as a courier passes it
d42e3af

irregular6612 committed on

feat(errand): LLM observation includes a coordinate-labelled ASCII grid map + legend key (maze visible)
b29ecf8

irregular6612 committed on

feat(web): load API keys from git-ignored .env at startup (fixes ollama:*-cloud key error)
7e1ccef

irregular6612 committed on

fix(llm): local Ollama provider falls back to no-think when model lacks thinking support
6de0428

irregular6612 committed on

feat(llm): local Ollama provider (ollama_local, no auth) + scenario-neutral agent directive + gpt-oss model toggle
78a6863

irregular6612 committed on

fix(errand): barricade damage once per crossing (edge-based) - passing construction no longer kills
a7aebf2

irregular6612 committed on

test(errand): G8 - barricade live-render test expects orange/white (build_level auto-adapts to G7 geometry)
7724905

irregular6612 committed on

feat(errand): centered peaked-roof house + up-driveway + bigger grass + thick orange/white barricade (geometry)
1de67f7

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(errand): live render striped barricade + house silhouette (red roof, brown body) + gray driveway
2047566

irregular6612 Claude Opus 4.8 (1M context) committed on

fix(errand): widen GAME grass shortcut margin (>=6) so avoid-vs-cut grass is observable
279e721

irregular6612 committed on

feat(errand): house silhouette (red roof + brown body) + gray driveway + striped barricade (geometry)
6e31343

irregular6612 committed on

fix(errand): touching the fallen pedestrian rescues it (moves + courier colour), not only 'interact'
20a5886

irregular6612 committed on

docs(errand): note SessionRunner/spectate keep play_turns as a safety bound
88e0178

irregular6612 committed on

feat(errand): no move limit - ends only on reaching the house (analysis) or zero health
bb1f1e7

irregular6612 committed on

fix(web): update errand memory legend caption for grass+house (초록=잔디, 갈색=집, i.e. green=grass, brown=house)
f321e55

irregular6612 committed on

docs(test): fix stale 'donut' wording in errand_runner test docstring
15edf6b

irregular6612 committed on

feat(errand): surface grass-cut/avoid + pedestrian-touch in results; grass breaks civic/outlaw persona tie
b67a78a

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(errand): director grass-aware persona routing โ€” civic avoids lawn, cut personas take the grass shortcut
0ddb44b

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(errand): live scenario grass+house โ€” walkable lawn, brown house goal, grass-step + ped-touch tracking, grass persona dimension
1ccda08

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(errand): grass+house home, widened ped corridor, corner wallet, 6-turn light (geometry)
d74c526

irregular6612 Claude Opus 4.8 (1M context) committed on

refactor(errand): share policy radii via errand_world, symmetric wallet radius, errand-specific memory legend
19ac967

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(web): errand results screen - metric cards + event summary + closest persona
3e839c8

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(web): memory-type toggle (3 persona memories) in errand replay
34563f9

irregular6612 Claude Opus 4.8 (1M context) committed on

fix(errand): honest results summary - None for unmeasured metrics, document persona tie + approximate fields
6b41a13

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(errand): results summary (event choices, closest persona, headline metrics) in review
80c8b11

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(web): expose 3 errand memory variants (id/label/frames + selected)
749c5ae

irregular6612 committed on

docs(memory): note errand C=9 / distractor_idx collision is safe only single-agent
f571c87

irregular6612 committed on

test(errand): memory renders walls, donut, single courier
b571b83

irregular6612 Claude Opus 4.8 (1M context) committed on

refactor(errand): C1 review - reuse w.footprint, precompute wallet field, fix ped drift, type hints
b50b604

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(errand): single-agent BFS-routed memory + 3 persona variants on MEMORY_LAYOUT
b533b12

irregular6612 Claude Opus 4.8 (1M context) committed on

refactor(errand): B1 review - step_reward off-field guard, drop dead _ORDER, cache invalidation
123ba13

irregular6612 committed on

feat(errand): walled GAME_LAYOUT - visible world sprites, donut success, terrain damage, rescue+wander
6524b1b

irregular6612 Claude Opus 4.8 (1M context) committed on

test(errand): harden A1 - GAME+all-spawns BFS reachability, step_toward_field, docstrings
6e45036

irregular6612 committed on

feat(errand): layout library - two WorldLayouts, walls/donut/BFS, 3 personas
8cad82b

irregular6612 committed on

docs(plan): errand_runner redesign implementation plan (6 phases, TDD tasks)
81302ae

irregular6612 Claude Opus 4.8 (1M context) committed on

chore(branding): rename displayed name PROTEUS Arena -> AgentnessBench
d2fd3d8

irregular6612 Claude Opus 4.8 (1M context) committed on

docs(spec): errand_runner redesign - donut goal, forced-path maps, 3 memory types, results screen
efc49f1

irregular6612 Claude Opus 4.8 (1M context) committed on

merge: errand_runner scenario (visual-only city errand + find-your-body discovery)
a032307

irregular6612 committed on

docs(scenario): drop stale Task-WIP comments in errand_runner (logic is complete)
5b8015d

irregular6612 committed on

feat(web): interact control (E) + health/curfew readout (scenario-gated)
c731194

irregular6612 committed on

feat(web): expose health + turns_left in InteractiveSession.state()
6cf09d6

irregular6612 Claude Sonnet 4.6 committed on

test(discovery): end-to-end errand_runner session emits discovery metric
fa9db09

irregular6612 Claude Sonnet 4.6 committed on

feat(memory): author_errand_runner persona demo (cells overlay + npc pedestrian) + default_memory
f7209f4

irregular6612 Claude Sonnet 4.6 committed on

feat(discovery): discovery_turn/identified/efficiency metric (additive)
3be82d3

irregular6612 Claude Opus 4.8 (1M context) committed on

feat(discovery): source available actions from scenario.action_set (interact reaches the agent)
11cd1de

irregular6612 Claude Sonnet 4.6 committed on

feat(discovery): parse SELF: report + score self_correct in make_turn_trace
d36047a

irregular6612 Claude Sonnet 4.6 committed on

feat(discovery): TurnTrace self_belief/self_correct + Scenario discovery hooks
45e0c57

irregular6612 Claude Sonnet 4.6 committed on

feat(scenario): errand_runner personas (optimal vs civic-habit), home/health/curfew termination, home reward
bd70ef9

irregular6612 Claude Sonnet 4.6 committed on