feat(errand): handover memory carries courier position + behaviour flags on the text path ef62102 irregular6612 committed 21 days ago
test(env): isolate os.environ in env_loader test so it doesn't leak keys into later tests e5283e9 irregular6612 committed 21 days ago
feat(web): show handover memory + persona rubric for LLM spectate (same as human play); highlight each entity's behaviour as a courier passes it d42e3af irregular6612 committed 21 days ago
feat(errand): LLM observation includes a coordinate-labelled ASCII grid map + legend key (maze visible) b29ecf8 irregular6612 committed 21 days ago
feat(web): load API keys from git-ignored .env at startup (fixes ollama:*-cloud key error) 7e1ccef irregular6612 committed 21 days ago
fix(llm): local Ollama provider falls back to no-think when model lacks thinking support 6de0428 irregular6612 committed 21 days ago
feat(llm): local Ollama provider (ollama_local, no auth) + scenario-neutral agent directive + gpt-oss model toggle 78a6863 irregular6612 committed 21 days ago
fix(errand): barricade damage once per crossing (edge-based) - passing construction no longer kills a7aebf2 irregular6612 committed 21 days ago
test(errand): G8 - barricade live-render test expects orange/white (build_level auto-adapts to G7 geometry) 7724905 irregular6612 committed 21 days ago
feat(errand): centered peaked-roof house + up-driveway + bigger grass + thick orange/white barricade (geometry) 1de67f7 irregular6612 Claude Opus 4.8 (1M context) committed 21 days ago
feat(errand): live render striped barricade + house silhouette (red roof, brown body) + gray driveway 2047566 irregular6612 Claude Opus 4.8 (1M context) committed 21 days ago
fix(errand): widen GAME grass shortcut margin (>=6) so avoid-vs-cut grass is observable 279e721 irregular6612 committed 21 days ago
feat(errand): house silhouette (red roof + brown body) + gray driveway + striped barricade (geometry) 6e31343 irregular6612 committed 21 days ago
fix(errand): touching the fallen pedestrian rescues it (moves + courier colour), not only 'interact' 20a5886 irregular6612 committed 22 days ago
docs(errand): note SessionRunner/spectate keep play_turns as a safety bound 88e0178 irregular6612 committed 22 days ago
feat(errand): no move limit - ends only on reaching the house (analysis) or zero health bb1f1e7 irregular6612 committed 22 days ago
fix(web): update errand memory legend caption for grass+house (green=grass, brown=house) f321e55 irregular6612 committed 22 days ago
docs(test): fix stale 'donut' wording in errand_runner test docstring 15edf6b irregular6612 committed 22 days ago
feat(errand): surface grass-cut/avoid + pedestrian-touch in results; grass breaks civic/outlaw persona tie b67a78a irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(errand): director grass-aware persona routing - civic avoids lawn, cut personas take the grass shortcut 0ddb44b irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(errand): live scenario grass+house - walkable lawn, brown house goal, grass-step + ped-touch tracking, grass persona dimension 1ccda08 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(errand): grass+house home, widened ped corridor, corner wallet, 6-turn light (geometry) d74c526 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
refactor(errand): share policy radii via errand_world, symmetric wallet radius, errand-specific memory legend 19ac967 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(web): errand results screen - metric cards + event summary + closest persona 3e839c8 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(web): memory-type toggle (3 persona memories) in errand replay 34563f9 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
fix(errand): honest results summary - None for unmeasured metrics, document persona tie + approximate fields 6b41a13 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(errand): results summary (event choices, closest persona, headline metrics) in review 80c8b11 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(web): expose 3 errand memory variants (id/label/frames + selected) 749c5ae irregular6612 committed 22 days ago
docs(memory): note errand C=9 / distractor_idx collision is safe only single-agent f571c87 irregular6612 committed 22 days ago
test(errand): memory renders walls, donut, single courier b571b83 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
refactor(errand): C1 review - reuse w.footprint, precompute wallet field, fix ped drift, type hints b50b604 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(errand): single-agent BFS-routed memory + 3 persona variants on MEMORY_LAYOUT b533b12 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
refactor(errand): B1 review - step_reward off-field guard, drop dead _ORDER, cache invalidation 123ba13 irregular6612 committed 22 days ago
feat(errand): walled GAME_LAYOUT - visible world sprites, donut success, terrain damage, rescue+wander 6524b1b irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
test(errand): harden A1 - GAME+all-spawns BFS reachability, step_toward_field, docstrings 6e45036 irregular6612 committed 22 days ago
feat(errand): layout library - two WorldLayouts, walls/donut/BFS, 3 personas 8cad82b irregular6612 committed 22 days ago
docs(plan): errand_runner redesign implementation plan (6 phases, TDD tasks) 81302ae irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
chore(branding): rename displayed name PROTEUS Arena -> AgentnessBench d2fd3d8 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
docs(spec): errand_runner redesign - donut goal, forced-path maps, 3 memory types, results screen efc49f1 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
merge: errand_runner scenario (visual-only city errand + find-your-body discovery) a032307 irregular6612 committed 22 days ago
docs(scenario): drop stale Task-WIP comments in errand_runner (logic is complete) 5b8015d irregular6612 committed 22 days ago
feat(web): interact control (E) + health/curfew readout (scenario-gated) c731194 irregular6612 committed 22 days ago
feat(web): expose health + turns_left in InteractiveSession.state() 6cf09d6 irregular6612 Claude Sonnet 4.6 committed 22 days ago
test(discovery): end-to-end errand_runner session emits discovery metric fa9db09 irregular6612 Claude Sonnet 4.6 committed 22 days ago
feat(memory): author_errand_runner persona demo (cells overlay + npc pedestrian) + default_memory f7209f4 irregular6612 Claude Sonnet 4.6 committed 22 days ago
feat(discovery): discovery_turn/identified/efficiency metric (additive) 3be82d3 irregular6612 Claude Opus 4.8 (1M context) committed 22 days ago
feat(discovery): source available actions from scenario.action_set (interact reaches the agent) 11cd1de irregular6612 Claude Sonnet 4.6 committed 22 days ago
feat(discovery): parse SELF: report + score self_correct in make_turn_trace d36047a irregular6612 Claude Sonnet 4.6 committed 22 days ago
feat(discovery): TurnTrace self_belief/self_correct + Scenario discovery hooks 45e0c57 irregular6612 Claude Sonnet 4.6 committed 22 days ago
feat(scenario): errand_runner personas (optimal vs civic-habit), home/health/curfew termination, home reward bd70ef9 irregular6612 Claude Sonnet 4.6 committed 22 days ago