OpenRA-Bench / tests

Commit History

providers: implement BedrockProvider via Converse API
075d71c

Xiaochuang Yuan commited on

conftest: skip image-primary tests that call _render_state helper
9470076

Xiaochuang Yuan commited on

Add Playlist tab โ€” cold-start non-gamer baseline UX
5413afb

Xiaochuang Yuan commited on

Bucket E: align synthetic predicate-test contexts with current YAML
7fada42

Xiaochuang Yuan commited on

Bucket D: discover archived packs + load adversarial-siege from _archive
0f4480c

Xiaochuang Yuan commited on

Bucket C: classify the 11 unclassified active packs
ed78310

Xiaochuang Yuan commited on

Bucket A: drop tests referencing archived artofwar-decoy-sacrifice
d4148f9

Xiaochuang Yuan commited on

conftest: add run_handoff / open_study_session to engine-token list
060f765

Xiaochuang Yuan commited on

conftest: skip tests that import openra_train inside test bodies
df3ac76

Xiaochuang Yuan commited on

Merge origin/main into pr13-revised
d4d38ed

Xiaochuang Yuan commited on

Land PR #19 static site infrastructure (rebased onto pr13-revised)
1c57350

yxc20098 commited on

def-position-revealed-direction: tailored 112x40 lane arena
277431c

yxc20098 commited on

def-tower-line-vs-cluster: per-tier maps that match topology to threat geometry
d396087

yxc20098 commited on

combat-hold-chokepoint: rewrite as canonical chokepoint-arena generator use case
73e261f

yxc20098 commited on

def-walls-vs-towers: tier-differentiated walls vs towers doctrine
6129f0d

yxc20098 commited on

no-cheat redesign: build-defensive-tower-line โ€” wide-front LINE topology
b65a333

yxc20098 commited on

def-multi-direction: tailor per-tier arenas + scale lanes per tier
16b217d

yxc20098 commited on

def-position-expected-direction: tailor per-tier maps to the funnel idiom
eb84faa

yxc20098 commited on

combat-attack-from-behind-fog: per-tier maps reinforce fog-flank doctrine
d3b8d98

yxc20098 commited on

feat(scenario): econ-contested-expansion โ€” corridor map tailored to geographic contention
03dcb55

yxc20098 commited on

econ-second-base-race: redesign with custom map + spatial placement gate
4fb498d

yxc20098 commited on

mapgen: obstacles, bridges-arena, chokepoint-arena + per-scenario authoring
e3e79cb

yxc20098 commited on

Fix P0 regression: explicit agent.cash in 3 test packs
859aa77

yxc20098 commited on

Phase 1 engine audit: ENGINE_AUDIT.md + bench-side closures
c634971

yxc20098 commited on

Quality drive: schema fix, 5 new/revised packs, 4 engine tests, scenario audit
6d71d3b

yxc20098 commited on

Engine-feature integration: 4 commands + 9 scenario packs + 4 test suites
20960c1

yxc20098 commited on

Strip empty tool_calls from wire history (Together Qwen3.6-Plus fix)
fb20de3

yxc20098 commited on

Paper-experiment prep: human-study harness + pass^k + paper plan
07dfe2e

yxc20098 commited on

Add handoff ablation (recover-from-deficit / capitalize-on-advantage)
cb15568

yxc20098 commited on

Add perception ablation grid (observation channel ร— fog of war)
4a5b0dd

yxc20098 commited on

Skip live engine tests without openra_train
18c4140

Siqi commited on

Skip survive live tests without engine wheel
588bf8c

Siqi commited on

Skip live Rust rendezvous tests without engine wheel
edd5985

Siqi commited on

Improve Play minimap clarity
a412201

Siqi commited on

Persist human Play-tab runs in the standard Playback format
680e8db

yxc20098 commited on

test: HumanController drives a 1v1 side (Phase 2 x Phase 3)
401851b

yxc20098 commited on

Phase 2: Play tab โ€” selection boundary, move arrows, clear selection
d579975

yxc20098 commited on

Recalibrate 8 packs regressed by the count: spawn-spread engine fix
b5f8a60

yxc20098 commited on

Phase 2: Play tab โ€” fix hp, reorder panel, cleaner units table
d8da7e4

yxc20098 commited on

Phase 2: Play tab โ€” show objective, units panel, sharper minimap
4386310

yxc20098 commited on

Phase 2: 'Play' tab in app.py โ€” human-labeling UI
81fb667

yxc20098 commited on

Phase 2: InteractiveSession โ€” turn-steppable backend for GUI play
7d53522

yxc20098 commited on

Phase 3: 1v1 full-macro adversarial harness
aab84a0

yxc20098 commited on

Phase 2: human-labeling machine core
c044b07

yxc20098 commited on

Phase 1: unified Controller interface for the eval stack
c68e036

yxc20098 commited on

fix(scenario): combat-target-priority-highvalue โ€” recalibrate after engine movement fixes
248d766

yxc20098 commited on

fix(scenario): rob-multiple-simultaneous-pressures โ€” recalibrate after engine movement fixes
0eecda5

yxc20098 commited on

fix(scenario): combat-tank-vs-tank-engagement โ€” recalibrate after engine movement fixes
e05ae9b

yxc20098 commited on

fix(scenario): combat-bait-counter-attack โ€” recalibrate after engine movement fixes
7520dea

yxc20098 commited on

fix(scenario): build-repair-priority-under-fire โ€” add hard-tier spawn-witness
df4ce7b

yxc20098 commited on