Spaces:

qpluslab
/

OpenRA-Bench

Running

App Files Files Community

OpenRA-Bench / tests

Commit History

providers: implement BedrockProvider via Converse API

075d71c

Xiaochuang Yuan commited on May 23

conftest: skip image-primary tests that call _render_state helper

9470076

Xiaochuang Yuan commited on May 23

Add Playlist tab — cold-start non-gamer baseline UX

5413afb

Xiaochuang Yuan commited on May 23

Bucket E: align synthetic predicate-test contexts with current YAML

7fada42

Xiaochuang Yuan commited on May 23

Bucket D: discover archived packs + load adversarial-siege from _archive

0f4480c

Xiaochuang Yuan commited on May 23

Bucket C: classify the 11 unclassified active packs

ed78310

Xiaochuang Yuan commited on May 23

Bucket A: drop tests referencing archived artofwar-decoy-sacrifice

d4148f9

Xiaochuang Yuan commited on May 23

conftest: add run_handoff / open_study_session to engine-token list

060f765

Xiaochuang Yuan commited on May 23

conftest: skip tests that import openra_train inside test bodies

df3ac76

Xiaochuang Yuan commited on May 23

Merge origin/main into pr13-revised

d4d38ed

Xiaochuang Yuan commited on May 23

Land PR #19 static site infrastructure (rebased onto pr13-revised)

1c57350

yxc20098 commited on May 23

def-position-revealed-direction: tailored 112x40 lane arena

277431c

yxc20098 commited on May 23

def-tower-line-vs-cluster: per-tier maps that match topology to threat geometry

d396087

yxc20098 commited on May 23

combat-hold-chokepoint: rewrite as canonical chokepoint-arena generator use case

73e261f

yxc20098 commited on May 23

def-walls-vs-towers: tier-differentiated walls vs towers doctrine

6129f0d

yxc20098 commited on May 23

no-cheat redesign: build-defensive-tower-line — wide-front LINE topology

b65a333

yxc20098 commited on May 23

def-multi-direction: tailor per-tier arenas + scale lanes per tier

16b217d

yxc20098 commited on May 23

def-position-expected-direction: tailor per-tier maps to the funnel idiom

eb84faa

yxc20098 commited on May 23

combat-attack-from-behind-fog: per-tier maps reinforce fog-flank doctrine

d3b8d98

yxc20098 commited on May 23

feat(scenario): econ-contested-expansion — corridor map tailored to geographic contention

03dcb55

yxc20098 commited on May 23

econ-second-base-race: redesign with custom map + spatial placement gate

4fb498d

yxc20098 commited on May 23

mapgen: obstacles, bridges-arena, chokepoint-arena + per-scenario authoring

e3e79cb

yxc20098 commited on May 23

Fix P0 regression: explicit agent.cash in 3 test packs

859aa77

yxc20098 commited on May 23

Phase 1 engine audit: ENGINE_AUDIT.md + bench-side closures

c634971

yxc20098 commited on May 23

Quality drive: schema fix, 5 new/revised packs, 4 engine tests, scenario audit

6d71d3b

yxc20098 commited on May 23

Engine-feature integration: 4 commands + 9 scenario packs + 4 test suites

20960c1

yxc20098 commited on May 23

Strip empty tool_calls from wire history (Together Qwen3.6-Plus fix)

fb20de3

yxc20098 commited on May 23

Paper-experiment prep: human-study harness + pass^k + paper plan

07dfe2e

yxc20098 commited on May 23

Add handoff ablation (recover-from-deficit / capitalize-on-advantage)

cb15568

yxc20098 commited on May 22

Add perception ablation grid (observation channel × fog of war)

4a5b0dd

yxc20098 commited on May 22

Skip live engine tests without openra_train

18c4140

Siqi commited on May 22

Skip survive live tests without engine wheel

588bf8c

Siqi commited on May 22

Skip live Rust rendezvous tests without engine wheel

edd5985

Siqi commited on May 22

Improve Play minimap clarity

a412201

Siqi commited on May 22

Persist human Play-tab runs in the standard Playback format

680e8db

yxc20098 commited on May 21

test: HumanController drives a 1v1 side (Phase 2 x Phase 3)

401851b

yxc20098 commited on May 21

Phase 2: Play tab — selection boundary, move arrows, clear selection

d579975

yxc20098 commited on May 21

Recalibrate 8 packs regressed by the count: spawn-spread engine fix

b5f8a60

yxc20098 commited on May 21

Phase 2: Play tab — fix hp, reorder panel, cleaner units table

d8da7e4

yxc20098 commited on May 21

Phase 2: Play tab — show objective, units panel, sharper minimap

4386310

yxc20098 commited on May 21

Phase 2: 'Play' tab in app.py — human-labeling UI

81fb667

yxc20098 commited on May 21

Phase 2: InteractiveSession — turn-steppable backend for GUI play

7d53522

yxc20098 commited on May 21

Phase 3: 1v1 full-macro adversarial harness

aab84a0

yxc20098 commited on May 21

Phase 2: human-labeling machine core

c044b07

yxc20098 commited on May 21

Phase 1: unified Controller interface for the eval stack

c68e036

yxc20098 commited on May 21

fix(scenario): combat-target-priority-highvalue — recalibrate after engine movement fixes

248d766

yxc20098 commited on May 21

fix(scenario): rob-multiple-simultaneous-pressures — recalibrate after engine movement fixes

0eecda5

yxc20098 commited on May 21

fix(scenario): combat-tank-vs-tank-engagement — recalibrate after engine movement fixes

e05ae9b

yxc20098 commited on May 21

fix(scenario): combat-bait-counter-attack — recalibrate after engine movement fixes

7520dea

yxc20098 commited on May 21

fix(scenario): build-repair-priority-under-fire — add hard-tier spawn-witness

df4ce7b

yxc20098 commited on May 21

Commit History

providers: implement BedrockProvider via Converse API 075d71c

conftest: skip image-primary tests that call _render_state helper 9470076

Add Playlist tab — cold-start non-gamer baseline UX 5413afb

Bucket E: align synthetic predicate-test contexts with current YAML 7fada42

Bucket D: discover archived packs + load adversarial-siege from _archive 0f4480c

Bucket C: classify the 11 unclassified active packs ed78310

Bucket A: drop tests referencing archived artofwar-decoy-sacrifice d4148f9

conftest: add run_handoff / open_study_session to engine-token list 060f765

conftest: skip tests that import openra_train inside test bodies df3ac76

Merge origin/main into pr13-revised d4d38ed

Land PR #19 static site infrastructure (rebased onto pr13-revised) 1c57350

def-position-revealed-direction: tailored 112x40 lane arena 277431c

def-tower-line-vs-cluster: per-tier maps that match topology to threat geometry d396087

combat-hold-chokepoint: rewrite as canonical chokepoint-arena generator use case 73e261f

def-walls-vs-towers: tier-differentiated walls vs towers doctrine 6129f0d

no-cheat redesign: build-defensive-tower-line — wide-front LINE topology b65a333

def-multi-direction: tailor per-tier arenas + scale lanes per tier 16b217d

def-position-expected-direction: tailor per-tier maps to the funnel idiom eb84faa

combat-attack-from-behind-fog: per-tier maps reinforce fog-flank doctrine d3b8d98

feat(scenario): econ-contested-expansion — corridor map tailored to geographic contention 03dcb55

econ-second-base-race: redesign with custom map + spatial placement gate 4fb498d

mapgen: obstacles, bridges-arena, chokepoint-arena + per-scenario authoring e3e79cb

Fix P0 regression: explicit agent.cash in 3 test packs 859aa77

Phase 1 engine audit: ENGINE_AUDIT.md + bench-side closures c634971

Quality drive: schema fix, 5 new/revised packs, 4 engine tests, scenario audit 6d71d3b

Engine-feature integration: 4 commands + 9 scenario packs + 4 test suites 20960c1

Strip empty tool_calls from wire history (Together Qwen3.6-Plus fix) fb20de3

Paper-experiment prep: human-study harness + pass^k + paper plan 07dfe2e

Add handoff ablation (recover-from-deficit / capitalize-on-advantage) cb15568

Add perception ablation grid (observation channel × fog of war) 4a5b0dd

Skip live engine tests without openra_train 18c4140

Skip survive live tests without engine wheel 588bf8c

Skip live Rust rendezvous tests without engine wheel edd5985

Improve Play minimap clarity a412201

Persist human Play-tab runs in the standard Playback format 680e8db

test: HumanController drives a 1v1 side (Phase 2 x Phase 3) 401851b

Phase 2: Play tab — selection boundary, move arrows, clear selection d579975

Recalibrate 8 packs regressed by the count: spawn-spread engine fix b5f8a60

Phase 2: Play tab — fix hp, reorder panel, cleaner units table d8da7e4

Phase 2: Play tab — show objective, units panel, sharper minimap 4386310

Phase 2: 'Play' tab in app.py — human-labeling UI 81fb667

Phase 2: InteractiveSession — turn-steppable backend for GUI play 7d53522

Phase 3: 1v1 full-macro adversarial harness aab84a0

Phase 2: human-labeling machine core c044b07

Phase 1: unified Controller interface for the eval stack c68e036

fix(scenario): combat-target-priority-highvalue — recalibrate after engine movement fixes 248d766

fix(scenario): rob-multiple-simultaneous-pressures — recalibrate after engine movement fixes 0eecda5

fix(scenario): combat-tank-vs-tank-engagement — recalibrate after engine movement fixes e05ae9b

fix(scenario): combat-bait-counter-attack — recalibrate after engine movement fixes 7520dea

fix(scenario): build-repair-priority-under-fire — add hard-tier spawn-witness df4ce7b

providers: implement BedrockProvider via Converse API

075d71c

conftest: skip image-primary tests that call _render_state helper

9470076

Add Playlist tab — cold-start non-gamer baseline UX

5413afb

Bucket E: align synthetic predicate-test contexts with current YAML

7fada42

Bucket D: discover archived packs + load adversarial-siege from _archive

0f4480c

Bucket C: classify the 11 unclassified active packs

ed78310

Bucket A: drop tests referencing archived artofwar-decoy-sacrifice

d4148f9

conftest: add run_handoff / open_study_session to engine-token list

060f765

conftest: skip tests that import openra_train inside test bodies

df3ac76

Merge origin/main into pr13-revised

d4d38ed

Land PR #19 static site infrastructure (rebased onto pr13-revised)

1c57350

def-position-revealed-direction: tailored 112x40 lane arena

277431c

def-tower-line-vs-cluster: per-tier maps that match topology to threat geometry

d396087

combat-hold-chokepoint: rewrite as canonical chokepoint-arena generator use case

73e261f

def-walls-vs-towers: tier-differentiated walls vs towers doctrine

6129f0d

no-cheat redesign: build-defensive-tower-line — wide-front LINE topology

b65a333

def-multi-direction: tailor per-tier arenas + scale lanes per tier

16b217d

def-position-expected-direction: tailor per-tier maps to the funnel idiom

eb84faa

combat-attack-from-behind-fog: per-tier maps reinforce fog-flank doctrine

d3b8d98

feat(scenario): econ-contested-expansion — corridor map tailored to geographic contention

03dcb55

econ-second-base-race: redesign with custom map + spatial placement gate

4fb498d

mapgen: obstacles, bridges-arena, chokepoint-arena + per-scenario authoring

e3e79cb

Fix P0 regression: explicit agent.cash in 3 test packs

859aa77

Phase 1 engine audit: ENGINE_AUDIT.md + bench-side closures

c634971

Quality drive: schema fix, 5 new/revised packs, 4 engine tests, scenario audit

6d71d3b

Engine-feature integration: 4 commands + 9 scenario packs + 4 test suites

20960c1

Strip empty tool_calls from wire history (Together Qwen3.6-Plus fix)

fb20de3

Paper-experiment prep: human-study harness + pass^k + paper plan

07dfe2e

Add handoff ablation (recover-from-deficit / capitalize-on-advantage)

cb15568

Add perception ablation grid (observation channel × fog of war)

4a5b0dd

Skip live engine tests without openra_train

18c4140

Skip survive live tests without engine wheel

588bf8c

Skip live Rust rendezvous tests without engine wheel

edd5985

Improve Play minimap clarity

a412201

Persist human Play-tab runs in the standard Playback format

680e8db

test: HumanController drives a 1v1 side (Phase 2 x Phase 3)

401851b

Phase 2: Play tab — selection boundary, move arrows, clear selection

d579975

Recalibrate 8 packs regressed by the count: spawn-spread engine fix

b5f8a60

Phase 2: Play tab — fix hp, reorder panel, cleaner units table

d8da7e4

Phase 2: Play tab — show objective, units panel, sharper minimap

4386310

Phase 2: 'Play' tab in app.py — human-labeling UI

81fb667

Phase 2: InteractiveSession — turn-steppable backend for GUI play

7d53522

Phase 3: 1v1 full-macro adversarial harness

aab84a0

Phase 2: human-labeling machine core

c044b07

Phase 1: unified Controller interface for the eval stack

c68e036

fix(scenario): combat-target-priority-highvalue — recalibrate after engine movement fixes

248d766

fix(scenario): rob-multiple-simultaneous-pressures — recalibrate after engine movement fixes

0eecda5

fix(scenario): combat-tank-vs-tank-engagement — recalibrate after engine movement fixes

e05ae9b

fix(scenario): combat-bait-counter-attack — recalibrate after engine movement fixes

7520dea

fix(scenario): build-repair-priority-under-fire — add hard-tier spawn-witness

df4ce7b