Commit History

Update Red/Blue showdown behavior and refresh Qwen benchmark artifacts.
f4ce885

Viraj commited on

refactor: enhance type safety in inference and evaluation scripts; update pyright config to exclude specific directories
2780361

Viraj commited on

refactor: enhance type safety in inference and evaluation scripts; update pyright config to exclude specific directories
e40ec5e

Viraj commited on

feat: improve blue showdown runtime context
c4730ff

Viraj commited on

feat: add red reasoning metadata
6947df8

Viraj commited on

feat: harden environment integration
ef2c8af

Viraj commited on

feat: add dense red reward scoring
027331e

Viraj commited on

feat: add blue defender curriculum
a0cce81

Viraj commited on

feat: add raw bash red agent access
5d7586e

Viraj commited on

feat: port WarGames phase 0 environment
20eb0ca

Viraj commited on