Commit History

fix: correct RL reward signal, disable reactive red pivot, propagate negligence penalty to dimensions
75fda81

Ajayyy00 and Claude Sonnet 4.6 committed on

Fix 4 critical RL training bugs
08d1655

Ajayyy00 and Claude Sonnet 4.6 committed on

Add hotseat multiplayer Red Team controls and 4 architectural fixes
16abf54

Ajayyy00 and Claude Sonnet 4.6 committed on

Gracefully handle invalid LLM actions with -0.2 penalty instead of 500 crash
fb2558e

Ajayyy00 committed on

Fix attack_chain KeyError and other validation errors
a1eaa4b

Ajayyy00 committed on

Initialize _step_reward_total in __init__ to fix AttributeError
f02fe4d

Ajayyy00 committed on

Add root health check route to fix HF 404
4f81d19

Ajayyy00 committed on

Add FSP multi-agent architecture: Red Team LLM action space + alternating turns
f4496b6

Ajayyy00 and Claude Sonnet 4.6 committed on

Fix business_impact grader: penalise unjustified isolations per-host
0263728

Ajayyy00 and Claude Sonnet 4.6 committed on

Add alternating self-play training scaffolding.
03e943f

Ajayyy00 committed on

Replace SOAR playbooks with agentic micro-tools.
2f4b8ee

Ajayyy00 committed on

Add .gitignore, improve play_environment pending_followup tracking
6fe23ce

Ajayyy00 committed on

Fix phantom data bug: correct HF baseUrl, health endpoint for liveness probe, zero initial scores
941ae35

Ajayyy00 committed on

Initial commit of CyberSOC upgraded RLVR environment
fd8751e

Ajayyy00 committed on