Commit History

Update inference scoring: sum rewards, clamp to (0,1), add score to log_end
bb56035

xaheli commited on

Clamp submit reward to (0,1) using accumulated reward diff
e1518d0

xaheli commited on

Clamp all rewards to strict (0,1) range
2881ec7

xaheli commited on

fix: clamp task score to 0.1 range
198c8cd

xaheli commited on

updt: add uv.lock
46b9c6a

xaheli commited on

Add Apache 2.0 license
49aadc8

xaheli commited on

Initial commit: incident response env
5edbe19

xaheli commited on

initial commit
b4ce90c
verified

xaheli commited on