Commit History

Sanitize empty finding descriptions
e3e5d23

balloonmann commited on

Harden reward parsing for truncated JSON
fd84f8b

balloonmann commited on

Round2 hardening, campaign fixes, training quickstart, and CI gate
92ead25

balloonmann commited on

Round 2 Implementation: Multi-period campaign, regulatory shocks, adversarial grading, and GRPO training infrastructure
52f5c27

balloonmann commited on

Emergency fix: drop scoring limit to 2 decimal points and 0.01 / 0.99 to pass deep verification
57d984d

balloonmann commited on

Fix phase 2 deep validation grader scoring out-of-bounds error
140ce9d

balloonmann commited on

Refactor graders.py: add explicit partial credit comments, strict duplicate handling, and weighted false negative tracking
e43c1bc

balloonmann commited on

Fix inference stdout formatting and update graders tests for strictly clamped reward constraint bounds
21fc032

balloonmann commited on

fix: increase score epsilon to 0.01 for validator safety
b4d5e6a

balloonmann commited on

feat: v2.0 — fraud detection task, severity grading, investigation mode, security hardening, 78 tests
126bdbd

balloonmann commited on