Round2 hardening, campaign fixes, training quickstart, and CI gate 92ead25 balloonmann commited on Apr 22
Round 2 Implementation: Multi-period campaign, regulatory shocks, adversarial grading, and GRPO training infrastructure 52f5c27 balloonmann commited on Apr 22
Emergency fix: drop scoring limit to 2 decimal points and 0.01 / 0.99 to pass deep verification 57d984d balloonmann commited on Apr 8
Refactor graders.py: add explicit partial credit comments, strict duplicate handling, and weighted false negative tracking e43c1bc balloonmann commited on Apr 7
Fix inference stdout formatting and update graders tests for strictly clamped reward constraint bounds 21fc032 balloonmann commited on Apr 7
feat: v2.0 — fraud detection task, severity grading, investigation mode, security hardening, 78 tests 126bdbd balloonmann commited on Apr 3