Fix: clamp scores to strict (0.001, 0.999) — validator rejects exact 0 and 1 95a7dc0 Somuai12 commited on Apr 10
Audit fixes: tests/ dir, clean imports, reactive corpus, README polish 70f8688 Somuai12 commited on Apr 10
Staff-Level Upgrade: Segmented Evaluation, Noise Filtering, and Task Hardening 4553b37 Somuai12 commited on Apr 10
Implement profound exploit hardening (InstructionGuard, DensityCheck, LogicalAlignment, Step-Locking) a9f749a Somuai12 commited on Apr 9
Enhance: Upgrade test suite to professional simulation showing clear reward shaping 5453275 Somuai12 commited on Apr 8
Fix grading keys mismatch: allow actual dataset metrics to be graded 184bef3 Somuai12 commited on Apr 8
hackathon: final submission candidate (removes binary image for HF compatibility) 6aa8acb Somuai12 commited on Apr 3