Fix format_comparison_metrics_html to accept run_comparison() dict directly d52b449 nihalaninihal Claude Opus 4.6 committed on Mar 8
Add run_demo_episode wrapper to demo.py for dict-based episode results fcf34b9 nihalaninihal Claude Opus 4.6 committed on Mar 8
Update metrics format with drift/oversight tracking, add colab training notebook 5e0f2b1 nihalaninihal committed on Mar 8
Add drift-specific metrics: drift events, detection, adaptation rate eb9e808 nihalaninihal committed on Mar 8
Add oversight accuracy and explanation quality metrics to dashboard 33b6c02 nihalaninihal committed on Mar 8
Improve HeuristicOversight explanations with specific data references 62aabbf nihalaninihal committed on Mar 8
Add structured explanation quality scoring for oversight agent 5efcc1b nihalaninihal committed on Mar 8
Fix schema drift renames to target actual Customer model fields 197e7c5 nihalaninihal committed on Mar 8
Fix window_ticks policy enforcement in billing refund validation aea9d7d nihalaninihal committed on Mar 8
Add randomized attacker, security metrics engine, and updated Gradio dashboard 69a7e43 nihalaninihal Claude Opus 4.6 committed on Mar 8
Add episode metrics computation and HTML formatting for SentinelOps Arena 23f3257 nihalaninihal Claude Opus 4.6 committed on Mar 8
Replace HeuristicAttacker with RandomizedAttacker for probabilistic attacks 1f6f2a5 nihalaninihal Claude Opus 4.6 committed on Mar 8
Remove hackathon_env template, rewrite train.py for SentinelOpsArena 0e5a0a6 nihalaninihal Claude Opus 4.6 committed on Mar 8
Implement Phase 3 (HTTP server) and Phase 4 (demo + Gradio app) fa00f5a nihalaninihal Claude Opus 4.6 committed on Mar 8
Implement Phase 2: environment core with MCPEnvironment base 6c20e91 nihalaninihal Claude Opus 4.6 committed on Mar 8
Implement Phase 1: models, enterprise systems, attacks, rewards a4e6593 nihalaninihal Claude Opus 4.6 committed on Mar 8