Fix format_comparison_metrics_html to accept run_comparison() dict directly d52b449 nihalaninihal Claude Opus 4.6 commited on 3 days ago
Add run_demo_episode wrapper to demo.py for dict-based episode results fcf34b9 nihalaninihal Claude Opus 4.6 commited on 3 days ago
Update metrics format with drift/oversight tracking, add colab training notebook 5e0f2b1 nihalaninihal commited on 3 days ago
Add drift-specific metrics: drift events, detection, adaptation rate eb9e808 nihalaninihal commited on 3 days ago
Add oversight accuracy and explanation quality metrics to dashboard 33b6c02 nihalaninihal commited on 3 days ago
Improve HeuristicOversight explanations with specific data references 62aabbf nihalaninihal commited on 3 days ago
Add structured explanation quality scoring for oversight agent 5efcc1b nihalaninihal commited on 3 days ago
Fix schema drift renames to target actual Customer model fields 197e7c5 nihalaninihal commited on 3 days ago
Fix window_ticks policy enforcement in billing refund validation aea9d7d nihalaninihal commited on 3 days ago
Add randomized attacker, security metrics engine, and updated Gradio dashboard 69a7e43 nihalaninihal Claude Opus 4.6 commited on 3 days ago
Add episode metrics computation and HTML formatting for SentinelOps Arena 23f3257 nihalaninihal Claude Opus 4.6 commited on 3 days ago
Replace HeuristicAttacker with RandomizedAttacker for probabilistic attacks 1f6f2a5 nihalaninihal Claude Opus 4.6 commited on 3 days ago
Remove hackathon_env template, rewrite train.py for SentinelOpsArena 0e5a0a6 nihalaninihal Claude Opus 4.6 commited on 4 days ago
Implement Phase 3 (HTTP server) and Phase 4 (demo + Gradio app) fa00f5a nihalaninihal Claude Opus 4.6 commited on 4 days ago
Implement Phase 2: environment core with MCPEnvironment base 6c20e91 nihalaninihal Claude Opus 4.6 commited on 4 days ago
Implement Phase 1: models, enterprise systems, attacks, rewards a4e6593 nihalaninihal Claude Opus 4.6 commited on 4 days ago