Implement self-improving AI oversight system with nested RL environments e6b0e2f unverified Claude commited on 3 days ago