Improve reward function to break refuse-everything local minimum and scale training bd8220a unverified Claude committed 3 days ago
Implement self-improving AI oversight system with nested RL environments e6b0e2f unverified Claude committed 4 days ago