Remove all rule-based fallback systems, require LLM inference 21da591 unverified Claude commited on 3 days ago
Fix critical gaps: prompt-sensitive agent, adversarial customers, executable GRPO, OpenEnv wrapper b259333 unverified Claude commited on 4 days ago
Implement self-improving AI oversight system with nested RL environments e6b0e2f unverified Claude commited on 4 days ago