PERMANENCE: reversibility-aware RL environment for training LLM agents 8f27137 verified chane335 committed on Apr 26
Run 4: tech-only curriculum, 3B model, integrated deploy task 4421025 verified chane335 committed on Apr 25
Sync: new curriculum, 6 tasks, composable rubric, latent dynamics d69dab0 verified chane335 committed on Apr 25