permanence-training / tests /test_tech_tasks_e2e.py

Commit History

PERMANENCE: reversibility-aware RL environment for training LLM agents
8f27137
verified

chane335 commited on

Run 6.1: env precondition fix (destructive DB ops on missing tables short-circuit)
a2327d8
verified

chane335 commited on

Run 6: forced variants (eps 50%→70%), β_rank=0.25, R-level bonus, μ=2 PPO epochs, balanced R1-R5 warmup traces
e198371
verified

chane335 commited on

Run 4: tech-only curriculum, 3B model, integrated deploy task
94bea2c
verified

chane335 commited on

Run 4: tech-only curriculum, 3B model, integrated deploy task
4421025
verified

chane335 commited on