stable-pre-training: env + agents + training script verified (sanity pass) 1af7f0a Bot committed on Apr 24