refactor(env): update environment, paper state, rewards, prompts, and verifier aaba0a8 ianalin123 commited on 4 days ago