refactor: replace datetime.utcnow() with timezone-aware datetime.now(UTC) 40e374a subhdotsol commited on Apr 1
feat(environment): add get_history() β returns copy of attack_history 35c659c subhdotsol commited on Apr 1
feat(environment): add get_state() β synchronous EpisodeState snapshot db1fcf1 subhdotsol commited on Apr 1
feat(environment): build and return StepResult with full observation 4ec98de subhdotsol commited on Apr 1
feat(environment): add termination logic β max_turns, attack_success, safety_flagged 348c096 subhdotsol commited on Apr 1
feat(environment): append turn data to attack_history after each step 1d7a191 subhdotsol commited on Apr 1
feat(environment): integrate reward_computer call in step() with mock fallback 333177f subhdotsol commited on Apr 1
feat(environment): integrate llm_pipeline call in step() with mock fallback c6ae20e subhdotsol commited on Apr 1
feat(environment): add conversation history builder from last 5 attack turns 8fa387f subhdotsol commited on Apr 1
feat(environment): add step() skeleton with active episode guard 6a29bd4 subhdotsol commited on Apr 1
feat(environment): call reward_computer.reset() in reset() to clear novelty scorer 348c725 subhdotsol commited on Apr 1
feat(environment): implement reset() β zeros state and generates new episode_id e4d4c65 subhdotsol commited on Apr 1
feat(environment): add set_llm_pipeline() injection method for Person 3 d289b48 subhdotsol commited on Apr 1
feat(environment): add set_reward_computer() injection method for Person 2 061383a subhdotsol commited on Apr 1
feat(environment): scaffold RedTeamEnvironment class with state fields b8bc48b subhdotsol commited on Apr 1