name: RetailTraderBehaviorCoach version: "1.0" state: timestep: int price: float position: int loss_streak: int pnl: float actions: - NO - WARN - REDUCE - EXIT - COOLDOWN endpoints: reset: /reset step: /step