Spaces:
Sleeping
Sleeping
| name: "SupportTicketEnv" | |
| version: "1.0.0" | |
| description: > | |
| A real-world OpenEnv environment simulating a customer support ticketing system. | |
| The agent must process open tickets by optionally fetching user data, checking internal policies, | |
| and taking terminal actions like issuing a refund, replying, escalating, or closing the ticket. | |
| action_space: | |
| type: "dict" | |
| schema: | | |
| { | |
| "action_type": "[fetch_user_data, check_policy, issue_refund, reply_to_customer, escalate, close_ticket]", | |
| "parameters": {"param_name": "param_value"} | |
| } | |
| observation_space: | |
| type: "dict" | |
| schema: | | |
| { | |
| "ticket": {"TicketInfo object"}, | |
| "available_actions": ["list of strings"], | |
| "system_message": "string", | |
| "history": ["List of strings of past actions"], | |
| "tool_output": "Optional string of the latest action output", | |
| "step_count": "integer" | |
| } | |
| reward_description: > | |
| The reward is between 0.0 and 1.0. Partial credit is given for taking correct | |
| intermediate steps (like checking policy before acting or fetching user data). | |
| Penalties are applied for taking contradictory or destructive actions | |
| (e.g., escalating unnecessarily, issuing refunds against policy). | |