Spaces:

AlgoCore
/

support-ticket-env

Sleeping

App Files Files Community

AlgoCore commited on Apr 5

Commit

bdd9825

1 Parent(s): e693966

Professional README in OpenEnv style

Browse files

Files changed (1) hide show

README.md +109 -11

README.md CHANGED Viewed

@@ -11,22 +11,120 @@ pinned: false
 # Customer Support Ticket Resolution Environment
-A real-world OpenEnv environment where an AI agent learns to handle customer support tickets.
 ## Tasks
-- Task 1 (Easy): Classify ticket into correct category
-- Task 2 (Medium): Classify then choose correct action
-- Task 3 (Hard): Resolve a full queue of 3 tickets
 ## Action Space
-- action_type: classify / reply / escalate / close
-- category: billing / technical / account / general / refund
-- reply_text: free text
 ## Observation Space
-- ticket_id, ticket_text, task_id, current_category, resolved, step_count, feedback, score
 ## Baseline Scores
-- Task 1: 0.87
-- Task 2: 0.71
-- Task 3: 0.58

 # Customer Support Ticket Resolution Environment
+A real-world [OpenEnv](https://github.com/meta-pytorch/OpenEnv) environment where an AI agent acts as a customer support executive, triaging and resolving incoming tickets.
+## Overview
+Customer support triage is one of the most common real-world tasks for AI agents. Every company handles thousands of tickets daily. Getting the classification wrong routes the ticket to the wrong team. Choosing the wrong action has direct business impact. This environment trains agents to handle exactly this challenge.
+## Quick Start
+```python
+from support_ticket_env import SupportAction, SupportTicketEnv
+with SupportTicketEnv(base_url="https://algocore-support-ticket-env.hf.space").sync() as env:
+    # Task 1 - Classify a ticket
+    result = env.reset(task_id=1, seed=42)
+    print(result.observation.ticket_text)
+    result = env.step(SupportAction(action_type="classify", category="billing"))
+    print(result.reward)  # 1.0 if correct
+```
 ## Tasks
+| Task | Difficulty | Description | Score Range |
+|------|-----------|-------------|-------------|
+| Task 1 | Easy | Classify ticket into correct category | 0.0 - 1.0 |
+| Task 2 | Medium | Classify then choose correct action | 0.0 - 1.0 |
+| Task 3 | Hard | Resolve a full queue of 3 tickets | 0.0 - 1.0 |
 ## Action Space
+Actions are `SupportAction` Pydantic objects:
+| Field | Type | Required | Values |
+|-------|------|----------|--------|
+| `action_type` | str | always | `classify` / `reply` / `escalate` / `close` |
+| `category` | str | for classify | `billing` / `technical` / `account` / `general` / `refund` |
+| `reply_text` | str | for reply | free text |
+| `reason` | str | optional | free text |
 ## Observation Space
+| Field | Type | Description |
+|-------|------|-------------|
+| `ticket_id` | str | Unique ticket ID |
+| `ticket_text` | str | Customer message |
+| `task_id` | int | 1, 2, or 3 |
+| `current_category` | str | Category assigned so far |
+| `resolved` | bool | Whether ticket is resolved |
+| `step_count` | int | Steps taken this episode |
+| `feedback` | str | Human-readable feedback |
+| `reward` | float | Reward signal |
+| `done` | bool | Episode finished |
+## Reward Function
+Rewards provide partial progress signals throughout the trajectory:
+- **Task 1:** 1.0 for correct category, 0.0 for wrong
+- **Task 2:** 1.0 correct action, 0.5 defensible alternative, 0.3 classification only
+- **Task 3:** 0.20 classification + 0.40 action + 0.25 reply quality + 0.15 efficiency bonus
+- **Penalty:** -0.05 per step over 10 (loop deterrent)
+## Project Structure
+```
+support_ticket_env/
+├── __init__.py          # Package exports
+├── models.py            # SupportAction, SupportObservation, SupportState
+├── tickets.py           # Ticket dataset with ground-truth labels
+├── graders.py           # Reward/grader functions for all 3 tasks
+├── client.py            # EnvClient subclass
+├── baseline.py          # Baseline inference script
+├── openenv.yaml         # Environment metadata
+├── Dockerfile           # Container definition
+└── server/
+    ├── app.py           # FastAPI entry point
+    └── support_environment.py  # Environment logic
+```
+## Setup
+```bash
+# Install dependencies
+pip install openenv-core fastapi uvicorn pydantic gradio openai
+# Run locally
+cd support_ticket_env
+uvicorn server.app:app --host 0.0.0.0 --port 7860
+# Docker
+docker build -t support-ticket-env .
+docker run -p 7860:7860 support-ticket-env
+# Run tests
+python run_tests.py
+```
 ## Baseline Scores
+Measured with `gpt-4o-mini`, seeds `[42, 7, 123]`:
+| Task | Avg Score |
+|------|-----------|
+| Task 1 - Classification | 0.87 |
+| Task 2 - Action Selection | 0.71 |
+| Task 3 - Full Resolution | 0.58 |
+| **Overall** | **0.72** |
+## Links
+- **HuggingFace Space:** https://huggingface.co/spaces/AlgoCore/support-ticket-env
+- **GitHub:** https://github.com/TryingHardToBeDeveloper/support-ticket-env
+- **OpenEnv Docs:** https://meta-pytorch.org/OpenEnv/
+## License
+MIT