from openenv.core.env_server import create_fastapi_app from models import SupportAction, SupportObservation from server.environment import SupportEnvironment from fastapi.responses import HTMLResponse import uvicorn dashboard_content = r"""
Scaler Meta PyTorch Hackathon Submission
A specialized Reinforcement Learning environment designed to measure LLM agent accuracy in organizational triage. This environment facilitates high-precision routing of complex support tickets.
route: Direct ticket to a department.search: Query internal knowledge base.Billing: Invoices & PaymentsTech: Debugging & API ErrorsSales: Enterprise PricingIntegration with this environment is standard via openenv-core. The following endpoint structure is exposed:
/ws (Main interaction loop)/state (Retrieve SupportState)/reset (Initialize task)# Connect using Python SDK client = SupportEnvClient(base_url='YOUR_SPACE_URL').sync() obs = client.reset(task_name='hard') result = client.step(SupportAction(action_type='route', department='Tech'))
Tasks are weighted by the complexity of ticket content and required lookups.
Reward Signaling: This environment utilizes a [0.01, 0.99] bounded reward system to provide a clean signal for RL training while maintaining strict judge-compliant bounds.