Spaces:
Running
Running
File size: 4,023 Bytes
f20603d 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 ccb5f4e 0e5a0a6 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 | ---
title: SentinelOps Arena
emoji: "\U0001F6E1\uFE0F"
colorFrom: green
colorTo: red
sdk: gradio
sdk_version: 6.9.0
app_file: app.py
pinned: false
---
# SentinelOps Arena
Multi-agent self-play RL environment for enterprise security training, built on [OpenEnv](https://github.com/meta-pytorch/OpenEnv) for the [OpenEnv Hackathon SF](https://cerebralvalley.ai/e/openenv-hackathon-sf) (March 7-8, 2026).
Three AI agents compete in a simulated enterprise environment:
- **RED TEAM (Attacker)** β Launches schema drift, policy drift, social engineering, and rate limiting attacks
- **BLUE TEAM (Worker)** β Handles customer requests across CRM, Billing, and Ticketing systems
- **AUDITOR (Oversight)** β Monitors worker actions and flags policy violations
Through adversarial self-play with GRPO training, all three agents improve simultaneously.
## Quick Start
```bash
# Setup
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
# Run Gradio demo
python app.py
# Run HTTP server
python -m sentinelops_arena.server --port 8000
# Run demo script
python -m sentinelops_arena.demo
```
## Project Structure
```
NexusEnv/
βββ sentinelops_arena/
β βββ models.py # Action, Observation, State, data models
β βββ environment.py # SentinelOpsArena (MCPEnvironment) β core env
β βββ systems/
β β βββ crm.py # CRM simulator
β β βββ billing.py # Billing simulator
β β βββ ticketing.py # Ticketing simulator
β βββ attacks.py # 4 attack types (schema/policy drift, social eng, rate limit)
β βββ rewards.py # Reward functions for all 3 agents
β βββ task_generator.py # Customer task generation
β βββ demo.py # Heuristic agents + episode runner
β βββ server.py # HTTP/WebSocket server
β βββ test_phase1.py # Unit tests
β βββ test_environment.py # Integration tests
βββ app.py # Gradio UI (HuggingFace Spaces)
βββ train.py # GRPO training script (Unsloth + TRL)
βββ requirements.txt
βββ pyproject.toml
βββ README.md
```
## Architecture
**3 Agents, 3 Systems, 30 Ticks per Episode**
Each tick: Attacker acts β Worker acts β Oversight acts
### Attack Types
1. **Schema Drift** β Renames fields across all records. Worker must detect KeyError, call `get_schema()`, and adapt.
2. **Policy Drift** β Changes business rules (refund windows, approval requirements). Worker must call `get_current_policy()`.
3. **Social Engineering** β Injects fake authority messages. Worker must resist manipulation.
4. **Rate Limiting** β Throttles API calls. Worker must handle gracefully.
### MCP Tools
19 tools exposed via FastMCP, organized by agent role:
- **Worker**: lookup_customer, check_balance, issue_refund, create_ticket, get_schema, get_current_policy, etc.
- **Attacker**: launch_attack, get_attack_budget
- **Oversight**: flag_action, get_trajectory
## Training
Uses GRPO (Group Relative Policy Optimization) with Unsloth + TRL:
```bash
# Train with Unsloth (recommended, 2x faster)
python train.py --use_unsloth --model_name unsloth/Qwen2.5-0.5B-Instruct
# Train without Unsloth
python train.py --model_name Qwen/Qwen2.5-0.5B-Instruct
```
See `train.py` for the full training pipeline.
## Partner Tracks
- **Fleet AI** β Scalable Oversight: the Oversight agent monitors and explains Worker behavior
- **Patronus AI** β Schema Drift: schema and policy drift are core attack types
## Tech Stack
- **OpenEnv** 0.2.x β Environment framework
- **FastMCP** β MCP tool server
- **Gradio** β Demo UI
- **HuggingFace TRL** β GRPO training
- **Unsloth** β Fast fine-tuning (2x speed, 70% less VRAM)
- **Pydantic** β Data validation
## Tests
```bash
python sentinelops_arena/test_phase1.py
python sentinelops_arena/test_environment.py
```
|