meta_ai_hackathon / docs /STATE_SPACE.md
GOOD CAT
Final submission prep
ec8c511
# State Space
The environment uses a **22-dimensional normalized observation vector** (`[0,1]` per feature).
Order is fixed by `FEATURE_ORDER` in `traffic_generator.py`.
## Feature Groups
| Group | Features | Semantics |
|---|---|---|
| Volume & timing | bytes sent/received, duration, packet count, packet variance, inter-arrival mean/jitter | throughput shape and temporal burstiness |
| Network metadata | src/dst ports, protocol, DNS query count, connection reuse | routing and communication pattern |
| TLS / certificate | TLS version, JA3 cluster, chain length, cert validity, self-signed | encrypted-session trust indicators |
| Behavioral context | geo distance, time of day, session history score, entropy score | reputation and anomaly context |
## Observation Interfaces
- `evaluate_session(session_id)` returns the vector for a given session.
- `state()` returns environment-level counters and selected session IDs.
- `step_single(action)` returns `observation` for the next queued session.
## Normalization Strategy
- Each raw feature is min-max normalized using bounded ranges in `FEATURE_BOUNDS`.
- Outliers are clipped to `[0,1]` after normalization.
- This enables stable neural training across heterogeneous scales (ports, durations, entropy).
## Markov Context Notes
- Single-session mode is designed for fixed-shape RL loops.
- Multi-session mode supports tool-driven decision systems over dynamic queues.