Spaces:
Sleeping
Sleeping
| # State Space | |
| The environment uses a **22-dimensional normalized observation vector** (`[0,1]` per feature). | |
| Order is fixed by `FEATURE_ORDER` in `traffic_generator.py`. | |
| ## Feature Groups | |
| | Group | Features | Semantics | | |
| |---|---|---| | |
| | Volume & timing | bytes sent/received, duration, packet count, packet variance, inter-arrival mean/jitter | throughput shape and temporal burstiness | | |
| | Network metadata | src/dst ports, protocol, DNS query count, connection reuse | routing and communication pattern | | |
| | TLS / certificate | TLS version, JA3 cluster, chain length, cert validity, self-signed | encrypted-session trust indicators | | |
| | Behavioral context | geo distance, time of day, session history score, entropy score | reputation and anomaly context | | |
| ## Observation Interfaces | |
| - `evaluate_session(session_id)` returns the vector for a given session. | |
| - `state()` returns environment-level counters and selected session IDs. | |
| - `step_single(action)` returns `observation` for the next queued session. | |
| ## Normalization Strategy | |
| - Each raw feature is min-max normalized using bounded ranges in `FEATURE_BOUNDS`. | |
| - Outliers are clipped to `[0,1]` after normalization. | |
| - This enables stable neural training across heterogeneous scales (ports, durations, entropy). | |
| ## Markov Context Notes | |
| - Single-session mode is designed for fixed-shape RL loops. | |
| - Multi-session mode supports tool-driven decision systems over dynamic queues. | |