meta_ai_hackathon / docs /STATE_SPACE.md
GOOD CAT
Final submission prep
ec8c511

State Space

The environment uses a 22-dimensional normalized observation vector ([0,1] per feature). Order is fixed by FEATURE_ORDER in traffic_generator.py.

Feature Groups

Group Features Semantics
Volume & timing bytes sent/received, duration, packet count, packet variance, inter-arrival mean/jitter throughput shape and temporal burstiness
Network metadata src/dst ports, protocol, DNS query count, connection reuse routing and communication pattern
TLS / certificate TLS version, JA3 cluster, chain length, cert validity, self-signed encrypted-session trust indicators
Behavioral context geo distance, time of day, session history score, entropy score reputation and anomaly context

Observation Interfaces

  • evaluate_session(session_id) returns the vector for a given session.
  • state() returns environment-level counters and selected session IDs.
  • step_single(action) returns observation for the next queued session.

Normalization Strategy

  • Each raw feature is min-max normalized using bounded ranges in FEATURE_BOUNDS.
  • Outliers are clipped to [0,1] after normalization.
  • This enables stable neural training across heterogeneous scales (ports, durations, entropy).

Markov Context Notes

  • Single-session mode is designed for fixed-shape RL loops.
  • Multi-session mode supports tool-driven decision systems over dynamic queues.