c2sentinel / API_REFERENCE.md

danielostrow

Document machine-readable output fields for scripting (to_json, to_ioc_format, suspicious_connections, iocs)

3626226 verified 2 days ago

preview code

raw

history blame contribute delete

20.6 kB

C2Sentinel API Reference

Complete technical documentation for the C2Sentinel Python API.

Author: Daniel Ostrow Website: neuralintellect.com

C2Sentinel Class
AnalysisResult Class
ConnectionContext Class
ReconSupport Class
FeatureExtractor Class
LogParser Class
Enums and Constants

C2Sentinel Class

Main interface for C2 detection.

Constructor

C2Sentinel(model: LogBERTC2Sentinel, config: C2SentinelConfig, device: str = 'auto')

Parameter	Type	Description
`model`	LogBERTC2Sentinel	The neural network model
`config`	C2SentinelConfig	Model configuration
`device`	str	Device for inference ('auto', 'cpu', 'cuda')

Class Methods

load

@classmethod
def load(cls, path: str, device: str = 'auto') -> 'C2Sentinel'

Load a pre-trained model from safetensors format.

Parameter	Type	Description
`path`	str	Path to model files (without extension)
`device`	str	Device for inference

Returns: C2Sentinel instance

Example:

sentinel = C2Sentinel.load('c2_sentinel')
sentinel = C2Sentinel.load('/path/to/c2_sentinel', device='cuda')

create_new

@classmethod
def create_new(cls, device: str = 'auto') -> 'C2Sentinel'

Create a new untrained model instance.

Returns: C2Sentinel instance with random weights

Instance Methods

analyze

def analyze(
    self,
    connections: List[Dict],
    threshold: float = 0.5,
    context: Optional[ConnectionContext] = None,
    include_features: bool = False,
    strict_mode: bool = False
) -> AnalysisResult

Analyze a list of connections for C2 activity.

Parameter	Type	Default	Description
`connections`	List[Dict]	required	List of connection records
`threshold`	float	0.5	Detection threshold (0.0-1.0)
`context`	ConnectionContext	None	Optional context for enrichment
`include_features`	bool	False	Include raw feature vector in result
`strict_mode`	bool	False	Enforce minimum 0.7 threshold

Returns: AnalysisResult object

Connection Record Fields:

{
    'timestamp': float,      # Required: Unix timestamp
    'dst_ip': str,           # Required: Destination IP
    'dst_port': int,         # Required: Destination port
    'bytes_sent': int,       # Required: Bytes sent
    'bytes_recv': int,       # Required: Bytes received
    'src_ip': str,           # Optional: Source IP
    'src_port': int,         # Optional: Source port
    'protocol': str,         # Optional: 'tcp' or 'udp'
    'duration': float        # Optional: Duration in seconds
}

Example:

connections = [
    {'timestamp': 1000, 'dst_ip': '10.0.0.1', 'dst_port': 443,
     'bytes_sent': 200, 'bytes_recv': 500},
    {'timestamp': 1060, 'dst_ip': '10.0.0.1', 'dst_port': 443,
     'bytes_sent': 200, 'bytes_recv': 500},
]

result = sentinel.analyze(connections)
result = sentinel.analyze(connections, threshold=0.7, strict_mode=True)

analyze_batch

def analyze_batch(
    self,
    connection_groups: List[List[Dict]],
    threshold: float = 0.5,
    contexts: Optional[List[ConnectionContext]] = None,
    parallel: bool = True
) -> List[AnalysisResult]

Analyze multiple connection groups.

Parameter	Type	Default	Description
`connection_groups`	List[List[Dict]]	required	List of connection lists
`threshold`	float	0.5	Detection threshold
`contexts`	List[ConnectionContext]	None	Context for each group
`parallel`	bool	True	Enable parallel processing

Returns: List of AnalysisResult objects

Example:

groups = [
    [conn1, conn2, conn3],
    [conn4, conn5, conn6],
]
results = sentinel.analyze_batch(groups)

analyze_logs

def analyze_logs(
    self,
    log_lines: List[str],
    group_by_dst: bool = True,
    threshold: float = 0.5
) -> List[Dict]

Parse and analyze raw log lines.

Parameter	Type	Default	Description
`log_lines`	List[str]	required	Raw log lines
`group_by_dst`	bool	True	Group connections by destination IP
`threshold`	float	0.5	Detection threshold

Returns: List of result dictionaries, sorted by probability (descending)

Supported Formats:

JSON logs with standard fields
Zeek/Bro conn.log (tab-separated)
Syslog with IP:port patterns

Example:

with open('conn.log') as f:
    lines = f.readlines()

results = sentinel.analyze_logs(lines, group_by_dst=True)
for r in results:
    print(f"{r['dst_ip']}: {r['c2_probability']}")

add_whitelist

def add_whitelist(
    self,
    ips: List[str] = None,
    domains: List[str] = None
)

Add IPs or domains to the whitelist. Whitelisted destinations receive reduced C2 probability.

Parameter	Type	Description
`ips`	List[str]	IP addresses to whitelist
`domains`	List[str]	Domain names to whitelist

Example:

sentinel.add_whitelist(
    ips=['8.8.8.8', '1.1.1.1'],
    domains=['google.com', 'github.com']
)

add_blacklist

def add_blacklist(
    self,
    ips: List[str] = None,
    domains: List[str] = None
)

Add IPs or domains to the blacklist. Blacklisted destinations receive increased C2 probability.

Parameter	Type	Description
`ips`	List[str]	IP addresses to blacklist
`domains`	List[str]	Domain names to blacklist

save

def save(self, path: str)

Save model to safetensors format.

Parameter	Type	Description
`path`	str	Output path (without extension)

Creates two files:

{path}.safetensors - Model weights
{path}.json - Configuration

Instance Attributes

Attribute	Type	Description
`model`	LogBERTC2Sentinel	The neural network
`config`	C2SentinelConfig	Model configuration
`device`	torch.device	Inference device
`feature_extractor`	FeatureExtractor	Feature extraction module
`log_parser`	LogParser	Log parsing module
`context_engine`	ContextInference	Context inference module
`recon`	ReconSupport	Reconnaissance module

AnalysisResult Class

Dataclass containing analysis results.

Attributes

Attribute	Type	Description
`is_c2`	bool	True if C2 detected
`c2_probability`	float	Probability score (0.0-1.0)
`anomaly_score`	float	Anomaly detection score
`evasion_score`	float	Evasion technique detection score
`confidence`	float	Model confidence in prediction
`c2_type`	str	Detected C2 framework type
`c2_type_confidence`	float	Confidence in C2 type classification
`detection_method`	str	Detection method used
`immediate_detection`	bool	True if signature-based detection
`context_applied`	bool	True if context was applied
`original_probability`	float	Probability before context adjustment
`probability_modifier`	float	Context probability modifier
`matched_legitimate_pattern`	str	Name of matched legitimate pattern
`legitimate_confidence`	float	Confidence in legitimate pattern match
`risk_factors`	List[str]	Factors supporting C2 classification
`mitigating_factors`	List[str]	Factors against C2 classification
`service_type`	str	Detected service type
`recommendations`	List[str]	Suggested follow-up actions
`features`	List[float]	Raw 40-dimensional feature vector
`connections_analyzed`	int	Number of connections processed
`suspicious_connections`	List[Dict]	All connections with individual scores (if C2 detected)
`iocs`	Dict	Extracted IOCs for threat intel (if C2 detected)
`time_range`	Dict	Start, end, and duration of analyzed traffic
`destination_summary`	Dict	Destination IPs, ports, and byte totals

Machine-Readable Output Fields

When C2 is detected, these fields are populated for scripting and automation:

suspicious_connections - List of all connections with scores:

[
    {
        'index': 0,
        'timestamp': 1705600000,
        'src_ip': '192.168.1.100',
        'src_port': 52341,
        'dst_ip': '45.33.32.156',
        'dst_port': 443,
        'bytes_sent': 200,
        'bytes_recv': 500,
        'score': 0.92
    },
    ...
]

iocs - Indicators of Compromise for threat intel:

{
    'ip_addresses': ['45.33.32.156'],
    'ports': [443],
    'c2_type': 'cobalt_strike',
    'timing_signature': {
        'mean_interval': 60.0,
        'interval_cv': 0.05
    },
    'size_signature': {
        'mean_bytes_sent': 200.0,
        'mean_bytes_recv': 500.0,
        'sent_cv': 0.02,
        'recv_cv': 0.03
    },
    'behavioral_indicators': ['Regular timing with consistent sizes', ...]
}

time_range - Temporal bounds of analyzed traffic:

{
    'start': 1705600000.0,
    'end': 1705600420.0,
    'duration': 420.0
}

destination_summary - Traffic summary:

{
    'unique_ips': ['45.33.32.156'],
    'unique_ports': [443],
    'destinations': {'45.33.32.156:443': 8},
    'total_bytes_sent': 1600,
    'total_bytes_recv': 4000
}

Methods

to_dict

def to_dict(self) -> Dict[str, Any]

Convert result to dictionary.

Returns: Dictionary representation of all attributes

to_json

def to_json(self, indent: int = 2) -> str

Convert result to JSON string for scripting.

Parameter	Type	Default	Description
`indent`	int	2	JSON indentation level

Returns: JSON string of all attributes

Example:

result = sentinel.analyze(connections)
json_output = result.to_json()

# Write to file
with open('detection_result.json', 'w') as f:
    f.write(result.to_json())

# Parse in pipeline
import json
data = json.loads(result.to_json())

to_ioc_format

def to_ioc_format(self) -> Dict[str, Any]

Convert result to STIX-like format for threat intelligence platforms.

Returns:

{
    'type': 'indicator',
    'spec_version': '2.1',
    'pattern_type': 'c2-beacon',
    'valid_from': timestamp,
    'labels': ['malicious-activity', 'c2'],
    'confidence': 92,
    'indicators': { ... }  # Same as iocs field
}

Example:

result = sentinel.analyze(connections)
if result.is_c2:
    stix_indicator = result.to_ioc_format()
    # Send to threat intel platform
    send_to_misp(stix_indicator)

ConnectionContext Class

Dataclass for providing additional context to improve detection accuracy.

Constructor

ConnectionContext(
    # Process information
    process_name: Optional[str] = None,
    process_path: Optional[str] = None,
    process_pid: Optional[int] = None,
    parent_process: Optional[str] = None,
    command_line: Optional[str] = None,

    # Network metadata
    dns_queries: Optional[List[str]] = None,
    resolved_hostname: Optional[str] = None,
    tls_sni: Optional[str] = None,
    tls_ja3: Optional[str] = None,
    tls_ja3s: Optional[str] = None,
    certificate_issuer: Optional[str] = None,
    certificate_subject: Optional[str] = None,
    certificate_valid: Optional[bool] = None,
    http_user_agent: Optional[str] = None,
    http_host: Optional[str] = None,

    # Reputation
    ip_reputation: Optional[float] = None,
    domain_reputation: Optional[float] = None,
    known_good: Optional[bool] = None,
    known_bad: Optional[bool] = None,
    threat_intel_match: Optional[str] = None,

    # Host context
    source_hostname: Optional[str] = None,
    source_user: Optional[str] = None,
    source_is_server: Optional[bool] = None,
    source_is_workstation: Optional[bool] = None,

    # Additional
    geo_country: Optional[str] = None,
    geo_asn: Optional[str] = None,
    tags: Optional[List[str]] = None
)

Attribute Details

Attribute	Type	Effect on Analysis
`process_name`	str	Known processes reduce probability
`known_good`	bool	True reduces probability by 90%
`known_bad`	bool	True increases probability by 5x
`ip_reputation`	float	Score > 0.8 reduces probability
`threat_intel_match`	str	Match increases probability by 5x
`tls_ja3`	str	Known C2 JA3 increases probability
`certificate_valid`	bool	False increases probability

Methods

to_dict

def to_dict(self) -> Dict[str, Any]

Convert to dictionary, excluding None values.

ReconSupport Class

Reconnaissance and enrichment utilities.

Class Methods

analyze_ip

@classmethod
def analyze_ip(cls, ip: str) -> Dict[str, Any]

Analyze an IP address.

Parameter	Type	Description
`ip`	str	IP address to analyze

Returns:

{
    'ip': str,              # Original IP
    'is_valid': bool,       # Valid IP format
    'is_private': bool,     # RFC 1918 private range
    'is_loopback': bool,    # Loopback address
    'is_multicast': bool,   # Multicast address
    'is_cdn': bool,         # Known CDN range
    'cdn_provider': str,    # CDN name if applicable
    'ip_version': int,      # 4 or 6
    'reverse_dns': str,     # Reverse DNS lookup result
    'numeric': int          # Numeric representation
}

Known CDN Ranges:

Cloudflare
AWS
Google Cloud
Azure
Akamai

analyze_connection_patterns

@classmethod
def analyze_connection_patterns(cls, connections: List[Dict]) -> Dict[str, Any]

Analyze connection patterns for threat hunting.

Parameter	Type	Description
`connections`	List[Dict]	Connection records

Returns:

{
    'connection_count': int,
    'unique_destinations': int,
    'unique_ports': int,

    'timing': {
        'duration_seconds': float,
        'mean_interval': float,
        'interval_stddev': float,
        'interval_cv': float       # Coefficient of variation
    },

    'volume': {
        'total_sent': int,
        'total_recv': int,
        'mean_sent': float,
        'mean_recv': float,
        'sent_recv_ratio': float
    },

    'ports': {
        port_number: count,        # Port distribution
        ...
    },

    'destinations': {
        ip: analyze_ip_result,     # Per-IP analysis
        ...
    },

    'indicators': {
        'single_destination': bool,
        'consistent_timing': bool,
        'consistent_sizes': bool,
        'uses_common_port': bool,
        'uses_high_port': bool,
        'has_cdn_destination': bool,
        'all_private_destinations': bool
    }
}

generate_iocs

@classmethod
def generate_iocs(
    cls,
    connections: List[Dict],
    result: Dict
) -> Dict[str, List[str]]

Generate Indicators of Compromise from detected C2.

Parameter	Type	Description
`connections`	List[Dict]	Connection records
`result`	Dict	Analysis result dictionary

Returns:

{
    'ips': List[str],                  # Destination IPs
    'ports': List[str],                # Destination ports
    'timing_signatures': List[str],    # Beacon timing patterns
    'behavioral_indicators': List[str] # Behavioral markers
}

Only generates IOCs if result['is_c2'] is True.

FeatureExtractor Class

Extracts 40-dimensional feature vectors from connections.

Constants

C2_TYPES

List of detectable C2 framework types:

[
    'unknown', 'metasploit', 'cobalt_strike', 'sliver', 'havoc',
    'mythic', 'poshc2', 'merlin', 'empire', 'covenant',
    'brute_ratel', 'koadic', 'pupy', 'silenttrinity', 'faction',
    'ibombshell', 'godoh', 'dnscat2', 'iodine', 'dns_generic',
    'http_custom', 'https_custom', 'websocket', 'domain_fronting',
    'cloud_fronting', 'cdn_abuse', 'apt_generic', 'apt28', 'apt29',
    'apt41', 'lazarus', 'fin7', 'turla', 'winnti', 'custom'
]

Methods

extract_features

def extract_features(self, connections: List[Dict]) -> np.ndarray

Extract 40-dimensional feature vector.

Returns: numpy array of shape (40,)

Feature Groups:

Features 0-9: Timing (intervals, jitter, regularity, periodicity)
Features 10-17: Destinations (diversity, persistence, ports)
Features 18-27: Payload (sizes, ratios, consistency)
Features 28-35: Evasion (jitter patterns, bursts, session length)
Features 36-39: Advanced (night activity, fast beacon ratio, duration)

check_metasploit_signature

def check_metasploit_signature(
    self,
    connections: List[Dict]
) -> Tuple[bool, float]

Check for Metasploit-specific signature patterns.

Returns: (is_metasploit, confidence)

check_ssh_keepalive

def check_ssh_keepalive(
    self,
    connections: List[Dict]
) -> Tuple[bool, float]

Check for SSH keepalive pattern.

Criteria:

Port 22
Small packets (< 100 bytes)
Symmetric traffic (sent/recv ratio 0.5-2.0)
Consistent sizes (CV < 0.2)
Regular intervals matching common keepalive values

Returns: (is_ssh_keepalive, confidence)

LogParser Class

Parses various log formats into connection records.

Static Methods

parse_json

@staticmethod
def parse_json(log_line: str) -> Optional[Dict]

Parse JSON formatted log line.

Recognized Fields:

timestamp, @timestamp
src_ip, source_ip, src
dst_ip, dest_ip, dst
src_port, source_port
dst_port, dest_port
bytes_sent, bytes_out
bytes_recv, bytes_in

parse_zeek_conn

@staticmethod
def parse_zeek_conn(log_line: str) -> Optional[Dict]

Parse Zeek/Bro conn.log format (tab-separated).

parse_syslog

@staticmethod
def parse_syslog(log_line: str) -> Optional[Dict]

Parse common syslog/netflow patterns.

Recognized Patterns:

YYYY-MM-DD HH:MM:SS ... IP:port -> IP:port
src=IP ... dst=IP ... sport=port ... dport=port

Enums and Constants

DetectionMethod

class DetectionMethod(Enum):
    SIGNATURE = "signature"    # Port + behavior signature match
    BEHAVIORAL = "behavioral"  # Pure behavioral analysis
    ML = "ml"                  # Machine learning inference
    CONTEXT = "context"        # Context-adjusted detection
    HEURISTIC = "heuristic"    # Rule-based detection
    WHITELIST = "whitelist"    # Matched whitelist pattern

ServiceType

class ServiceType(Enum):
    SSH = "ssh"
    HTTP = "http"
    HTTPS = "https"
    DNS = "dns"
    DATABASE = "database"
    API = "api"
    STREAMING = "streaming"
    GAMING = "gaming"
    VPN = "vpn"
    MONITORING = "monitoring"
    UNKNOWN = "unknown"

C2_INDICATOR_PORTS

High-confidence C2 signature ports:

{4444, 4445, 5555, 31337, 40056}

C2_COMMON_PORTS

Ports commonly used by C2 (require behavioral analysis):

{80, 443, 53, 8080, 8443, 8888}

Convenience Functions

load_model

def load_model(path: str, device: str = 'auto') -> C2Sentinel

Shorthand for C2Sentinel.load().

create_model

def create_model(device: str = 'auto') -> C2Sentinel

Shorthand for C2Sentinel.create_new().

quick_analyze

def quick_analyze(
    connections: List[Dict],
    model_path: str = 'c2_sentinel'
) -> AnalysisResult

One-shot analysis without keeping model in memory.

Error Handling

The API uses standard Python exceptions:

Exception	Cause
`FileNotFoundError`	Model files not found
`ValueError`	Invalid connection format
`RuntimeError`	CUDA/device errors

All methods handle empty or malformed input gracefully, returning neutral results rather than raising exceptions.

C2Sentinel API Reference

Table of Contents

C2Sentinel Class

Constructor

Class Methods

load

create_new

Instance Methods

analyze

analyze_batch

analyze_logs

add_whitelist

add_blacklist

save

Instance Attributes

AnalysisResult Class

Attributes

Machine-Readable Output Fields

Methods

to_dict

to_json

to_ioc_format

ConnectionContext Class

Constructor

Attribute Details

Methods

to_dict

ReconSupport Class

Class Methods

analyze_ip

analyze_connection_patterns

generate_iocs

FeatureExtractor Class

Constants

C2_TYPES

Methods

extract_features

check_metasploit_signature

check_ssh_keepalive

LogParser Class

Static Methods

parse_json

parse_zeek_conn

parse_syslog

Enums and Constants

DetectionMethod

ServiceType

C2_INDICATOR_PORTS

C2_COMMON_PORTS

Convenience Functions

load_model

create_model

quick_analyze

Error Handling