ShadowCore-v1 / README.md
brscftc's picture
Update README.md
b94a7a1 verified
---
license: other
license_name: brsx-open-license
license_link: https://brsxlabs.gt.tc/brsxlicense.html
tags:
- cybersecurity
- hybrid
pipeline_tag: token-classification
---
# ShadowCore-v1
**Lightweight Network Behavior Classification Model**
ShadowCore-v1 is a compact sequence classification model designed to analyze short network-behavior streams and classify overall system state in real time.
The model operates on a minimal four-token vocabulary representing abstract network activity and can detect healthy, degraded, and potentially dangerous traffic patterns with millisecond-scale inference latency on modern hardware.
---
## Overview
ShadowCore-v1 was created to explore whether a small specialized model can perform behavior-level network analysis without requiring massive LLM-scale architectures.
Instead of processing raw packets, logs, or protocol metadata, ShadowCore-v1 analyzes compressed symbolic sequences that represent network behavior over time.
The model is intended for:
* Network health monitoring
* Congestion detection
* Anomaly detection
* Traffic pattern analysis
* Lightweight edge deployment
* Real-time alerting systems
---
## Vocabulary
ShadowCore-v1 uses a fixed vocabulary of only four tokens:
| Token | Meaning |
| ----- | ----------------------------------- |
| U | Upload activity |
| D | Download activity |
| + | High latency / congestion / waiting |
| - | Processing completed / idle time |
Example:
```text
UU--DD--UU--DD--
```
Interpretation:
```text
Request
Processing
Response
Idle
```
This pattern generally represents healthy behavior.
---
## Classification Labels
### NORMAL
Healthy system state.
Characteristics:
* Upload and download remain balanced
* Few latency spikes
* Stable processing flow
* Idle periods present
Example:
```text
UDUDUDUDUDUDUDUD
```
---
### CRITICAL
System degradation.
Characteristics:
* Upload activity begins exceeding download activity
* Latency clusters appear
* Processing flow becomes unstable
* Queue buildup starts forming
Example:
```text
UUUU++++DDUUUU++++DD
```
---
### DANGER
Potential failure or attack condition.
Characteristics:
* Upload activity dominates
* Download activity becomes rare
* Large latency clusters
* Severe congestion
Example:
```text
UUUUUU++++++++UUUUUU++++++++
```
---
## Input Format
Input length:
```text
64 tokens
```
Example:
```text
UUUU++++DDUUUU++++DDUUUU++++DDUUUU++++DDUUUU++++DDUUUU++++DD
```
Output:
```text
NORMAL
CRITICAL
DANGER
```
---
## Architecture
ShadowCore-v1 is built on the same core architecture family used in previous successful experiments including:
* GenoLite
* IsingBreaker
* ShadowCore
Key design goals:
* Small parameter count
* Fast training
* Fast inference
* Low memory usage
* Strong pattern recognition on symbolic sequences
Model size:
```text
~88 Million Parameters
```
---
## Dataset
Training data was generated using a rule-based synthetic behavior generator.
Dataset characteristics:
```text
4,500 samples
1,500 NORMAL
1,500 CRITICAL
1,500 DANGER
```
Features:
* Fixed-length sequences
* Duplicate filtering
* Motif composition
* Cluster variation
* Sequence rotation
* Behavioral balancing
The generator was designed to teach behavior patterns rather than memorization of exact sequences.
---
## Benchmark Results
Evaluation Accuracy:
```text
94.07%
```
The model consistently identifies:
* Healthy traffic patterns
* Growing congestion states
* Severe overload conditions
Testing also demonstrated reasonable behavior on ambiguous boundary cases, where the model produces mixed confidence instead of collapsing into a single class prediction.
---
## Performance
Training Environment:
```text
NVIDIA T4
Batch Size: 64
Epochs: 5
```
Training Time:
```text
~4.5 minutes
```
Inference:
```text
Millisecond-scale
```
on modern GPUs and suitable for real-time monitoring pipelines.
---
## Example Predictions
Input:
```text
UDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUDUD
```
Prediction:
```text
NORMAL
```
---
Input:
```text
UUUU++++DDUUUU++++DDUUUU++++DDUUUU++++DD
```
Prediction:
```text
CRITICAL
```
---
Input:
```text
UUUUUU++++++++UUUUUU++++++++UUUUUU++++++++
```
Prediction:
```text
DANGER
```
---
## Limitations
ShadowCore-v1 was trained on synthetic data.
While the model successfully learns network-behavior concepts, production deployment should include:
* Real traffic validation
* Domain-specific calibration
* Additional anomaly classes
* Real-world benchmark datasets
---
## Future Work
Planned improvements:
* ShadowCore-v2
* Larger motif library
* Real traffic traces
* Multi-stage anomaly classification
* Attack family detection
* Early-warning forecasting
---
## License
Research & Experimental Use
---
**ShadowCore-v1 demonstrates that lightweight specialized models can achieve >90% accuracy on behavior-oriented sequence classification tasks without requiring large-scale foundation models.** 🚀