TABLE 1 // ROW A // TRUST ACCURACY

{trustAcc}%

Trust Accuracy

Correct trust assignment rate against ground-truth agent labels across all evaluation episodes.

BASELINE: {baselineTrust}% SENTINEL: {trustAcc}%

TABLE 1 // ROW B // ADV DETECTION

{detectRate}%

Adversarial Detection Rate

Precision-recall F1 on Byzantine agent identification. False positive rate held below 5% threshold.

BASELINE: {baselineDetect}% SENTINEL: {detectRate}%

TABLE 2 // ROW C // POLICY GAIN

+{improvement}%

Policy Improvement

Cumulative episode return gain over heuristic baseline after convergence.

HEURISTIC TRAINED RL

TABLE 2 // ROW D // FINAL SCORE

{avgScore}

Average Score

Mean normalized score across all tasks. Higher is better (range 0–1, boundary exclusive).

RANDOM: {random ? random.avg_score.toFixed(2) : "0.28"} SENTINEL: {avgScore}

); }