Spaces:

RFTSystems
/

START_HERE__Agent_Forensics_Suite

Running

App Files Files Community

RFTSystems commited on Jan 11

Commit

d6d06cf

verified ·

1 Parent(s): 9f707ae

Update app.py

Browse files

Files changed (1) hide show

app.py +21 -17

app.py CHANGED Viewed

@@ -1,6 +1,11 @@
 import gradio as gr
 SUITE = [
     (
         "ReplayProof Agent POV Verified Replay",
         "https://huggingface.co/spaces/RFTSystems/ReplayProof__Agent_POV__Verified_Replay",
@@ -37,10 +42,10 @@ WHY = (
     "AI is being shipped into real systems faster than teams can reliably reproduce or explain agent behaviour. "
     "When an agent fails, too many postmortems still rely on screenshots, partial logs, and opinions — not evidence.\n\n"
     "The operational risk is not only that an agent does the wrong thing. The deeper risk is that **nobody can prove what happened**: "
-    "what the agent saw, what it called, what it wrote, and where the run diverged. When failures are unreproducible, accountability collapses.\n\n"
-    "RFTSystems exists to make agent behaviour **inspectable and independently verifiable**. This suite produces evidence bundles you can share and validate: "
-    "hash-chained timelines, tamper-evident receipts, deterministic replays, and first-divergence diffs. You don’t need to trust the author — you can verify the evidence.\n\n"
-    "I can’t promise “AI will never take over.” No one can. What I *can* promise is this: **with chain-of-custody logs and receipts, we can prove what happened and who is responsible.**"
 )
 WHY_VERIFICATION_DOC = (
@@ -52,8 +57,6 @@ WHY_VERIFICATION_DOC = (
     "- “We changed a few things and it seems better now.”\n"
     "- “Trust us.”\n\n"
     "That is not engineering. That is damage control.\n\n"
-    "Recent “nearly-Grok” style incidents are the warning flare: capabilities shipped fast, edge cases exploited, then a scramble to patch. "
-    "And once it’s patched, the public can’t prove what was true yesterday. That’s the accountability gap.\n\n"
     "## What must be provable (every time)\n\n"
     "If you’re shipping agents that browse, call tools, write files, automate actions, or influence real users, you need to be able to prove:\n\n"
     "1) **WHEN** it happened (a verifiable timeline)\n"
@@ -63,10 +66,11 @@ WHY_VERIFICATION_DOC = (
     "If you cannot answer those with evidence, you do not have a safe system — you have a black box.\n\n"
     "## Why this collection exists\n\n"
     "This suite exists to end the “unanswered for” failure mode.\n\n"
-    "It turns agent runs into **evidence you can verify independently**:\n\n"
     "- deterministic replays (so anyone can reproduce behaviour)\n"
     "- chain-of-custody logging (so the record can’t be quietly rewritten)\n"
-    "- tamper-evident receipts (so integrity can be proven)\n"
     "- first-divergence diffs (so you can pinpoint exactly where and why two runs split)\n"
     "- audit views (so governance becomes evidence-led, not opinion-led)\n\n"
     "### Bottom line\n\n"
@@ -101,21 +105,21 @@ Verification: Each record is timestamped through the Zenodo/DataCite registry an
 def _build_markdown() -> str:
     md = []
     md.append("# RFTSystems — Agent Forensics Suite")
-    md.append("**Evidence-first instrumentation for AI agents.**")
-    md.append("Audit, prove, replay, and diff agent runs — turning “trust me” into verification.")
     md.append("")
     md.append("## Why I built this")
     md.append(WHY)
     md.append("")
     md.append("## The workflow")
-    md.append("**learn → generate proof → record reality → seal it → diff it → audit it → benchmark it**")
     md.append("")
     md.append("### Quick start (60 seconds)")
-    md.append("1. Open **ReplayProof** and run a deterministic session.")
-    md.append("2. Export the run bundle (receipts + hashes).")
-    md.append("3. Upload the bundle to verify integrity, then share it — anyone can replay it.")
     md.append("")
-    md.append("### Full pipeline (real systems)")
     md.append("1. **Record reality** (Agent Flight Recorder).")
     md.append("2. **Seal it** into receipts (RFT Memory Receipt Engine).")
     md.append("3. **Diff** two runs and find first divergence (TimelineDiff).")
@@ -128,7 +132,7 @@ def _build_markdown() -> str:
     md.append("")
     md.append("## Design principle")
     md.append(
-        "We don’t ‘imprison’ agents. We measure drift from declared intent and produce evidence. "
         "Enforcement remains an operator decision; this suite is the instrumentation layer."
     )
     md.append("")
@@ -156,4 +160,4 @@ with gr.Blocks(title="RFTSystems — Agent Forensics Suite") as demo:
     with gr.Accordion("Licence / Rights Notice (click to expand)", open=False):
         gr.Markdown(LICENSE_NOTICE)
-demo.launch()

 import gradio as gr
 SUITE = [
+    (
+        "AuditPlane — LLM Decision Proofs",
+        "https://huggingface.co/spaces/RFTSystems/AuditPlane__LLM_Decision_Proofs",
+        "Signed verification plane: Ed25519-signed decision receipts + hash-chained runs + replay + drift diffs + Merkle proofs.",
+    ),
     (
         "ReplayProof Agent POV Verified Replay",
         "https://huggingface.co/spaces/RFTSystems/ReplayProof__Agent_POV__Verified_Replay",
     "AI is being shipped into real systems faster than teams can reliably reproduce or explain agent behaviour. "
     "When an agent fails, too many postmortems still rely on screenshots, partial logs, and opinions — not evidence.\n\n"
     "The operational risk is not only that an agent does the wrong thing. The deeper risk is that **nobody can prove what happened**: "
+    "what the system saw, what it decided, what it called, what it wrote, and where the run diverged. When failures are unreproducible, accountability collapses.\n\n"
+    "RFTSystems exists to make behaviour **inspectable and independently verifiable**. This suite produces evidence bundles you can share and validate: "
+    "Ed25519-signed receipts, hash-chained timelines, deterministic replays, Merkle proofs, and first-divergence diffs. You don’t need to trust the author — you can verify the evidence.\n\n"
+    "I can’t promise “AI will never take over.” No one can. What I *can* promise is this: **with chain-of-custody logs and signed receipts, we can prove what happened and who is responsible.**"
 )
 WHY_VERIFICATION_DOC = (
     "- “We changed a few things and it seems better now.”\n"
     "- “Trust us.”\n\n"
     "That is not engineering. That is damage control.\n\n"
     "## What must be provable (every time)\n\n"
     "If you’re shipping agents that browse, call tools, write files, automate actions, or influence real users, you need to be able to prove:\n\n"
     "1) **WHEN** it happened (a verifiable timeline)\n"
     "If you cannot answer those with evidence, you do not have a safe system — you have a black box.\n\n"
     "## Why this collection exists\n\n"
     "This suite exists to end the “unanswered for” failure mode.\n\n"
+    "It turns runs into **evidence you can verify independently**:\n\n"
+    "- Ed25519-signed receipts (so outputs are attestations, not vibes)\n"
+    "- Merkle proofs (so you can verify inclusion without shipping everything)\n"
     "- deterministic replays (so anyone can reproduce behaviour)\n"
     "- chain-of-custody logging (so the record can’t be quietly rewritten)\n"
     "- first-divergence diffs (so you can pinpoint exactly where and why two runs split)\n"
     "- audit views (so governance becomes evidence-led, not opinion-led)\n\n"
     "### Bottom line\n\n"
 def _build_markdown() -> str:
     md = []
     md.append("# RFTSystems — Agent Forensics Suite")
+    md.append("**Evidence-first instrumentation for AI agents and safety decisions.**")
+    md.append("Audit, prove, replay, and diff runs — turning “trust me” into verification.")
     md.append("")
     md.append("## Why I built this")
     md.append(WHY)
     md.append("")
     md.append("## The workflow")
+    md.append("**learn → generate proof → record reality → seal it → replay → diff → audit → benchmark**")
     md.append("")
     md.append("### Quick start (60 seconds)")
+    md.append("1. Open **AuditPlane** and generate a baseline suite.")
+    md.append("2. Replay the same suite and confirm drift diffs (should be 0 if unchanged).")
+    md.append("3. Export the offline bundle — anyone can verify receipts and Merkle proofs.")
     md.append("")
+    md.append("### Agent pipeline (real systems)")
     md.append("1. **Record reality** (Agent Flight Recorder).")
     md.append("2. **Seal it** into receipts (RFT Memory Receipt Engine).")
     md.append("3. **Diff** two runs and find first divergence (TimelineDiff).")
     md.append("")
     md.append("## Design principle")
     md.append(
+        "We don’t ‘hand-wave’ agent safety. We measure drift from declared intent and produce evidence. "
         "Enforcement remains an operator decision; this suite is the instrumentation layer."
     )
     md.append("")
     with gr.Accordion("Licence / Rights Notice (click to expand)", open=False):
         gr.Markdown(LICENSE_NOTICE)
+demo.launch()