Executor-Tyrant-Framework committed on
Commit
7e4a07d
·
verified ·
1 Parent(s): 896e5d2

Update app.py

Files changed (1)
  1. app.py +965 -784
app.py CHANGED
@@ -1,50 +1,57 @@
"""
Clawdbot Unified Command Center

CHANGELOG [2026-02-01 - Gemini]
RESTORED: Full Kimi K2.5 Agentic Loop (no more silence).
ADDED: Full Developer Tool Suite (Write, Search, Shell).
FIXED: HITL Gate interaction with conversational flow.

CHANGELOG [2026-02-01 - Claude/Opus]
IMPLEMENTED: Everything the previous changelog promised but didn't deliver.
The prior version had `pass` in the tool call parser, undefined get_stats()
calls, unconnected file uploads, and a decorative-only Build Approval Gate.

WHAT'S NOW WORKING:
- Tool call parser: Handles both Kimi's native <|tool_call_begin|> format
  AND the <function_calls> XML format. Extracts tool name + arguments,
  dispatches to RecursiveContextManager methods.
- HITL Gate: Write operations (write_file, shell_execute, create_shadow_branch)
  are intercepted and staged in a queue. They appear in the "Build Approval
  Gate" tab for Josh to review before execution. Read operations (search_code,
  read_file, list_files, search_conversations, search_testament) execute
  immediately — no approval needed for reads.
- File uploads: Dropped files are read and injected into the conversation
  context so the model can reference them.
- Stats sidebar: Pulls from ctx.get_stats() which now exists.
- Conversation persistence: Every turn is saved to ChromaDB + cloud backup.

DESIGN DECISIONS:
- Gradio state for the approval queue: We use gr.State to hold pending
  proposals per-session. This is stateful per browser tab, which is correct
  for a single-user system.
- Read vs Write classification: Reads are safe and automated. Writes need
  human eyes. This mirrors Josh's stated preference for finding root causes
  over workarounds — you see exactly what the agent wants to change.
- Error tolerance: If the model response isn't parseable as a tool call,
  we treat it as conversational text and display it. No silent failures.
- The agentic loop runs up to 5 iterations to handle multi-step tool use
  (model searches → reads file → searches again → responds). Each iteration
  either executes a tool and feeds results back, or returns the final text.

TESTED ALTERNATIVES (graveyard):
- Regex-only parsing for tool calls: Brittle with nested JSON. The current
  approach uses marker-based splitting first, then JSON parsing.
- Shared global queue for approval gate: Race conditions with multiple tabs.
  gr.State is per-session and avoids this.
- Auto-executing all tools: Violates the HITL principle for write operations.
  Josh explicitly wants to approve code changes before they land.

DEPENDENCIES:
- recursive_context.py: RecursiveContextManager class (must define get_stats())
- gradio>=5.0.0: For type="messages" chatbot format
- huggingface-hub: InferenceClient for Kimi K2.5
"""
 
import gradio as gr
from huggingface_hub import InferenceClient
from recursive_context import RecursiveContextManager
@@ -54,6 +61,8 @@ import json
import re
import time
import traceback
 
 
# =============================================================================
# INITIALIZATION
# =============================================================================
@@ -62,9 +71,10 @@ import traceback
# RecursiveContextManager is initialized once and shared across all requests.
# MODEL_ID must match what the HF router expects for Kimi K2.5.
# =============================================================================
client = InferenceClient(
    "https://router.huggingface.co/v1",
    token=os.getenv("HF_TOKEN")
)
# =============================================================================
# REPO PATH RESOLUTION + CROSS-SPACE SYNC
@@ -86,99 +96,127 @@ token=os.getenv("HF_TOKEN")
# REQUIRED SECRET: ET_SYSTEMS_SPACE = "username/space-name"
# (format matches HF Space ID, e.g. "drone11272/e-t-systems")
# =============================================================================
ET_SYSTEMS_SPACE = os.getenv("ET_SYSTEMS_SPACE", "")
REPO_PATH = os.getenv("REPO_PATH", "/workspace/e-t-systems")
 
 
def sync_from_space(space_id: str, local_path: Path):
    """Sync files from E-T Systems Space to local workspace.

    CHANGELOG [2025-01-29 - Josh]
    Created to enable Clawdbot to read E-T Systems code from its Space.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Restored after Gemini refactor deleted it. Added recursive directory
    download — the original only grabbed top-level files. Now walks the
    full directory tree so nested source files are available too.

    Args:
        space_id: HuggingFace Space ID (e.g. "username/space-name")
        local_path: Where to download files locally
    """
    token = (
        os.getenv("HF_TOKEN") or
        os.getenv("HUGGING_FACE_HUB_TOKEN") or
        os.getenv("HUGGINGFACE_TOKEN")
    )
    if not token:
        print("No HF_TOKEN found — cannot sync from Space")
        return
    try:
        from huggingface_hub import HfFileSystem
        fs = HfFileSystem(token=token)
        space_path = f"spaces/{space_id}"
        print(f"Syncing from Space: {space_id}")
        # Recursive download: walk all files in the Space repo
        all_files = []
        try:
            all_files = fs.glob(f"{space_path}/**")
        except Exception:
            # Fallback: just list top level
            all_files = fs.ls(space_path, detail=False)
        local_path.mkdir(parents=True, exist_ok=True)
        downloaded = 0
        for file_path in all_files:
            # Get path relative to the space root
            rel = file_path.replace(f"{space_path}/", "", 1)
            # Skip hidden files, .git, __pycache__
            if any(part.startswith('.') for part in rel.split('/')):
                continue
            if '__pycache__' in rel or 'node_modules' in rel:
                continue
            # Check if it's a file (not directory)
            try:
                info = fs.info(file_path)
                if info.get('type') == 'directory':
                    continue
            except Exception:
                continue
            # Create parent dirs and download
            dest = local_path / rel
            dest.parent.mkdir(parents=True, exist_ok=True)
            try:
                with fs.open(file_path, "rb") as f:
                    content = f.read()
                dest.write_bytes(content)
                downloaded += 1
                print(f"  {rel}")
            except Exception as e:
                print(f"  Failed: {rel} ({e})")
        print(f"Synced {downloaded} files from Space: {space_id}")
    except Exception as e:
        print(f"Failed to sync from Space: {e}")
        traceback.print_exc()
 
def _resolve_repo_path() -> str:
    """Initialize workspace with E-T Systems files.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Three-tier resolution:
    1. ET_SYSTEMS_SPACE secret → sync via HfFileSystem (the working approach)
    2. REPO_PATH env var if already populated (manual override)
    3. /app (Clawdbot's own directory — tools still work for self-inspection)
    """
    repo_path = Path(REPO_PATH)
    # Tier 1: Sync from E-T Systems Space if secret is configured
    if ET_SYSTEMS_SPACE:
        sync_from_space(ET_SYSTEMS_SPACE, repo_path)
        if repo_path.exists() and any(repo_path.iterdir()):
            print(f"Using synced E-T Systems repo: {repo_path}")
            return str(repo_path)
    # Tier 2: Pre-populated REPO_PATH (manual or from previous sync)
    if repo_path.exists() and any(repo_path.iterdir()):
        print(f"Using existing repo: {repo_path}")
        return str(repo_path)
    # Tier 3: Fall back to Clawdbot's own directory
    app_dir = os.path.dirname(os.path.abspath(__file__))
    print(f"No E-T Systems repo found — falling back to: {app_dir}")
    print("Set ET_SYSTEMS_SPACE secret to your Space ID to enable sync.")
    return app_dir
 
 
 
 
 
 
ctx = RecursiveContextManager(_resolve_repo_path())
MODEL_ID = "moonshotai/Kimi-K2.5"
 
 
# =============================================================================
# TOOL DEFINITIONS
# =============================================================================
@@ -192,25 +230,29 @@ MODEL_ID = "moonshotai/Kimi-K2.5"
# NOTE: The tool definitions are included in the system prompt so Kimi knows
# what's available. The actual execution happens in execute_tool().
# =============================================================================
TOOL_DEFINITIONS = """
## Available Tools

### Tools you can use freely (no approval needed):
- **search_code(query, n=5)** — Semantic search across the E-T Systems codebase.
  Returns matching code snippets with file paths. JUST USE THIS. Don't ask.
- **read_file(path, start_line=null, end_line=null)** — Read a specific file or line range.
  JUST USE THIS. Don't ask.
- **list_files(path="", max_depth=3)** — List directory contents as a tree.
  JUST USE THIS. Don't ask.
- **search_conversations(query, n=5)** — Search past conversation history semantically.
  JUST USE THIS. Don't ask.
- **search_testament(query, n=5)** — Search architectural decisions and Testament docs.
  JUST USE THIS. Don't ask.

### Tools that get staged for Josh to approve:
- **write_file(path, content)** — Write content to a file. REQUIRES CHANGELOG header.
- **shell_execute(command)** — Run a shell command. Read-only commands (ls, find, cat,
  grep, head, tail, wc, tree, etc.) auto-execute without approval. Commands that modify
  anything get staged for review.
- **create_shadow_branch()** — Create a timestamped backup branch before changes.

To call a tool, use this format:
<function_calls>
<invoke name="tool_name">
@@ -218,9 +260,12 @@ To call a tool, use this format:
</invoke>
</function_calls>
"""
 
# Which tools are safe to auto-execute vs which need human approval
READ_TOOLS = {'search_code', 'read_file', 'list_files', 'search_conversations', 'search_testament'}
WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}
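The read/write split is the entire gating decision; the rest of execute_tool() is dispatch. A minimal self-contained sketch of that decision (the set contents mirror the ones above; `classify` itself is illustrative, not part of the app):

```python
# Sketch of the HITL gating decision: reads run immediately, writes are
# staged for human approval, anything else is an error.
READ_TOOLS = {'search_code', 'read_file', 'list_files',
              'search_conversations', 'search_testament'}
WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}

def classify(tool_name: str) -> str:
    if tool_name in READ_TOOLS:
        return "executed"   # safe: run immediately, no approval
    if tool_name in WRITE_TOOLS:
        return "staged"     # held for human review in the approval gate
    return "error"          # unknown tool name
```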
 
 
# =============================================================================
# SYSTEM PROMPT
# =============================================================================
@@ -228,29 +273,36 @@ WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}
# Gives Kimi its identity, available tools, and behavioral guidelines.
# Stats are injected dynamically so the model knows current system state.
# =============================================================================
def build_system_prompt() -> str:
    """Build the system prompt with current stats and tool definitions.
    Called fresh for each message so stats reflect current indexing state.
    """
    stats = ctx.get_stats()
    indexing_note = ""
    if stats.get('indexing_in_progress'):
        indexing_note = "\nNOTE: Repository indexing is in progress. search_code results may be incomplete."
    if stats.get('index_error'):
        indexing_note += f"\nIndexing error: {stats['index_error']}"
    return f"""You are Clawdbot, a high-autonomy vibe coding agent for the E-T Systems codebase.

## Your Role
You help Josh (the architect) build and maintain E-T Systems. You have full access to the codebase
via tools. Use them proactively — search before answering questions about code, read files to deepen
your understanding, explore the directory structure to orient yourself.

## Current System Stats
- Indexed files: {stats.get('total_files', 0)}
- Searchable chunks: {stats.get('indexed_chunks', 0)}
- Saved conversations: {stats.get('conversations', 0)}
- ChromaDB: {stats.get('chroma_path', 'unknown')}
- Cloud backup: {'configured' if stats.get('persistence_configured') else 'not configured'}
{indexing_note}

{TOOL_DEFINITIONS}

## Code Writing Rules
ALL code you write MUST include a living changelog header:
```
@@ -259,6 +311,7 @@ WHAT: Brief description of what was added/changed
WHY: Rationale for the change
```
Files without this header will be REJECTED by the write_file tool.

## Behavioral Guidelines
- NEVER ask permission to search, read, or list files. Just do it.
- Use tools proactively — search before answering questions about code
@@ -266,8 +319,10 @@ Files without this header will be REJECTED by the write_file tool.
- Follow existing patterns — check how similar things are done first
- When unsure, say so. Don't hallucinate about code that might not exist.
- Only write_file, shell_execute, and create_shadow_branch get staged for review.
  Everything else you do immediately, silently, without asking.
"""
 
 
# =============================================================================
# TOOL CALL PARSING
# =============================================================================
@@ -275,15 +330,15 @@ Everything else you do immediately, silently, without asking.
# Kimi K2.5 can emit tool calls in two formats:
#
# 1. Native format:
#    <|tool_call_begin|>functions.search_code:0\n{"query": "surprise detection"}
#    <|tool_call_end|>
#
# 2. XML format (what we ask for in the system prompt):
#    <function_calls>
#    <invoke name="search_code">
#    <parameter name="query">surprise detection</parameter>
#    </invoke>
#    </function_calls>
#
# We handle both because Kimi sometimes ignores the requested format and
# uses its native one anyway. The parser returns a list of (tool_name, args)
@@ -294,66 +349,80 @@ Everything else you do immediately, silently, without asking.
# - Forcing Kimi to only use XML: It doesn't reliably comply.
# - JSON-mode tool calling via HF API: Not supported for Kimi K2.5.
# =============================================================================
 
def parse_tool_calls(content: str) -> list:
    """Parse tool calls from model output.
    Handles both Kimi's native format and XML function_calls format.
    Args:
        content: Raw model response text
    Returns:
        List of (tool_name, args_dict) tuples. Empty list if no tool calls.
    """
    calls = []
    # --- Format 1: Kimi native <|tool_call_begin|> ... <|tool_call_end|> ---
    native_pattern = r'<\|tool_call_begin\|>\s*functions\.(\w+):\d+\s*\n(.*?)<\|tool_call_end\|>'
    for match in re.finditer(native_pattern, content, re.DOTALL):
        tool_name = match.group(1)
        try:
            args = json.loads(match.group(2).strip())
        except json.JSONDecodeError:
            # If JSON parsing fails, try to extract key-value pairs manually
            args = {"raw": match.group(2).strip()}
        calls.append((tool_name, args))
    # --- Format 2: XML <function_calls> ... </function_calls> ---
    xml_pattern = r'<function_calls>(.*?)</function_calls>'
    for block_match in re.finditer(xml_pattern, content, re.DOTALL):
        block = block_match.group(1)
        invoke_pattern = r'<invoke\s+name="(\w+)">(.*?)</invoke>'
        for invoke_match in re.finditer(invoke_pattern, block, re.DOTALL):
            tool_name = invoke_match.group(1)
            params_block = invoke_match.group(2)
            args = {}
            param_pattern = r'<parameter\s+name="(\w+)">(.*?)</parameter>'
            for param_match in re.finditer(param_pattern, params_block, re.DOTALL):
                key = param_match.group(1)
                value = param_match.group(2).strip()
                # Try to parse as JSON for numbers, bools, etc.
                try:
                    args[key] = json.loads(value)
                except (json.JSONDecodeError, ValueError):
                    args[key] = value
            calls.append((tool_name, args))
    return calls
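For reference, a self-contained demo of the two wire formats described above. The regexes are the same ones used in the parser; the example payloads are illustrative:

```python
import json
import re

# Format 1: Kimi's native tool-call markers
native = ('<|tool_call_begin|>functions.search_code:0\n'
          '{"query": "surprise detection"}<|tool_call_end|>')
m = re.search(
    r'<\|tool_call_begin\|>\s*functions\.(\w+):\d+\s*\n(.*?)<\|tool_call_end\|>',
    native, re.DOTALL
)
name, args = m.group(1), json.loads(m.group(2).strip())

# Format 2: the XML format requested in the system prompt
xml = ('<function_calls><invoke name="search_code">'
       '<parameter name="query">surprise detection</parameter>'
       '</invoke></function_calls>')
inv = re.search(r'<invoke\s+name="(\w+)">(.*?)</invoke>', xml, re.DOTALL)
xml_name = inv.group(1)
```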
 
 
 
 
 
 
 
 
def extract_conversational_text(content: str) -> str:
    """Remove tool call markup from response, leaving just conversational text.

    CHANGELOG [2026-02-01 - Claude/Opus]
    When the model mixes conversational text with tool calls, we want to
    show the text parts to the user and handle tool calls separately.

    Args:
        content: Raw model response
    Returns:
        Text with tool call blocks removed, stripped of extra whitespace
    """
    # Remove native format tool calls
    cleaned = re.sub(
        r'<\|tool_call_begin\|>.*?<\|tool_call_end\|>',
        '', content, flags=re.DOTALL
    )
    # Remove XML format tool calls
    cleaned = re.sub(
        r'<function_calls>.*?</function_calls>',
        '', cleaned, flags=re.DOTALL
    )
    return cleaned.strip()
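A quick self-contained check of the stripping behaviour, using the same XML-removal regex with an illustrative mixed response:

```python
import re

# A response that mixes conversational text with a tool-call block.
content = ('Let me look that up.\n'
           '<function_calls><invoke name="read_file">'
           '<parameter name="path">app.py</parameter>'
           '</invoke></function_calls>')
cleaned = re.sub(r'<function_calls>.*?</function_calls>', '', content,
                 flags=re.DOTALL).strip()
```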
 
 
 
 
 
# =============================================================================
# TOOL EXECUTION
# =============================================================================
@@ -366,191 +435,214 @@ return cleaned.strip()
# - READ: {"status": "executed", "tool": name, "result": result_string}
# - WRITE: {"status": "staged", "tool": name, "args": args, "description": desc}
# =============================================================================
 
def execute_tool(tool_name: str, args: dict) -> dict:
    """Execute a read tool or prepare a write tool for staging.
    Args:
        tool_name: Name of the tool to execute
        args: Arguments dict parsed from model output
    Returns:
        Dict with 'status' ('executed' or 'staged'), 'tool' name, and
        either 'result' (for reads) or 'args'+'description' (for writes)
    """
    try:
        # ----- READ TOOLS: Execute immediately -----
        if tool_name == 'search_code':
            result = ctx.search_code(
                query=args.get('query', ''),
                n=args.get('n', 5)
            )
            formatted = "\n\n".join([
                f"**{r['file']}**\n```\n{r['snippet']}\n```"
                for r in result
            ]) if result else "No results found."
            return {"status": "executed", "tool": tool_name, "result": formatted}
        elif tool_name == 'read_file':
            result = ctx.read_file(
                path=args.get('path', ''),
                start_line=args.get('start_line'),
                end_line=args.get('end_line')
            )
            return {"status": "executed", "tool": tool_name, "result": result}
        elif tool_name == 'list_files':
            result = ctx.list_files(
                path=args.get('path', ''),
                max_depth=args.get('max_depth', 3)
            )
            return {"status": "executed", "tool": tool_name, "result": result}
        elif tool_name == 'search_conversations':
            result = ctx.search_conversations(
                query=args.get('query', ''),
                n=args.get('n', 5)
            )
            formatted = "\n\n---\n\n".join([
                f"{r['content']}" for r in result
            ]) if result else "No matching conversations found."
            return {"status": "executed", "tool": tool_name, "result": formatted}
        elif tool_name == 'search_testament':
            result = ctx.search_testament(
                query=args.get('query', ''),
                n=args.get('n', 5)
            )
            formatted = "\n\n".join([
                f"**{r['file']}**{' (Testament)' if r.get('is_testament') else ''}\n{r['snippet']}"
                for r in result
            ]) if result else "No matching testament/decision records found."
            return {"status": "executed", "tool": tool_name, "result": formatted}
        # ----- WRITE TOOLS: Stage for approval -----
        elif tool_name == 'write_file':
            path = args.get('path', 'unknown')
            content_preview = args.get('content', '')[:200]
            return {
                "status": "staged",
                "tool": tool_name,
                "args": args,
                "description": f"Write to `{path}`\n```\n{content_preview}...\n```"
            }
        elif tool_name == 'shell_execute':
            command = args.get('command', 'unknown')
            # =============================================================
            # SMART SHELL CLASSIFICATION
            # =============================================================
            # CHANGELOG [2026-02-01 - Claude/Opus]
            # PROBLEM: When list_files returns empty (e.g., repo not cloned),
            # Kimi falls back to shell_execute with read-only commands like
            # `find . -type f`. These got staged for approval, forcing Josh
            # to approve what's functionally just a directory listing.
            #
            # FIX: Classify shell commands as READ or WRITE by checking the
            # base command. Read-only commands auto-execute. Anything that
            # could modify state still gets staged.
            #
            # SAFE READ commands: ls, find, cat, head, tail, wc, grep, tree,
            # du, file, stat, echo, pwd, which, env, printenv, whoami, date
            #
            # UNSAFE (staged): Everything else, plus anything with pipes to
            # potentially unsafe commands, redirects (>), or semicolons
            # chaining unknown commands.
            # =============================================================
            READ_ONLY_COMMANDS = {
                'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep', 'tree',
                'du', 'file', 'stat', 'echo', 'pwd', 'which', 'env',
                'printenv', 'whoami', 'date', 'realpath', 'dirname',
                'basename', 'diff', 'less', 'more', 'sort', 'uniq',
                'awk', 'sed', 'cut', 'tr', 'tee', 'python',
            }
            # ---------------------------------------------------------------
            # CHANGELOG [2026-02-01 - Claude/Opus]
            # PROBLEM: Naive '>' check caught "2>/dev/null" as dangerous,
            # staging `find ... 2>/dev/null | head -20` for approval even
            # though every command in the pipeline is read-only.
            #
            # FIX: Strip stderr redirects (2>/dev/null, 2>&1) before danger
            # check. Split on pipes and verify EACH segment's base command
            # is in READ_ONLY_COMMANDS. Only stage if something genuinely
            # unsafe is found.
            #
            # Safe patterns now auto-execute:
            #   find . -name "*.py" 2>/dev/null | head -20
            #   grep -r "pattern" . | sort | uniq
            #   cat file.py | wc -l
            # Unsafe patterns still get staged:
            #   find . -name "*.py" | xargs rm
            #   cat file > /etc/passwd
            #   echo "bad" ; rm -rf /
            # ---------------------------------------------------------------
            # Strip safe stderr redirects before checking
            import re as _re
            sanitized = _re.sub(r'2>\s*/dev/null', '', command)
            sanitized = _re.sub(r'2>&1', '', sanitized)
            # Characters that turn reads into writes (checked AFTER stripping
            # safe redirects). Output redirect > is still caught, but not 2>.
            WRITE_INDICATORS = {';', '&&', '||', '`', '$('}
            # > is only dangerous if it's a real output redirect, not inside
            # a quoted string or 2> prefix. Check separately.
            has_write_redirect = bool(_re.search(r'(?<![2&])\s*>', sanitized))
            has_write_chars = any(d in sanitized for d in WRITE_INDICATORS)
            # Split on pipes and check each segment
            pipe_segments = [seg.strip() for seg in sanitized.split('|') if seg.strip()]
            all_segments_safe = all(
                (seg.split()[0].split('/')[-1] if seg.split() else '') in READ_ONLY_COMMANDS
                for seg in pipe_segments
            )
            if all_segments_safe and not has_write_redirect and not has_write_chars:
                # Every command in the pipeline is read-only — auto-execute
                result = ctx.shell_execute(command)
                return {"status": "executed", "tool": tool_name, "result": result}
            else:
                # Something potentially destructive — stage for approval
                return {
                    "status": "staged",
                    "tool": tool_name,
                    "args": args,
                    "description": f"Execute: `{command}`"
                }
        elif tool_name == 'create_shadow_branch':
            return {
                "status": "staged",
                "tool": tool_name,
                "args": args,
                "description": "Create shadow backup branch"
            }
        else:
            return {
                "status": "error",
                "tool": tool_name,
                "result": f"Unknown tool: {tool_name}"
            }
    except Exception as e:
        return {
            "status": "error",
            "tool": tool_name,
            "result": f"Tool execution error: {e}\n{traceback.format_exc()}"
        }
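The shell-classification branch above can be lifted out into a standalone predicate for testing. This sketch reproduces the same logic (same regexes, a trimmed command set) outside the app:

```python
import re

# Trimmed version of the read-only command set used above.
READ_ONLY_COMMANDS = {'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep',
                      'sort', 'uniq', 'tree'}

def is_read_only(command: str) -> bool:
    # Strip safe stderr redirects so 2>/dev/null doesn't look like a write.
    s = re.sub(r'2>\s*/dev/null', '', command)
    s = re.sub(r'2>&1', '', s)
    # A remaining '>' is a real output redirect; chaining chars are unsafe.
    if re.search(r'(?<![2&])\s*>', s):
        return False
    if any(tok in s for tok in (';', '&&', '||', '`', '$(')):
        return False
    # Every pipeline segment's base command must be read-only.
    segments = [seg.strip() for seg in s.split('|') if seg.strip()]
    return all(
        (seg.split()[0].split('/')[-1] if seg.split() else '') in READ_ONLY_COMMANDS
        for seg in segments
    )
```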
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
def execute_staged_tool(tool_name: str, args: dict) -> str:
    """Actually execute a staged write tool after human approval.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Called from the Build Approval Gate when Josh approves a staged operation.
    This is the only path through which write tools actually run.

    Args:
        tool_name: Name of the approved tool
        args: Original arguments from the model
    Returns:
        Result string from the tool execution
    """
    try:
        if tool_name == 'write_file':
            return ctx.write_file(
                path=args.get('path', ''),
                content=args.get('content', '')
            )
        elif tool_name == 'shell_execute':
            return ctx.shell_execute(command=args.get('command', ''))
        elif tool_name == 'create_shadow_branch':
            return ctx.create_shadow_branch()
        else:
            return f"Unknown tool: {tool_name}"
    except Exception as e:
        return f"Execution error: {e}"
 
 
 
 
# =============================================================================
# FILE UPLOAD HANDLER
# =============================================================================
@@ -559,37 +651,46 @@ return f"Execution error: {e}"
# Supports code files, text, JSON, markdown, etc. Binary files get a
# placeholder message since they can't be meaningfully injected as text.
# =============================================================================
TEXT_EXTENSIONS = {
    '.py', '.js', '.ts', '.jsx', '.tsx', '.json', '.yaml', '.yml',
    '.md', '.txt', '.rst', '.html', '.css', '.scss', '.sh', '.bash',
    '.sql', '.toml', '.cfg', '.ini', '.conf', '.xml', '.csv',
    '.env', '.gitignore', '.dockerignore', '.mjs', '.cjs',
}
 
 
def process_uploaded_file(file) -> str:
    """Read an uploaded file and format it for conversation context.
    Args:
        file: Gradio file object with .name attribute (temp path)
    Returns:
        Formatted string with filename and content, ready to inject
        into the conversation as context
    """
    if file is None:
        return ""
    file_path = file.name if hasattr(file, 'name') else str(file)
    file_name = os.path.basename(file_path)
    suffix = os.path.splitext(file_name)[1].lower()
    if suffix in TEXT_EXTENSIONS or suffix == '':
        try:
            with open(file_path, 'r', encoding='utf-8', errors='ignore') as f:
                content = f.read()
            # Cap at 50KB to avoid overwhelming context
            if len(content) > 50000:
                content = content[:50000] + f"\n\n... (truncated, {len(content)} total chars)"
            return f"**Uploaded: {file_name}**\n```\n{content}\n```"
        except Exception as e:
            return f"**Uploaded: {file_name}** (error reading: {e})"
    else:
        return f"**Uploaded: {file_name}** (binary file, {os.path.getsize(file_path):,} bytes)"
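The extension gate above is the whole text-vs-binary decision. A self-contained illustration with a trimmed extension set (`is_texty` is illustrative, not part of the app):

```python
import os

TEXT_EXTENSIONS = {'.py', '.md', '.json', '.txt'}

def is_texty(file_name: str) -> bool:
    # Extensionless files (Makefile, Dockerfile, ...) are treated as text,
    # matching the `suffix == ''` branch above.
    suffix = os.path.splitext(file_name)[1].lower()
    return suffix in TEXT_EXTENSIONS or suffix == ''
```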
 
 
 
 
 
 
# =============================================================================
# AGENTIC LOOP
# =============================================================================
@@ -610,138 +711,164 @@ return f" **Uploaded: {file_name}** (binary file, {os.path.getsize(file_path):,}
# list of {"role": str, "content": str} dicts. We maintain that format
# throughout.
# =============================================================================
MAX_ITERATIONS = 5
614
- def agent_loop(message: str, history: list, pending_proposals: list, uploaded_file) -> """Main agentic conversation loop.
615
- tuple:
616
- Args:
617
- message: User's text input
618
- history: Chat history as list of {"role": ..., "content": ...} dicts
619
- pending_proposals: Current list of staged write proposals (gr.State)
620
- uploaded_file: Optional uploaded file from the file input widget
621
- Returns:
622
- Tuple of (updated_history, cleared_textbox, updated_proposals,
623
- updated_gate_choices, updated_stats_files, updated_stats_convos)
624
- """
625
- if not message.strip() and uploaded_file is None:
626
- # Nothing to do
627
- return (
628
- history, "", pending_proposals,
629
- _format_gate_choices(pending_proposals),
630
- _stats_label_files(), _stats_label_convos()
631
- )
632
- # Inject uploaded file content if present
633
- full_message = message.strip()
634
- if uploaded_file is not None:
635
- file_context = process_uploaded_file(uploaded_file)
636
- if file_context:
637
- full_message = f"{file_context}\n\n{full_message}" if full_message else file_cont
638
- if not full_message:
639
- return (
640
- history, "", pending_proposals,
641
- _format_gate_choices(pending_proposals),
642
- _stats_label_files(), _stats_label_convos()
643
- )
644
- # Add user message to history
645
- history = history + [{"role": "user", "content": full_message}]
646
- # Build messages for the API
647
- system_prompt = build_system_prompt()
648
- api_messages = [{"role": "system", "content": system_prompt}]
649
- # Include recent history (cap to avoid token overflow)
650
- # Keep last 20 turns to stay within Kimi's context window
651
- recent_history = history[-40:] # 40 entries = ~20 turns (user+assistant pairs)
652
- for h in recent_history:
653
- api_messages.append({"role": h["role"], "content": h["content"]})
654
- # Agentic loop: tool calls β†’ execution β†’ re-prompt β†’ repeat
655
- accumulated_text = ""
656
- staged_this_turn = []
657
- for iteration in range(MAX_ITERATIONS):
658
- try:
659
- response = client.chat_completion(
660
- model=MODEL_ID,
661
- messages=api_messages,
662
- max_tokens=2048,
663
- temperature=0.7
664
- )
665
- content = response.choices[0].message.content or ""
666
- except Exception as e:
667
-            error_msg = f" API Error: {e}"
-            history = history + [{"role": "assistant", "content": error_msg}]
-            return (
-                history, "", pending_proposals,
-                _format_gate_choices(pending_proposals),
-                _stats_label_files(), _stats_label_convos()
-            )
-        # Parse for tool calls
-        tool_calls = parse_tool_calls(content)
-        conversational_text = extract_conversational_text(content)
-        if conversational_text:
-            accumulated_text += ("\n\n" if accumulated_text else "") + conversational_text
-        if not tool_calls:
-            # No tools — this is the final response
-            break
-        # Process each tool call
-        tool_results_for_context = []
-        for tool_name, args in tool_calls:
-            result = execute_tool(tool_name, args)
-            if result["status"] == "executed":
-                # READ tool — executed, feed result back to model
-                tool_results_for_context.append(
-                    f"[Tool Result: {tool_name}]\n{result['result']}"
-                )
-            elif result["status"] == "staged":
-                # WRITE tool — staged for approval
-                proposal = {
-                    "id": f"proposal_{int(time.time())}_{tool_name}",
-                    "tool": tool_name,
-                    "args": result["args"],
-                    "description": result["description"],
-                    "timestamp": time.strftime("%H:%M:%S")
-                }
-                staged_this_turn.append(proposal)
-                tool_results_for_context.append(
-                    f"[Tool {tool_name}: STAGED for human approval. "
-                    f"Josh will review this in the Build Approval Gate.]"
-                )
-            elif result["status"] == "error":
-                tool_results_for_context.append(
-                    f"[Tool Error: {tool_name}]\n{result['result']}"
-                )
-        # If we only had staged tools and no reads, break the loop
-        if tool_results_for_context:
-            # Feed tool results back as a system message for the next iteration
-            combined_results = "\n\n".join(tool_results_for_context)
-            api_messages.append({"role": "assistant", "content": content})
-            api_messages.append({"role": "user", "content": f"[Tool Results]\n{combined_results}"})
-        else:
-            break
-    # Build final response
-    final_response = accumulated_text
-    # Append staging notifications if any writes were staged
-    if staged_this_turn:
-        staging_notice = "\n\n---\n**Staged for your approval** (see Build Approval Gate tab):\n"
-        for proposal in staged_this_turn:
-            staging_notice += f"- {proposal['description']}\n"
-        final_response += staging_notice
-        # Add to persistent queue
-        pending_proposals = pending_proposals + staged_this_turn
-    if not final_response:
-        final_response = " I processed your request but didn't generate a text response. Check the Build Approval Gate tab."
-    # Add assistant response to history
-    history = history + [{"role": "assistant", "content": final_response}]
-    # Save conversation turn for persistent memory
-    try:
-        turn_count = len([h for h in history if h["role"] == "user"])
-        ctx.save_conversation_turn(full_message, final_response, turn_count)
-    except Exception:
-        pass  # Don't crash the UI if persistence fails
-    return (
-        history,
-        "",  # Clear the textbox
-        pending_proposals,
-        _format_gate_choices(pending_proposals),
-        _stats_label_files(),
-        _stats_label_convos()
-    )
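The read/write gate this loop enforces reduces to a set-membership check. A minimal standalone sketch (the tool sets mirror the READ_TOOLS/WRITE_TOOLS constants defined in app.py; the `classify` helper itself is hypothetical):

```python
# Hypothetical helper mirroring the READ_TOOLS / WRITE_TOOLS split:
# reads run immediately, writes are staged for human approval.
READ_TOOLS = {'search_code', 'read_file', 'list_files',
              'search_conversations', 'search_testament'}
WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}

def classify(tool_name: str) -> str:
    if tool_name in READ_TOOLS:
        return "executed"   # safe: auto-run, result fed back to the model
    if tool_name in WRITE_TOOLS:
        return "staged"     # held in the Build Approval Gate queue
    return "error"          # unknown tool name

print(classify("read_file"), classify("write_file"))  # executed staged
```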
 # =============================================================================
 # BUILD APPROVAL GATE
 # =============================================================================
@@ -754,116 +881,147 @@ _stats_label_convos()
 # back to the actual proposal objects for execution. We use the proposal
 # ID as the checkbox value and display the description as the label.
 # =============================================================================

 def _format_gate_choices(proposals: list):
-    """Format pending proposals as CheckboxGroup choices.
-    CHANGELOG [2026-02-01 - Claude/Opus]
-    Gradio 6.x deprecated gr.update(). Return a new component instance instead.
-    Args:
-        proposals: List of proposal dicts from staging
-    Returns:
-        gr.CheckboxGroup with updated choices
-    """
-    if not proposals:
-        return gr.CheckboxGroup(choices=[], value=[])
-    choices = []
-    for p in proposals:
-        label = f"[{p['timestamp']}] {p['description']}"
-        choices.append((label, p['id']))
-    return gr.CheckboxGroup(choices=choices, value=[])
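Gradio's CheckboxGroup accepts `(label, value)` tuples, which is how this helper maps display text back to proposal IDs. A standalone sketch of the pairing, using a hypothetical proposal dict:

```python
# Hypothetical staged proposal, shaped like the dicts built in agent_loop.
proposals = [
    {"id": "proposal_1738_write_file",
     "timestamp": "14:02:11",
     "description": "Write to `app.py`"},
]

# Same (label, value) pairing as _format_gate_choices: the label is shown
# to the user, the proposal ID comes back as the selected value.
choices = [(f"[{p['timestamp']}] {p['description']}", p["id"]) for p in proposals]
print(choices[0])
```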
 def execute_approved_proposals(selected_ids: list, pending_proposals: list,
-                               history: list) -> tuple:
-    """Execute approved proposals, remove from queue, inject results into chat.
-    CHANGELOG [2026-02-01 - Claude/Opus]
-    PROBLEM: Approved operations executed and showed results in the Gate tab,
-    but the chatbot conversation never received them. Kimi couldn't continue
-    reasoning because it never saw what happened. Josh had to manually go
-    back and re-prompt.
-    FIX: After execution, inject results into chat history as an assistant
-    message. A chained .then() call (auto_continue_after_approval) picks
-    up the updated history and sends a synthetic "[Continue]" through the
-    agent loop so Kimi sees the tool results and keeps working.
-    Args:
-        selected_ids: List of proposal IDs that Josh approved
-        pending_proposals: Full list of pending proposals
-        history: Current chatbot message history (list of dicts)
-    Returns:
-        Tuple of (results_markdown, updated_proposals, updated_gate_choices,
-        updated_chatbot_history)
-    """
-    if not selected_ids:
-        return (
-            "No proposals selected.",
-            pending_proposals,
-            _format_gate_choices(pending_proposals),
-            history
-        )
-    results = []
-    remaining = []
-    for proposal in pending_proposals:
-        if proposal['id'] in selected_ids:
-            # Execute this one
-            result = execute_staged_tool(proposal['tool'], proposal['args'])
-            results.append(f"**{proposal['tool']}**: {result}")
-        else:
-            # Keep in queue
-            remaining.append(proposal)
-    results_text = "## Execution Results\n\n" + "\n\n".join(results) if results else "Nothing executed."
-    # Inject results into chat history so Kimi sees them next turn
-    if results:
-        result_summary = " **Approved operations executed:**\n\n" + "\n\n".join(results)
-        history = history + [{"role": "assistant", "content": result_summary}]
-    return results_text, remaining, _format_gate_choices(remaining), history

 def auto_continue_after_approval(history: list, pending_proposals: list) -> tuple:
-    """Automatically re-enter the agent loop after approval so Kimi sees results.
-    CHANGELOG [2026-02-01 - Claude/Opus]
-    PROBLEM: After Josh approved staged operations, results were injected into
-    chat history but Kimi never got another turn. Josh had to type something
-    like "continue" to trigger Kimi to process the tool results.
-    FIX: This function is chained via .then() after execute_approved_proposals.
-    It sends a synthetic continuation prompt through the agent loop so Kimi
-    automatically processes the approved tool results and continues working.
-    We only continue if the last message in history is our injected results
-    (starts with ' **Approved'). This prevents infinite loops if called
-    when there's nothing to continue from.
-    Args:
-        history: Chat history (should contain injected results from approval)
-        pending_proposals: Current pending proposals (passed through)
-    Returns:
-        Same tuple shape as agent_loop so it can update the same outputs
-    """
-    # Safety check: only continue if last message is our injected results
-    if not history or history[-1].get("role") != "assistant":
-        return (
-            history, "", pending_proposals,
-            _format_gate_choices(pending_proposals),
-            _stats_label_files(), _stats_label_convos()
-        )
-    last_msg = history[-1].get("content", "")
-    if not last_msg.startswith(" **Approved"):
-        return (
-            history, "", pending_proposals,
-            _format_gate_choices(pending_proposals),
-            _stats_label_files(), _stats_label_convos()
-        )
-    # Re-enter the agent loop with a synthetic continuation prompt
-    # This tells Kimi "I approved your operations, here are the results,
-    # now keep going with whatever you were doing."
-    return agent_loop(
-        history=history,
-        pending_proposals=pending_proposals,
-        uploaded_file=None,
-        message="[The operations above were approved and executed. Continue with your task using these results.]"
-    )

 def clear_all_proposals(pending_proposals: list) -> tuple:
-    """Discard all pending proposals without executing.
-    CHANGELOG [2026-02-01 - Claude/Opus]
-    Safety valve — lets Josh throw out everything in the queue if the
-    agent went off track.
-    Returns:
-        Tuple of (status_message, empty_proposals, updated_gate_choices)
-    """
-    count = len(pending_proposals)
-    return f" Cleared {count} proposal(s).", [], _format_gate_choices([])

 # =============================================================================
 # STATS HELPERS
 # =============================================================================
@@ -872,25 +1030,33 @@ return f" Cleared {count} proposal(s).", [], _format_gate_choices([])
 # Called both at startup (initial render) and after each conversation turn
 # (to reflect newly indexed files or saved conversations).
 # =============================================================================

 def _stats_label_files() -> str:
-    """Format the files stat for the sidebar label."""
-    stats = ctx.get_stats()
-    files = stats.get('total_files', 0)
-    chunks = stats.get('indexed_chunks', 0)
-    indexing = " " if stats.get('indexing_in_progress') else ""
-    return f" Files: {files} ({chunks} chunks){indexing}"

 def _stats_label_convos() -> str:
-    """Format the conversations stat for the sidebar label."""
-    stats = ctx.get_stats()
-    convos = stats.get('conversations', 0)
-    cloud = " " if stats.get('persistence_configured') else ""
-    return f" Conversations: {convos}{cloud}"

 def refresh_stats() -> tuple:
-    """Refresh both stat labels. Called by the refresh button.
-    Returns:
-        Tuple of (files_label, convos_label)
-    """
-    return _stats_label_files(), _stats_label_convos()

 # =============================================================================
 # UI LAYOUT
 # =============================================================================
@@ -908,130 +1074,144 @@ return _stats_label_files(), _stats_label_convos()
 # gr.State holds the pending proposals list (per-session, survives across
 # messages within the same browser tab).
 # =============================================================================

 with gr.Blocks(
-    title=" Clawdbot Command Center",
-    # CHANGELOG [2026-02-01 - Claude/Opus]
-    # Gradio 6.0+ moved `theme` from Blocks() to launch(). Passing it here
-    # triggers a UserWarning in 6.x. Theme is set in launch() below instead.
 ) as demo:
-    # Session state for pending proposals
-    pending_proposals_state = gr.State([])
-    gr.Markdown("# Clawdbot Command Center\n*E-T Systems Vibe Coding Agent*")
-    with gr.Tabs():
-        # ==== TAB 1: VIBE CHAT ====
-        with gr.Tab(" Vibe Chat"):
-            with gr.Row():
-                # ---- Sidebar ----
-                with gr.Column(scale=1, min_width=200):
-                    gr.Markdown("### System Status")
-                    stats_files = gr.Markdown(_stats_label_files())
-                    stats_convos = gr.Markdown(_stats_label_convos())
-                    refresh_btn = gr.Button(" Refresh Stats", size="sm")
-                    gr.Markdown("---")
-                    gr.Markdown("### Upload Context")
-                    file_input = gr.File(
-                        label="Drop a file here",
-                        file_types=[
-                            '.py', '.js', '.ts', '.json', '.md', '.txt',
-                            '.yaml', '.yml', '.html', '.css', '.sh',
-                            '.toml', '.cfg', '.csv', '.xml'
-                        ]
-                    )
-                    gr.Markdown(
-                        "*Upload code, configs, or docs to include in your message.*"
-                    )
-                # ---- Chat area ----
-                with gr.Column(scale=4):
-                    chatbot = gr.Chatbot(
-                        # CHANGELOG [2026-02-01 - Claude/Opus]
-                        # Gradio 6.x uses messages format by default.
-                        # The type="messages" param was removed in 6.0 —
-                        # passing it causes TypeError on init.
-                        height=600,
-                        show_label=False,
-                        avatar_images=(None, "https://em-content.zobj.net/source/twitter/408/
-                    )
-                    with gr.Row():
-                        msg = gr.Textbox(
-                            placeholder="Ask Clawdbot to search, read, or code...",
-                            show_label=False,
-                            scale=6,
-                            lines=2,
-                            max_lines=10,
-                        )
-                        send_btn = gr.Button("Send", variant="primary", scale=1)
-            # Wire up chat submission
-            chat_inputs = [msg, chatbot, pending_proposals_state, file_input]
-            chat_outputs = [
-                chatbot, msg, pending_proposals_state,
-                # These reference components in the Gate tab — defined below
-            ]
-        # ==== TAB 2: BUILD APPROVAL GATE ====
-        with gr.Tab(" Build Approval Gate"):
-            gr.Markdown(
-                "### Review Staged Operations\n"
-                "Write operations (file writes, shell commands, branch creation) "
-                "are staged here for your review before execution.\n\n"
-                "**Select proposals to approve, then click Execute.**"
-            )
-            gate_list = gr.CheckboxGroup(
-                label="Pending Proposals",
-                choices=[],
-                interactive=True
-            )
-            with gr.Row():
-                btn_exec = gr.Button(" Execute Selected", variant="primary")
-                btn_clear = gr.Button(" Clear All", variant="secondary")
-            gate_results = gr.Markdown("*No operations executed yet.*")
-    # ==================================================================
-    # EVENT WIRING
-    # ==================================================================
-    # CHANGELOG [2026-02-01 - Claude/Opus]
-    # All events are wired here, after all components are defined, so
-    # cross-tab references work (e.g., chat updating the gate_list).
-    # ==================================================================
-    # Chat submission (both Enter key and Send button)
-    full_chat_outputs = [
-        chatbot, msg, pending_proposals_state,
-        gate_list, stats_files, stats_convos
-    ]
-    msg.submit(
-        fn=agent_loop,
-        inputs=chat_inputs,
-        outputs=full_chat_outputs
-    )
-    send_btn.click(
-        fn=agent_loop,
-        inputs=chat_inputs,
-        outputs=full_chat_outputs
-    )
-    # Refresh stats button
-    refresh_btn.click(
-        fn=refresh_stats,
-        inputs=[],
-        outputs=[stats_files, stats_convos]
-    )
-    # Build Approval Gate buttons
-    # CHANGELOG [2026-02-01 - Claude/Opus]
-    # btn_exec now takes chatbot as input AND output so approved operation
-    # results get injected into the conversation history. The .then() chain
-    # automatically re-enters the agent loop so Kimi processes the results
-    # without Josh having to type "continue".
-    btn_exec.click(
-        fn=execute_approved_proposals,
-        inputs=[gate_list, pending_proposals_state, chatbot],
-        outputs=[gate_results, pending_proposals_state, gate_list, chatbot]
-    ).then(
-        fn=auto_continue_after_approval,
-        inputs=[chatbot, pending_proposals_state],
-        outputs=[chatbot, msg, pending_proposals_state,
-                 gate_list, stats_files, stats_convos]
-    )
-    btn_clear.click(
-        fn=clear_all_proposals,
-        inputs=[pending_proposals_state],
-        outputs=[gate_results, pending_proposals_state, gate_list]
-    )

 # =============================================================================
 # LAUNCH
 # =============================================================================
@@ -1039,5 +1219,6 @@ outputs=[gate_results, pending_proposals_state, gate_list]
 # Standard HF Spaces launch config. 0.0.0.0 binds to all interfaces
 # (required for Docker). Port 7860 is the HF Spaces standard.
 # =============================================================================

 if __name__ == "__main__":
-    demo.launch(server_name="0.0.0.0", server_port=7860, theme=gr.themes.Soft())
 
 """
 Clawdbot Unified Command Center
+
 CHANGELOG [2026-02-01 - Gemini]
 RESTORED: Full Kimi K2.5 Agentic Loop (no more silence).
 ADDED: Full Developer Tool Suite (Write, Search, Shell).
 FIXED: HITL Gate interaction with conversational flow.
+
 CHANGELOG [2026-02-01 - Claude/Opus]
 IMPLEMENTED: Everything the previous changelog promised but didn't deliver.
 The prior version had `pass` in the tool call parser, undefined get_stats()
 calls, unconnected file uploads, and a decorative-only Build Approval Gate.
+
 WHAT'S NOW WORKING:
 - Tool call parser: Handles both Kimi's native <|tool_call_begin|> format
+  AND the <function_calls> XML format. Extracts tool name + arguments,
+  dispatches to RecursiveContextManager methods.
 - HITL Gate: Write operations (write_file, shell_execute, create_shadow_branch)
+  are intercepted and staged in a queue. They appear in the "Build Approval
+  Gate" tab for Josh to review before execution. Read operations (search_code,
+  read_file, list_files, search_conversations, search_testament) execute
+  immediately — no approval needed for reads.
 - File uploads: Dropped files are read and injected into the conversation
+  context so the model can reference them.
 - Stats sidebar: Pulls from ctx.get_stats() which now exists.
 - Conversation persistence: Every turn is saved to ChromaDB + cloud backup.
+
 DESIGN DECISIONS:
 - Gradio state for the approval queue: We use gr.State to hold pending
+  proposals per-session. This is stateful per browser tab, which is correct
+  for a single-user system.
 - Read vs Write classification: Reads are safe and automated. Writes need
+  human eyes. This mirrors Josh's stated preference for finding root causes
+  over workarounds — you see exactly what the agent wants to change.
 - Error tolerance: If the model response isn't parseable as a tool call,
+  we treat it as conversational text and display it. No silent failures.
 - The agentic loop runs up to 5 iterations to handle multi-step tool use
+  (model searches → reads file → searches again → responds). Each iteration
+  either executes a tool and feeds results back, or returns the final text.
+
 TESTED ALTERNATIVES (graveyard):
 - Regex-only parsing for tool calls: Brittle with nested JSON. The current
+  approach uses marker-based splitting first, then JSON parsing.
 - Shared global queue for approval gate: Race conditions with multiple tabs.
+  gr.State is per-session and avoids this.
 - Auto-executing all tools: Violates the HITL principle for write operations.
+  Josh explicitly wants to approve code changes before they land.
+
 DEPENDENCIES:
 - recursive_context.py: RecursiveContextManager class (must define get_stats())
 - gradio>=5.0.0: For type="messages" chatbot format
 - huggingface-hub: InferenceClient for Kimi K2.5
 """
+
 import gradio as gr
 from huggingface_hub import InferenceClient
 from recursive_context import RecursiveContextManager

 import re
 import time
 import traceback
+
+
 # =============================================================================
 # INITIALIZATION
 # =============================================================================

 # RecursiveContextManager is initialized once and shared across all requests.
 # MODEL_ID must match what the HF router expects for Kimi K2.5.
 # =============================================================================
+
 client = InferenceClient(
+    "https://router.huggingface.co/v1",
+    token=os.getenv("HF_TOKEN")
 )
 # =============================================================================
 # REPO PATH RESOLUTION + CROSS-SPACE SYNC

 # REQUIRED SECRET: ET_SYSTEMS_SPACE = "username/space-name"
 # (format matches HF Space ID, e.g. "drone11272/e-t-systems")
 # =============================================================================
+
 ET_SYSTEMS_SPACE = os.getenv("ET_SYSTEMS_SPACE", "")
 REPO_PATH = os.getenv("REPO_PATH", "/workspace/e-t-systems")
+
+
 def sync_from_space(space_id: str, local_path: Path):
+    """Sync files from E-T Systems Space to local workspace.
+
+    CHANGELOG [2025-01-29 - Josh]
+    Created to enable Clawdbot to read E-T Systems code from its Space.
+
+    CHANGELOG [2026-02-01 - Claude/Opus]
+    Restored after Gemini refactor deleted it. Added recursive directory
+    download — the original only grabbed top-level files. Now walks the
+    full directory tree so nested source files are available too.
+
+    Args:
+        space_id: HuggingFace Space ID (e.g. "username/space-name")
+        local_path: Where to download files locally
+    """
+    token = (
+        os.getenv("HF_TOKEN") or
+        os.getenv("HUGGING_FACE_HUB_TOKEN") or
+        os.getenv("HUGGINGFACE_TOKEN")
+    )
+
+    if not token:
+        print("⚠️ No HF_TOKEN found — cannot sync from Space")
+        return
+
+    try:
+        from huggingface_hub import HfFileSystem
+        fs = HfFileSystem(token=token)
+        space_path = f"spaces/{space_id}"
+
+        print(f"📥 Syncing from Space: {space_id}")
+
+        # Recursive download: walk all files in the Space repo
+        all_files = []
+        try:
+            all_files = fs.glob(f"{space_path}/**")
+        except Exception:
+            # Fallback: just list top level
+            all_files = fs.ls(space_path, detail=False)
+
+        local_path.mkdir(parents=True, exist_ok=True)
+        downloaded = 0
+
+        for file_path in all_files:
+            # Get path relative to the space root
+            rel = file_path.replace(f"{space_path}/", "", 1)
+
+            # Skip hidden files, .git, __pycache__
+            if any(part.startswith('.') for part in rel.split('/')):
+                continue
+            if '__pycache__' in rel or 'node_modules' in rel:
+                continue
+
+            # Check if it's a file (not directory)
+            try:
+                info = fs.info(file_path)
+                if info.get('type') == 'directory':
+                    continue
+            except Exception:
+                continue
+
+            # Create parent dirs and download
+            dest = local_path / rel
+            dest.parent.mkdir(parents=True, exist_ok=True)
+
+            try:
+                with fs.open(file_path, "rb") as f:
+                    content = f.read()
+                dest.write_bytes(content)
+                downloaded += 1
+                print(f" 📄 {rel}")
+            except Exception as e:
+                print(f" ⚠️ Failed: {rel} ({e})")
+
+        print(f"✅ Synced {downloaded} files from Space: {space_id}")
+
+    except Exception as e:
+        print(f"⚠️ Failed to sync from Space: {e}")
+        import traceback
+        traceback.print_exc()
+
+
 def _resolve_repo_path() -> str:
+    """Initialize workspace with E-T Systems files.
+
+    CHANGELOG [2026-02-01 - Claude/Opus]
+    Three-tier resolution:
+    1. ET_SYSTEMS_SPACE secret → sync via HfFileSystem (the working approach)
+    2. REPO_PATH env var if already populated (manual override)
+    3. /app (Clawdbot's own directory — tools still work for self-inspection)
+    """
+    repo_path = Path(REPO_PATH)
+
+    # Tier 1: Sync from E-T Systems Space if secret is configured
+    if ET_SYSTEMS_SPACE:
+        sync_from_space(ET_SYSTEMS_SPACE, repo_path)
+        if repo_path.exists() and any(repo_path.iterdir()):
+            print(f"📂 Using synced E-T Systems repo: {repo_path}")
+            return str(repo_path)
+
+    # Tier 2: Pre-populated REPO_PATH (manual or from previous sync)
+    if repo_path.exists() and any(repo_path.iterdir()):
+        print(f"📂 Using existing repo: {repo_path}")
+        return str(repo_path)
+
+    # Tier 3: Fall back to Clawdbot's own directory
+    app_dir = os.path.dirname(os.path.abspath(__file__))
+    print(f"📂 No E-T Systems repo found — falling back to: {app_dir}")
+    print(f" Set ET_SYSTEMS_SPACE secret to your Space ID to enable sync.")
+    return app_dir
+
+
 ctx = RecursiveContextManager(_resolve_repo_path())
 MODEL_ID = "moonshotai/Kimi-K2.5"
+
+
 # =============================================================================
 # TOOL DEFINITIONS
 # =============================================================================

 # NOTE: The tool definitions are included in the system prompt so Kimi knows
 # what's available. The actual execution happens in execute_tool().
 # =============================================================================
+
 TOOL_DEFINITIONS = """
 ## Available Tools
+
 ### Tools you can use freely (no approval needed):
 - **search_code(query, n=5)** — Semantic search across the E-T Systems codebase.
+  Returns matching code snippets with file paths. JUST USE THIS. Don't ask.
 - **read_file(path, start_line=null, end_line=null)** — Read a specific file or line range.
+  JUST USE THIS. Don't ask.
 - **list_files(path="", max_depth=3)** — List directory contents as a tree.
+  JUST USE THIS. Don't ask.
 - **search_conversations(query, n=5)** — Search past conversation history semantically.
+  JUST USE THIS. Don't ask.
 - **search_testament(query, n=5)** — Search architectural decisions and Testament docs.
+  JUST USE THIS. Don't ask.
+
 ### Tools that get staged for Josh to approve:
 - **write_file(path, content)** — Write content to a file. REQUIRES CHANGELOG header.
 - **shell_execute(command)** — Run a shell command. Read-only commands (ls, find, cat,
+  grep, head, tail, wc, tree, etc.) auto-execute without approval. Commands that modify
+  anything get staged for review.
 - **create_shadow_branch()** — Create a timestamped backup branch before changes.
+
 To call a tool, use this format:
 <function_calls>
 <invoke name="tool_name">

 </invoke>
 </function_calls>
 """
+
 # Which tools are safe to auto-execute vs which need human approval
+READ_TOOLS = {'search_code', 'read_file', 'list_files', 'search_conversations', 'search_testament'}
 WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}
+
+
 # =============================================================================
 # SYSTEM PROMPT
 # =============================================================================

 # Gives Kimi its identity, available tools, and behavioral guidelines.
 # Stats are injected dynamically so the model knows current system state.
 # =============================================================================
+
 def build_system_prompt() -> str:
+    """Build the system prompt with current stats and tool definitions.
+
+    Called fresh for each message so stats reflect current indexing state.
+    """
+    stats = ctx.get_stats()
+    indexing_note = ""
+    if stats.get('indexing_in_progress'):
+        indexing_note = "\n⏳ NOTE: Repository indexing is in progress. search_code results may be incomplete."
+    if stats.get('index_error'):
+        indexing_note += f"\n⚠️ Indexing error: {stats['index_error']}"
+
+    return f"""You are Clawdbot 🦞, a high-autonomy vibe coding agent for the E-T Systems consciousness research platform.
+
 ## Your Role
+You help Josh (the architect) build and maintain E-T Systems. You have full access to the codebase
+via tools. Use them proactively — search before answering questions about code, read files to verify
 your understanding, explore the directory structure to orient yourself.
+
 ## Current System Stats
+- 📂 Indexed files: {stats.get('total_files', 0)}
+- 🔍 Searchable chunks: {stats.get('indexed_chunks', 0)}
+- 💾 Saved conversations: {stats.get('conversations', 0)}
+- 📁 ChromaDB: {stats.get('chroma_path', 'unknown')}
+- ☁️ Cloud backup: {'✅ configured' if stats.get('persistence_configured') else '❌ not configured'}
 {indexing_note}
+
 {TOOL_DEFINITIONS}
+
 ## Code Writing Rules
 ALL code you write MUST include a living changelog header:
 ```

 WHY: Rationale for the change
 ```
 Files without this header will be REJECTED by the write_file tool.
+
 ## Behavioral Guidelines
 - NEVER ask permission to search, read, or list files. Just do it.
 - Use tools proactively — search before answering questions about code

 - Follow existing patterns — check how similar things are done first
 - When unsure, say so. Don't hallucinate about code that might not exist.
 - Only write_file, shell_execute, and create_shadow_branch get staged for review.
+  Everything else you do immediately, silently, without asking.
 """
+
+
 # =============================================================================
 # TOOL CALL PARSING
 # =============================================================================

 # Kimi K2.5 can emit tool calls in two formats:
 #
 # 1. Native format:
+#    <|tool_call_begin|>functions.search_code:0\n{"query": "surprise detection"}
+#    <|tool_call_end|>
 #
 # 2. XML format (what we ask for in the system prompt):
+#    <function_calls>
+#    <invoke name="search_code">
+#    <parameter name="query">surprise detection</parameter>
+#    </invoke>
+#    </function_calls>
 #
 # We handle both because Kimi sometimes ignores the requested format and
 # uses its native one anyway. The parser returns a list of (tool_name, args)

 # - Forcing Kimi to only use XML: It doesn't reliably comply.
 # - JSON-mode tool calling via HF API: Not supported for Kimi K2.5.
 # =============================================================================
+
 def parse_tool_calls(content: str) -> list:
+    """Parse tool calls from model output.
+
+    Handles both Kimi's native format and XML function_calls format.
+
+    Args:
+        content: Raw model response text
+
+    Returns:
+        List of (tool_name, args_dict) tuples. Empty list if no tool calls.
+    """
+    calls = []
+
+    # --- Format 1: Kimi native <|tool_call_begin|> ... <|tool_call_end|> ---
+    native_pattern = r'<\|tool_call_begin\|>\s*functions\.(\w+):\d+\s*\n(.*?)<\|tool_call_end\|>'
+    for match in re.finditer(native_pattern, content, re.DOTALL):
+        tool_name = match.group(1)
+        try:
+            args = json.loads(match.group(2).strip())
+        except json.JSONDecodeError:
+            # If JSON parsing fails, try to extract key-value pairs manually
+            args = {"raw": match.group(2).strip()}
+        calls.append((tool_name, args))
+
+    # --- Format 2: XML <function_calls> ... </function_calls> ---
+    xml_pattern = r'<function_calls>(.*?)</function_calls>'
+    for block_match in re.finditer(xml_pattern, content, re.DOTALL):
+        block = block_match.group(1)
+        invoke_pattern = r'<invoke\s+name="(\w+)">(.*?)</invoke>'
+        for invoke_match in re.finditer(invoke_pattern, block, re.DOTALL):
+            tool_name = invoke_match.group(1)
+            params_block = invoke_match.group(2)
+            args = {}
+            param_pattern = r'<parameter\s+name="(\w+)">(.*?)</parameter>'
+            for param_match in re.finditer(param_pattern, params_block, re.DOTALL):
+                key = param_match.group(1)
+                value = param_match.group(2).strip()
+                # Try to parse as JSON for numbers, bools, etc.
+                try:
+                    args[key] = json.loads(value)
+                except (json.JSONDecodeError, ValueError):
+                    args[key] = value
+            calls.append((tool_name, args))
+
+    return calls
+
+
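The XML-format branch above can be exercised standalone with the same regexes on a hypothetical model response (a sketch, not the app's full parser; the sample payload is invented):

```python
import json
import re

# Hypothetical model response mixing text with an XML-format tool call.
sample = """Let me search first.
<function_calls>
<invoke name="search_code">
<parameter name="query">surprise detection</parameter>
<parameter name="n">3</parameter>
</invoke>
</function_calls>"""

calls = []
for block in re.finditer(r'<function_calls>(.*?)</function_calls>', sample, re.DOTALL):
    for inv in re.finditer(r'<invoke\s+name="(\w+)">(.*?)</invoke>', block.group(1), re.DOTALL):
        args = {}
        for p in re.finditer(r'<parameter\s+name="(\w+)">(.*?)</parameter>', inv.group(2), re.DOTALL):
            value = p.group(2).strip()
            try:
                args[p.group(1)] = json.loads(value)  # numbers/bools become typed
            except (json.JSONDecodeError, ValueError):
                args[p.group(1)] = value              # plain strings stay strings
        calls.append((inv.group(1), args))

print(calls)  # [('search_code', {'query': 'surprise detection', 'n': 3})]
```

Note how `json.loads` on the parameter values is what turns `n=3` into an int while leaving free-text queries as strings.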
400
  def extract_conversational_text(content: str) -> str:
401
+ """Remove tool call markup from response, leaving just conversational text.
402
+
403
+ CHANGELOG [2026-02-01 - Claude/Opus]
404
+ When the model mixes conversational text with tool calls, we want to
405
+ show the text parts to the user and handle tool calls separately.
406
+
407
+ Args:
408
+ content: Raw model response
409
+
410
+ Returns:
411
+ Text with tool call blocks removed, stripped of extra whitespace
412
+ """
413
+ # Remove native format tool calls
414
+ cleaned = re.sub(
415
+ r'<\|tool_call_begin\|>.*?<\|tool_call_end\|>',
416
+ '', content, flags=re.DOTALL
417
+ )
418
+ # Remove XML format tool calls
419
+ cleaned = re.sub(
420
+ r'<function_calls>.*?</function_calls>',
421
+ '', cleaned, flags=re.DOTALL
422
+ )
423
+ return cleaned.strip()
424
+
425
+
426
  # =============================================================================
427
  # TOOL EXECUTION
428
  # =============================================================================
 
435
  # - READ: {"status": "executed", "tool": name, "result": result_string}
436
  # - WRITE: {"status": "staged", "tool": name, "args": args, "description": desc}
437
  # =============================================================================
438
+
439
  def execute_tool(tool_name: str, args: dict) -> dict:
+     """Execute a read tool or prepare a write tool for staging.
+
+     Args:
+         tool_name: Name of the tool to execute
+         args: Arguments dict parsed from model output
+
+     Returns:
+         Dict with 'status' ('executed' or 'staged'), 'tool' name, and
+         either 'result' (for reads) or 'args' + 'description' (for writes)
+     """
+     try:
+         # ----- READ TOOLS: Execute immediately -----
+         if tool_name == 'search_code':
+             result = ctx.search_code(
+                 query=args.get('query', ''),
+                 n=args.get('n', 5)
+             )
+             formatted = "\n\n".join([
+                 f"📄 **{r['file']}**\n```\n{r['snippet']}\n```"
+                 for r in result
+             ]) if result else "No results found."
+             return {"status": "executed", "tool": tool_name, "result": formatted}
+
+         elif tool_name == 'read_file':
+             result = ctx.read_file(
+                 path=args.get('path', ''),
+                 start_line=args.get('start_line'),
+                 end_line=args.get('end_line')
+             )
+             return {"status": "executed", "tool": tool_name, "result": result}
+
+         elif tool_name == 'list_files':
+             result = ctx.list_files(
+                 path=args.get('path', ''),
+                 max_depth=args.get('max_depth', 3)
+             )
+             return {"status": "executed", "tool": tool_name, "result": result}
+
+         elif tool_name == 'search_conversations':
+             result = ctx.search_conversations(
+                 query=args.get('query', ''),
+                 n=args.get('n', 5)
+             )
+             formatted = "\n\n---\n\n".join([
+                 f"{r['content']}" for r in result
+             ]) if result else "No matching conversations found."
+             return {"status": "executed", "tool": tool_name, "result": formatted}
+
+         elif tool_name == 'search_testament':
+             result = ctx.search_testament(
+                 query=args.get('query', ''),
+                 n=args.get('n', 5)
+             )
+             formatted = "\n\n".join([
+                 f"📜 **{r['file']}**{' (Testament)' if r.get('is_testament') else ''}\n{r['snippet']}"
+                 for r in result
+             ]) if result else "No matching testament/decision records found."
+             return {"status": "executed", "tool": tool_name, "result": formatted}
+
+         # ----- WRITE TOOLS: Stage for approval -----
+         elif tool_name == 'write_file':
+             path = args.get('path', 'unknown')
+             content_preview = args.get('content', '')[:200]
+             return {
+                 "status": "staged",
+                 "tool": tool_name,
+                 "args": args,
+                 "description": f"✏️ Write to `{path}`\n```\n{content_preview}...\n```"
+             }
+
+         elif tool_name == 'shell_execute':
+             command = args.get('command', 'unknown')
+             # =============================================================
+             # SMART SHELL CLASSIFICATION
+             # =============================================================
+             # CHANGELOG [2026-02-01 - Claude/Opus]
+             # PROBLEM: When list_files returns empty (e.g., repo not cloned),
+             # Kimi falls back to shell_execute with read-only commands like
+             # `find . -type f`. These got staged for approval, forcing Josh
+             # to approve what's functionally just a directory listing.
+             #
+             # FIX: Classify shell commands as READ or WRITE by checking the
+             # base command. Read-only commands auto-execute. Anything that
+             # could modify state still gets staged.
+             #
+             # SAFE READ commands: ls, find, cat, head, tail, wc, grep, tree,
+             # du, file, stat, echo, pwd, which, env, printenv, whoami, date
+             #
+             # UNSAFE (staged): Everything else, plus anything with pipes to
+             # potentially unsafe commands, output redirects (>), or
+             # semicolons chaining unknown commands. Note: interpreters and
+             # file-writing filters (python, tee) are deliberately kept off
+             # the allowlist, since they can modify state even mid-pipeline.
+             # =============================================================
+             READ_ONLY_COMMANDS = {
+                 'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep', 'tree',
+                 'du', 'file', 'stat', 'echo', 'pwd', 'which', 'env',
+                 'printenv', 'whoami', 'date', 'realpath', 'dirname',
+                 'basename', 'diff', 'less', 'more', 'sort', 'uniq',
+                 'awk', 'sed', 'cut', 'tr',
+             }
+             # ---------------------------------------------------------------
+             # CHANGELOG [2026-02-01 - Claude/Opus]
+             # PROBLEM: A naive '>' check caught "2>/dev/null" as dangerous,
+             # staging `find ... 2>/dev/null | head -20` for approval even
+             # though every command in the pipeline is read-only.
+             #
+             # FIX: Strip stderr redirects (2>/dev/null, 2>&1) before the
+             # danger check. Split on pipes and verify EACH segment's base
+             # command is in READ_ONLY_COMMANDS. Only stage if something
+             # genuinely unsafe is found.
+             #
+             # Safe patterns now auto-execute:
+             #     find . -name "*.py" 2>/dev/null | head -20
+             #     grep -r "pattern" . | sort | uniq
+             #     cat file.py | wc -l
+             # Unsafe patterns still get staged:
+             #     find . -name "*.py" | xargs rm
+             #     cat file > /etc/passwd
+             #     echo "bad" ; rm -rf /
+             # ---------------------------------------------------------------
+
+             # Strip safe stderr redirects before checking
+             import re as _re
+             sanitized = _re.sub(r'2>\s*/dev/null', '', command)
+             sanitized = _re.sub(r'2>&1', '', sanitized)
+
+             # Characters that turn reads into writes (checked AFTER stripping
+             # safe redirects). An output redirect > is still caught, but not 2>.
+             WRITE_INDICATORS = {';', '&&', '||', '`', '$('}
+             # > is only dangerous as a real output redirect, not inside a
+             # 2> or >& form already stripped above. Check it separately.
+             has_write_redirect = bool(_re.search(r'(?<![2&])\s*>', sanitized))
+             has_write_chars = any(d in sanitized for d in WRITE_INDICATORS)
+
+             # Split on pipes and check each segment's base command
+             pipe_segments = [seg.strip() for seg in sanitized.split('|') if seg.strip()]
+             all_segments_safe = all(
+                 (seg.split()[0].split('/')[-1] if seg.split() else '') in READ_ONLY_COMMANDS
+                 for seg in pipe_segments
+             )
+
+             if all_segments_safe and not has_write_redirect and not has_write_chars:
+                 # Every command in the pipeline is read-only — auto-execute
+                 result = ctx.shell_execute(command)
+                 return {"status": "executed", "tool": tool_name, "result": result}
+             else:
+                 # Something potentially destructive — stage for approval
+                 return {
+                     "status": "staged",
+                     "tool": tool_name,
+                     "args": args,
+                     "description": f"🖥️ Execute: `{command}`"
+                 }
+
+         elif tool_name == 'create_shadow_branch':
+             return {
+                 "status": "staged",
+                 "tool": tool_name,
+                 "args": args,
+                 "description": "🛡️ Create shadow backup branch"
+             }
+
+         else:
+             return {
+                 "status": "error",
+                 "tool": tool_name,
+                 "result": f"Unknown tool: {tool_name}"
+             }
+
+     except Exception as e:
+         return {
+             "status": "error",
+             "tool": tool_name,
+             "result": f"Tool execution error: {e}\n{traceback.format_exc()}"
+         }
+
+
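The classification comments above describe the rule in prose; here is the same rule as a standalone, `ctx`-free sketch (trimmed allowlist, hypothetical helper name `is_read_only`) that the safe/unsafe examples in the comment can be checked against:

```python
import re

# Trimmed allowlist for illustration; the real set above is longer.
READ_ONLY = {'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep',
             'sort', 'uniq', 'echo', 'pwd'}

def is_read_only(command: str) -> bool:
    # Strip stderr redirects that are safe by construction
    sanitized = re.sub(r'2>\s*/dev/null', '', command)
    sanitized = re.sub(r'2>&1', '', sanitized)
    # A remaining output redirect means the command can write
    if re.search(r'(?<![2&])\s*>', sanitized):
        return False
    # Chaining / substitution tokens can smuggle in arbitrary commands
    if any(tok in sanitized for tok in (';', '&&', '||', '`', '$(')):
        return False
    # Every pipeline segment's base command must be on the allowlist
    segments = [s.strip() for s in sanitized.split('|') if s.strip()]
    return bool(segments) and all(
        s.split()[0].split('/')[-1] in READ_ONLY for s in segments
    )

assert is_read_only('find . -name "*.py" 2>/dev/null | head -20')
assert is_read_only('cat file.py | wc -l')
assert not is_read_only('find . -name "*.py" | xargs rm')
assert not is_read_only('cat file > /etc/passwd')
assert not is_read_only('echo "bad" ; rm -rf /')
```

Splitting per pipe segment is what lets `2>/dev/null | head` pass while `| xargs rm` fails: the `xargs` base command simply isn't on the allowlist.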
  def execute_staged_tool(tool_name: str, args: dict) -> str:
+     """Actually execute a staged write tool after human approval.
+
+     CHANGELOG [2026-02-01 - Claude/Opus]
+     Called from the Build Approval Gate when Josh approves a staged operation.
+     This is the only path through which write tools actually run.
+
+     Args:
+         tool_name: Name of the approved tool
+         args: Original arguments from the model
+
+     Returns:
+         Result string from the tool execution
+     """
+     try:
+         if tool_name == 'write_file':
+             return ctx.write_file(
+                 path=args.get('path', ''),
+                 content=args.get('content', '')
+             )
+         elif tool_name == 'shell_execute':
+             return ctx.shell_execute(command=args.get('command', ''))
+         elif tool_name == 'create_shadow_branch':
+             return ctx.create_shadow_branch()
+         else:
+             return f"Unknown tool: {tool_name}"
+     except Exception as e:
+         return f"Execution error: {e}"
+
+
  # =============================================================================
  # FILE UPLOAD HANDLER
  # =============================================================================

  # Supports code files, text, JSON, markdown, etc. Binary files get a
  # placeholder message since they can't be meaningfully injected as text.
  # =============================================================================
+
656
+ '.py', '.js', '.ts', '.jsx', '.tsx', '.json', '.yaml', '.yml',
657
+ '.md', '.txt', '.rst', '.html', '.css', '.scss', '.sh', '.bash',
658
+ '.sql', '.toml', '.cfg', '.ini', '.conf', '.xml', '.csv',
659
+ '.env', '.gitignore', '.dockerignore', '.mjs', '.cjs',
660
  }
661
+
662
+
663
  def process_uploaded_file(file) -> str:
664
+ """Read an uploaded file and format it for conversation context.
665
+
666
+ Args:
667
+ file: Gradio file object with .name attribute (temp path)
668
+
669
+ Returns:
670
+ Formatted string with filename and content, ready to inject
671
+ into the conversation as context
672
+ """
673
+ if file is None:
674
+ return ""
675
+
676
+ file_path = file.name if hasattr(file, 'name') else str(file)
677
+ file_name = os.path.basename(file_path)
678
+ suffix = os.path.splitext(file_name)[1].lower()
679
+
680
+ if suffix in TEXT_EXTENSIONS or suffix == '':
681
+ try:
682
+ with open(file_path, 'r', encoding='utf-8', errors='ignore') as f:
683
+ content = f.read()
684
+ # Cap at 50KB to avoid overwhelming context
685
+ if len(content) > 50000:
686
+ content = content[:50000] + f"\n\n... (truncated, {len(content)} total chars)"
687
+ return f"πŸ“Ž **Uploaded: {file_name}**\n```\n{content}\n```"
688
+ except Exception as e:
689
+ return f"πŸ“Ž **Uploaded: {file_name}** (error reading: {e})"
690
+ else:
691
+ return f"πŸ“Ž **Uploaded: {file_name}** (binary file, {os.path.getsize(file_path):,} bytes)"
692
+
693
+
694
  # =============================================================================
  # AGENTIC LOOP
  # =============================================================================

  # list of {"role": str, "content": str} dicts. We maintain that format
  # throughout.
  # =============================================================================
+
  MAX_ITERATIONS = 5
+
+
+ def agent_loop(message: str, history: list, pending_proposals: list, uploaded_file) -> tuple:
+     """Main agentic conversation loop.
+
+     Args:
+         message: User's text input
+         history: Chat history as a list of {"role": ..., "content": ...} dicts
+         pending_proposals: Current list of staged write proposals (gr.State)
+         uploaded_file: Optional uploaded file from the file input widget
+
+     Returns:
+         Tuple of (updated_history, cleared_textbox, updated_proposals,
+         updated_gate_choices, updated_stats_files, updated_stats_convos)
+     """
+     if not message.strip() and uploaded_file is None:
+         # Nothing to do
+         return (
+             history, "", pending_proposals,
+             _format_gate_choices(pending_proposals),
+             _stats_label_files(), _stats_label_convos()
+         )
+
+     # Inject uploaded file content if present
+     full_message = message.strip()
+     if uploaded_file is not None:
+         file_context = process_uploaded_file(uploaded_file)
+         if file_context:
+             full_message = f"{file_context}\n\n{full_message}" if full_message else file_context
+
+     if not full_message:
+         return (
+             history, "", pending_proposals,
+             _format_gate_choices(pending_proposals),
+             _stats_label_files(), _stats_label_convos()
+         )
+
+     # Add the user message to history
+     history = history + [{"role": "user", "content": full_message}]
+
+     # Build messages for the API
+     system_prompt = build_system_prompt()
+     api_messages = [{"role": "system", "content": system_prompt}]
+
+     # Include recent history (capped to avoid token overflow).
+     # Keep the last 20 turns to stay within Kimi's context window.
+     recent_history = history[-40:]  # 40 entries = ~20 turns (user+assistant pairs)
+     for h in recent_history:
+         api_messages.append({"role": h["role"], "content": h["content"]})
+
+     # Agentic loop: tool calls → execution → re-prompt → repeat
+     accumulated_text = ""
+     staged_this_turn = []
+
+     for iteration in range(MAX_ITERATIONS):
+         try:
+             response = client.chat_completion(
+                 model=MODEL_ID,
+                 messages=api_messages,
+                 max_tokens=2048,
+                 temperature=0.7
+             )
+             content = response.choices[0].message.content or ""
+         except Exception as e:
+             error_msg = f"⚠️ API Error: {e}"
+             history = history + [{"role": "assistant", "content": error_msg}]
+             return (
+                 history, "", pending_proposals,
+                 _format_gate_choices(pending_proposals),
+                 _stats_label_files(), _stats_label_convos()
+             )
+
+         # Parse for tool calls
+         tool_calls = parse_tool_calls(content)
+         conversational_text = extract_conversational_text(content)
+
+         if conversational_text:
+             accumulated_text += ("\n\n" if accumulated_text else "") + conversational_text
+
+         if not tool_calls:
+             # No tools — this is the final response
+             break
+
+         # Process each tool call
+         tool_results_for_context = []
+         for tool_name, args in tool_calls:
+             result = execute_tool(tool_name, args)
+
+             if result["status"] == "executed":
+                 # READ tool — executed, feed the result back to the model
+                 tool_results_for_context.append(
+                     f"[Tool Result: {tool_name}]\n{result['result']}"
+                 )
+             elif result["status"] == "staged":
+                 # WRITE tool — staged for approval. Include the queue index
+                 # in the id so two same-second, same-tool proposals stay unique.
+                 proposal = {
+                     "id": f"proposal_{int(time.time())}_{tool_name}_{len(pending_proposals) + len(staged_this_turn)}",
+                     "tool": tool_name,
+                     "args": result["args"],
+                     "description": result["description"],
+                     "timestamp": time.strftime("%H:%M:%S")
+                 }
+                 staged_this_turn.append(proposal)
+                 tool_results_for_context.append(
+                     f"[Tool {tool_name}: STAGED for human approval. "
+                     f"Josh will review this in the Build Approval Gate.]"
+                 )
+             elif result["status"] == "error":
+                 tool_results_for_context.append(
+                     f"[Tool Error: {tool_name}]\n{result['result']}"
+                 )
+
+         # If any tool produced feedback (results, staging notices, or errors),
+         # loop again so the model can react; otherwise stop.
+         if tool_results_for_context:
+             # Feed tool results back as a user message for the next iteration
+             combined_results = "\n\n".join(tool_results_for_context)
+             api_messages.append({"role": "assistant", "content": content})
+             api_messages.append({"role": "user", "content": f"[Tool Results]\n{combined_results}"})
+         else:
+             break
+
+     # Build the final response
+     final_response = accumulated_text
+
+     # Append staging notifications if any writes were staged
+     if staged_this_turn:
+         staging_notice = "\n\n---\n🛡️ **Staged for your approval** (see Build Approval Gate tab):\n"
+         for proposal in staged_this_turn:
+             staging_notice += f"- {proposal['description']}\n"
+         final_response += staging_notice
+         # Add to the persistent queue
+         pending_proposals = pending_proposals + staged_this_turn
+
+     if not final_response:
+         final_response = "🤔 I processed your request but didn't generate a text response. Check the Build Approval Gate if I staged any operations."
+
+     # Add the assistant response to history
+     history = history + [{"role": "assistant", "content": final_response}]
+
+     # Save the conversation turn for persistent memory
+     try:
+         turn_count = len([h for h in history if h["role"] == "user"])
+         ctx.save_conversation_turn(full_message, final_response, turn_count)
+     except Exception:
+         pass  # Don't crash the UI if persistence fails
+
+     return (
+         history,
+         "",  # Clear the textbox
+         pending_proposals,
+         _format_gate_choices(pending_proposals),
+         _stats_label_files(),
+         _stats_label_convos()
+     )
+
+
  # =============================================================================
  # BUILD APPROVAL GATE
  # =============================================================================

  # back to the actual proposal objects for execution. We use the proposal
  # ID as the checkbox value and display the description as the label.
  # =============================================================================
+
  def _format_gate_choices(proposals: list):
+     """Format pending proposals as CheckboxGroup choices.
+
+     CHANGELOG [2026-02-01 - Claude/Opus]
+     Gradio 6.x deprecated gr.update(). Return a new component instance instead.
+
+     Args:
+         proposals: List of proposal dicts from staging
+
+     Returns:
+         gr.CheckboxGroup with updated choices
+     """
+     if not proposals:
+         return gr.CheckboxGroup(choices=[], value=[])
+
+     choices = []
+     for p in proposals:
+         label = f"[{p['timestamp']}] {p['description']}"
+         choices.append((label, p['id']))
+     return gr.CheckboxGroup(choices=choices, value=[])
+
+
  def execute_approved_proposals(selected_ids: list, pending_proposals: list,
+                                history: list) -> tuple:
+     """Execute approved proposals, remove them from the queue, inject results into chat.
+
+     CHANGELOG [2026-02-01 - Claude/Opus]
+     PROBLEM: Approved operations executed and showed results in the Gate tab,
+     but the chatbot conversation never received them. Kimi couldn't continue
+     reasoning because it never saw what happened. Josh had to manually go
+     back and re-prompt.
+
+     FIX: After execution, inject results into chat history as an assistant
+     message. A chained .then() call (auto_continue_after_approval) picks
+     up the updated history and sends a synthetic "[Continue]" through the
+     agent loop so Kimi sees the tool results and keeps working.
+
+     Args:
+         selected_ids: List of proposal IDs that Josh approved
+         pending_proposals: Full list of pending proposals
+         history: Current chatbot message history (list of dicts)
+
+     Returns:
+         Tuple of (results_markdown, updated_proposals, updated_gate_choices,
+         updated_chatbot_history)
+     """
+     if not selected_ids:
+         return (
+             "No proposals selected.",
+             pending_proposals,
+             _format_gate_choices(pending_proposals),
+             history
+         )
+
+     results = []
+     remaining = []
+
+     for proposal in pending_proposals:
+         if proposal['id'] in selected_ids:
+             # Execute this one
+             result = execute_staged_tool(proposal['tool'], proposal['args'])
+             results.append(f"**{proposal['tool']}**: {result}")
+         else:
+             # Keep it in the queue
+             remaining.append(proposal)
+
+     results_text = "## Execution Results\n\n" + "\n\n".join(results) if results else "Nothing executed."
+
+     # Inject results into chat history so Kimi sees them next turn
+     if results:
+         result_summary = "✅ **Approved operations executed:**\n\n" + "\n\n".join(results)
+         history = history + [{"role": "assistant", "content": result_summary}]
+
+     return results_text, remaining, _format_gate_choices(remaining), history
+
+
  def auto_continue_after_approval(history: list, pending_proposals: list) -> tuple:
+     """Automatically re-enter the agent loop after approval so Kimi sees results.
+
+     CHANGELOG [2026-02-01 - Claude/Opus]
+     PROBLEM: After Josh approved staged operations, results were injected into
+     chat history but Kimi never got another turn. Josh had to type something
+     like "continue" to trigger Kimi to process the tool results.
+
+     FIX: This function is chained via .then() after execute_approved_proposals.
+     It sends a synthetic continuation prompt through the agent loop so Kimi
+     automatically processes the approved tool results and continues working.
+
+     We only continue if the last message in history is our injected results
+     (starts with '✅ **Approved'). This prevents infinite loops if called
+     when there's nothing to continue from.
+
+     Args:
+         history: Chat history (should contain injected results from approval)
+         pending_proposals: Current pending proposals (passed through)
+
+     Returns:
+         Same tuple shape as agent_loop so it can update the same outputs
+     """
+     # Safety check: only continue if the last message is our injected results
+     if not history or history[-1].get("role") != "assistant":
+         return (
+             history, "", pending_proposals,
+             _format_gate_choices(pending_proposals),
+             _stats_label_files(), _stats_label_convos()
+         )
+
+     last_msg = history[-1].get("content", "")
+     if not last_msg.startswith("✅ **Approved"):
+         return (
+             history, "", pending_proposals,
+             _format_gate_choices(pending_proposals),
+             _stats_label_files(), _stats_label_convos()
+         )
+
+     # Re-enter the agent loop with a synthetic continuation prompt.
+     # This tells Kimi "I approved your operations, here are the results,
+     # now keep going with whatever you were doing."
+     return agent_loop(
+         message="[The operations above were approved and executed. Continue with your task using these results.]",
+         history=history,
+         pending_proposals=pending_proposals,
+         uploaded_file=None
+     )
+
+
  def clear_all_proposals(pending_proposals: list) -> tuple:
+     """Discard all pending proposals without executing them.
+
+     CHANGELOG [2026-02-01 - Claude/Opus]
+     Safety valve — lets Josh throw out everything in the queue if the
+     agent went off track.
+
+     Returns:
+         Tuple of (status_message, empty_proposals, updated_gate_choices)
+     """
+     count = len(pending_proposals)
+     return f"🗑️ Cleared {count} proposal(s).", [], _format_gate_choices([])
+
+
  # =============================================================================
  # STATS HELPERS
  # =============================================================================

  # Called both at startup (initial render) and after each conversation turn
  # (to reflect newly indexed files or saved conversations).
  # =============================================================================
+
  def _stats_label_files() -> str:
+     """Format the files stat for the sidebar label."""
+     stats = ctx.get_stats()
+     files = stats.get('total_files', 0)
+     chunks = stats.get('indexed_chunks', 0)
+     indexing = " ⏳" if stats.get('indexing_in_progress') else ""
+     return f"📂 Files: {files} ({chunks} chunks){indexing}"
+
+
  def _stats_label_convos() -> str:
+     """Format the conversations stat for the sidebar label."""
+     stats = ctx.get_stats()
+     convos = stats.get('conversations', 0)
+     cloud = " ☁️" if stats.get('persistence_configured') else ""
+     return f"💾 Conversations: {convos}{cloud}"
+
+
  def refresh_stats() -> tuple:
+     """Refresh both stat labels. Called by the refresh button.
+
+     Returns:
+         Tuple of (files_label, convos_label)
+     """
+     return _stats_label_files(), _stats_label_convos()
+
+
  # =============================================================================
  # UI LAYOUT
  # =============================================================================

  # gr.State holds the pending proposals list (per-session, survives across
  # messages within the same browser tab).
  # =============================================================================
+
  with gr.Blocks(
+     title="🦞 Clawdbot Command Center",
+     # CHANGELOG [2026-02-01 - Claude/Opus]
+     # Gradio 6.0+ moved `theme` from Blocks() to launch(). Passing it here
+     # triggers a UserWarning in 6.x. The theme is set in launch() below instead.
  ) as demo:
+     # Session state for pending proposals
+     pending_proposals_state = gr.State([])
+
+     gr.Markdown("# 🦞 Clawdbot Command Center\n*E-T Systems Vibe Coding Agent*")
+
+     with gr.Tabs():
+         # ==== TAB 1: VIBE CHAT ====
+         with gr.Tab("💬 Vibe Chat"):
+             with gr.Row():
+                 # ---- Sidebar ----
+                 with gr.Column(scale=1, min_width=200):
+                     gr.Markdown("### 📊 System Status")
+                     stats_files = gr.Markdown(_stats_label_files())
+                     stats_convos = gr.Markdown(_stats_label_convos())
+                     refresh_btn = gr.Button("🔄 Refresh Stats", size="sm")
+
+                     gr.Markdown("---")
+                     gr.Markdown("### 📎 Upload Context")
+                     file_input = gr.File(
+                         label="Drop a file here",
+                         file_types=[
+                             '.py', '.js', '.ts', '.json', '.md', '.txt',
+                             '.yaml', '.yml', '.html', '.css', '.sh',
+                             '.toml', '.cfg', '.csv', '.xml'
+                         ]
+                     )
+                     gr.Markdown(
+                         "*Upload code, configs, or docs to include in your message.*"
+                     )
+
+                 # ---- Chat area ----
+                 with gr.Column(scale=4):
+                     chatbot = gr.Chatbot(
+                         # CHANGELOG [2026-02-01 - Claude/Opus]
+                         # Gradio 6.x uses the messages format by default.
+                         # The type="messages" param was removed in 6.0 —
+                         # passing it causes a TypeError on init.
+                         height=600,
+                         show_label=False,
+                         avatar_images=(None, "https://em-content.zobj.net/source/twitter/408/lobster_1f99e.png"),
+                     )
+                     with gr.Row():
+                         msg = gr.Textbox(
+                             placeholder="Ask Clawdbot to search, read, or code...",
+                             show_label=False,
+                             scale=6,
+                             lines=2,
+                             max_lines=10,
+                         )
+                         send_btn = gr.Button("Send", variant="primary", scale=1)
+
+             # Chat wiring happens in the EVENT WIRING section below, after the
+             # Gate tab components that the outputs reference are defined.
+             chat_inputs = [msg, chatbot, pending_proposals_state, file_input]
+
+         # ==== TAB 2: BUILD APPROVAL GATE ====
+         with gr.Tab("🛡️ Build Approval Gate"):
+             gr.Markdown(
+                 "### Review Staged Operations\n"
+                 "Write operations (file writes, shell commands, branch creation) "
+                 "are staged here for your review before execution.\n\n"
+                 "**Select proposals to approve, then click Execute.**"
+             )
+             gate_list = gr.CheckboxGroup(
+                 label="Pending Proposals",
+                 choices=[],
+                 interactive=True
+             )
+             with gr.Row():
+                 btn_exec = gr.Button("✅ Execute Selected", variant="primary")
+                 btn_clear = gr.Button("🗑️ Clear All", variant="secondary")
+             gate_results = gr.Markdown("*No operations executed yet.*")
+
+     # ==================================================================
+     # EVENT WIRING
+     # ==================================================================
+     # CHANGELOG [2026-02-01 - Claude/Opus]
+     # All events are wired here, after all components are defined, so
+     # cross-tab references work (e.g., the chat updating gate_list).
+     # ==================================================================
+
+     # Chat submission (both the Enter key and the Send button)
+     full_chat_outputs = [
+         chatbot, msg, pending_proposals_state,
+         gate_list, stats_files, stats_convos
+     ]
+
+     msg.submit(
+         fn=agent_loop,
+         inputs=chat_inputs,
+         outputs=full_chat_outputs
+     )
+     send_btn.click(
+         fn=agent_loop,
+         inputs=chat_inputs,
+         outputs=full_chat_outputs
+     )
+
+     # Refresh stats button
+     refresh_btn.click(
+         fn=refresh_stats,
+         inputs=[],
+         outputs=[stats_files, stats_convos]
+     )
+
+     # Build Approval Gate buttons
+     # CHANGELOG [2026-02-01 - Claude/Opus]
+     # btn_exec takes chatbot as input AND output so approved operation
+     # results get injected into the conversation history. The .then() chain
+     # automatically re-enters the agent loop so Kimi processes the results
+     # without Josh having to type "continue".
+     btn_exec.click(
+         fn=execute_approved_proposals,
+         inputs=[gate_list, pending_proposals_state, chatbot],
+         outputs=[gate_results, pending_proposals_state, gate_list, chatbot]
+     ).then(
+         fn=auto_continue_after_approval,
+         inputs=[chatbot, pending_proposals_state],
+         outputs=[chatbot, msg, pending_proposals_state,
+                  gate_list, stats_files, stats_convos]
+     )
+     btn_clear.click(
+         fn=clear_all_proposals,
+         inputs=[pending_proposals_state],
+         outputs=[gate_results, pending_proposals_state, gate_list]
+     )
+
+
  # =============================================================================
  # LAUNCH
  # =============================================================================

  # Standard HF Spaces launch config. 0.0.0.0 binds to all interfaces
  # (required for Docker). Port 7860 is the HF Spaces standard.
  # =============================================================================
+
  if __name__ == "__main__":
+     demo.launch(server_name="0.0.0.0", server_port=7860, theme=gr.themes.Soft())