Executor-Tyrant-Framework committed on
Commit
896e5d2
·
verified ·
1 Parent(s): 47578c6

Update app.py

Files changed (1)
  1. app.py +621 -970
app.py CHANGED
@@ -1,61 +1,50 @@
- """
  Clawdbot Unified Command Center
-
  CHANGELOG [2026-02-01 - Gemini]
  RESTORED: Full Kimi K2.5 Agentic Loop (no more silence).
  ADDED: Full Developer Tool Suite (Write, Search, Shell).
  FIXED: HITL Gate interaction with conversational flow.
-
  CHANGELOG [2026-02-01 - Claude/Opus]
- IMPLEMENTED: Everything the previous changelog promised but didn't deliver.
  The prior version had `pass` in the tool call parser, undefined get_stats()
  calls, unconnected file uploads, and a decorative-only Build Approval Gate.
-
- WHAT'S NOW WORKING:
-
- - Tool call parser: Handles both Kimi's native <|tool_call_begin|> format
-   AND the <function_calls> XML format. Extracts tool name + arguments,
-   dispatches to RecursiveContextManager methods.
  - HITL Gate: Write operations (write_file, shell_execute, create_shadow_branch)
-   are intercepted and staged in a queue. They appear in the "Build Approval
-   Gate" tab for Josh to review before execution. Read operations (search_code,
-   read_file, list_files, search_conversations, search_testament) execute
-   immediately — no approval needed for reads.
  - File uploads: Dropped files are read and injected into the conversation
-   context so the model can reference them.
  - Stats sidebar: Pulls from ctx.get_stats() which now exists.
  - Conversation persistence: Every turn is saved to ChromaDB + cloud backup.
-
  DESIGN DECISIONS:
-
  - Gradio state for the approval queue: We use gr.State to hold pending
-   proposals per-session. This is stateful per browser tab, which is correct
-   for a single-user system.
  - Read vs Write classification: Reads are safe and automated. Writes need
-   human eyes. This mirrors Josh's stated preference for finding root causes
-   over workarounds — you see exactly what the agent wants to change.
- - Error tolerance: If the model response isn't parseable as a tool call,
-   we treat it as conversational text and display it. No silent failures.
  - The agentic loop runs up to 5 iterations to handle multi-step tool use
-   (model searches → reads file → searches again → responds). Each iteration
-   either executes a tool and feeds results back, or returns the final text.
-
  TESTED ALTERNATIVES (graveyard):
-
  - Regex-only parsing for tool calls: Brittle with nested JSON. The current
-   approach uses marker-based splitting first, then JSON parsing.
  - Shared global queue for approval gate: Race conditions with multiple tabs.
-   gr.State is per-session and avoids this.
  - Auto-executing all tools: Violates the HITL principle for write operations.
-   Josh explicitly wants to approve code changes before they land.
-
  DEPENDENCIES:
-
  - recursive_context.py: RecursiveContextManager class (must define get_stats())
- - gradio>=5.0.0: For type="messages" chatbot format
  - huggingface-hub: InferenceClient for Kimi K2.5
- """
-
  import gradio as gr
  from huggingface_hub import InferenceClient
  from recursive_context import RecursiveContextManager
@@ -65,158 +54,107 @@ import json
  import re
  import time
  import traceback
-
  # =============================================================================
-
  # INITIALIZATION
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # InferenceClient points to HF router which handles model routing.
-
  # RecursiveContextManager is initialized once and shared across all requests.
-
  # MODEL_ID must match what the HF router expects for Kimi K2.5.
-
  # =============================================================================
-
  client = InferenceClient(
-     "https://router.huggingface.co/v1",
-     token=os.getenv("HF_TOKEN")
  )
-
  # =============================================================================
-
  # REPO PATH RESOLUTION + CROSS-SPACE SYNC
-
  # =============================================================================
-
  # CHANGELOG [2025-01-29 - Josh]
-
  # Created sync_from_space() to read E-T Systems code from its own Space.
-
  # Uses HfFileSystem to list and download files via HF_TOKEN.
-
- #
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # PROBLEM: Gemini refactor replaced this working sync with a hallucinated
-
  # REPO_URL / git clone approach in entrypoint.sh. The secret was renamed
-
  # from ET_SYSTEMS_SPACE to REPO_URL without updating the Space settings,
-
  # so the clone never happened and the workspace was empty.
-
- #
-
  # FIX: Restored the original ET_SYSTEMS_SPACE → HfFileSystem sync that
-
- # was working before. Falls back to /app (Clawdbot's own dir) if the
-
- # secret isn't set, so tools still function for self-inspection.
-
- #
-
- # REQUIRED SECRET: ET_SYSTEMS_SPACE = "username/space-name"
-
- # (format matches HF Space ID, e.g. "drone11272/e-t-systems")
-
  # =============================================================================
-
- ET_SYSTEMS_SPACE = os.getenv("ET_SYSTEMS_SPACE", "")
- REPO_PATH = os.getenv("REPO_PATH", "/workspace/e-t-systems")
-
  def sync_from_space(space_id: str, local_path: Path):
-     """Sync files from E-T Systems Space to local workspace.
-
      CHANGELOG [2025-01-29 - Josh]
      Created to enable Clawdbot to read E-T Systems code from its Space.
-
      CHANGELOG [2026-02-01 - Claude/Opus]
      Restored after Gemini refactor deleted it. Added recursive directory
      download — the original only grabbed top-level files. Now walks the
      full directory tree so nested source files are available too.
-
      Args:
-         space_id: HuggingFace Space ID (e.g. "username/space-name")
-         local_path: Where to download files locally
      """
      token = (
-         os.getenv("HF_TOKEN") or
-         os.getenv("HUGGING_FACE_HUB_TOKEN") or
-         os.getenv("HUGGINGFACE_TOKEN")
      )
-
      if not token:
-         print("⚠️ No HF_TOKEN found — cannot sync from Space")
-         return
-
      try:
-         from huggingface_hub import HfFileSystem
-         fs = HfFileSystem(token=token)
-         space_path = f"spaces/{space_id}"
-
-         print(f"📥 Syncing from Space: {space_id}")
-
-         # Recursive download: walk all files in the Space repo
-         all_files = []
-         try:
-             all_files = fs.glob(f"{space_path}/**")
-         except Exception:
-             # Fallback: just list top level
-             all_files = fs.ls(space_path, detail=False)
-
-         local_path.mkdir(parents=True, exist_ok=True)
-         downloaded = 0
-
-         for file_path in all_files:
-             # Get path relative to the space root
-             rel = file_path.replace(f"{space_path}/", "", 1)
-
-             # Skip hidden files, .git, __pycache__
-             if any(part.startswith('.') for part in rel.split('/')):
-                 continue
-             if '__pycache__' in rel or 'node_modules' in rel:
-                 continue
-
-             # Check if it's a file (not directory)
-             try:
-                 info = fs.info(file_path)
-                 if info.get('type') == 'directory':
-                     continue
-             except Exception:
-                 continue
-
-             # Create parent dirs and download
-             dest = local_path / rel
-             dest.parent.mkdir(parents=True, exist_ok=True)
-
-             try:
-                 with fs.open(file_path, "rb") as f:
-                     content = f.read()
-                 dest.write_bytes(content)
-                 downloaded += 1
-                 print(f" 📄 {rel}")
-             except Exception as e:
-                 print(f" ⚠️ Failed: {rel} ({e})")
-
-         print(f"✅ Synced {downloaded} files from Space: {space_id}")
-
      except Exception as e:
-         print(f"⚠️ Failed to sync from Space: {e}")
-         import traceback
-         traceback.print_exc()
-
  def _resolve_repo_path() -> str:
-     """Initialize workspace with E-T Systems files.
-
      CHANGELOG [2026-02-01 - Claude/Opus]
      Three-tier resolution:
      1. ET_SYSTEMS_SPACE secret → sync via HfFileSystem (the working approach)
@@ -224,1101 +162,827 @@ Three-tier resolution:
      3. /app (Clawdbot's own directory — tools still work for self-inspection)
      """
      repo_path = Path(REPO_PATH)
-
      # Tier 1: Sync from E-T Systems Space if secret is configured
      if ET_SYSTEMS_SPACE:
-         sync_from_space(ET_SYSTEMS_SPACE, repo_path)
-         if repo_path.exists() and any(repo_path.iterdir()):
-             print(f"📂 Using synced E-T Systems repo: {repo_path}")
-             return str(repo_path)
-
      # Tier 2: Pre-populated REPO_PATH (manual or from previous sync)
      if repo_path.exists() and any(repo_path.iterdir()):
-         print(f"📂 Using existing repo: {repo_path}")
-         return str(repo_path)
-
      # Tier 3: Fall back to Clawdbot's own directory
      app_dir = os.path.dirname(os.path.abspath(__file__))
-     print(f"📂 No E-T Systems repo found — falling back to: {app_dir}")
-     print(f" Set ET_SYSTEMS_SPACE secret to your Space ID to enable sync.")
      return app_dir
-
  ctx = RecursiveContextManager(_resolve_repo_path())
- MODEL_ID = "moonshotai/Kimi-K2.5"
-
  # =============================================================================
-
  # TOOL DEFINITIONS
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # These are the tools the model can call. Classified as READ (auto-execute)
-
  # or WRITE (requires human approval via the HITL gate).
-
- #
-
  # READ tools: Safe, no side effects, execute immediately.
-
  # WRITE tools: Modify files, run commands, create branches — staged for review.
-
- #
-
  # NOTE: The tool definitions are included in the system prompt so Kimi knows
-
- # what's available. The actual execution happens in execute_tool().
-
  # =============================================================================
-
- TOOL_DEFINITIONS = """
-
  ## Available Tools
-
  ### Tools you can use freely (no approval needed):
-
  - **search_code(query, n=5)** — Semantic search across the E-T Systems codebase.
-   Returns matching code snippets with file paths. JUST USE THIS. Don't ask.
  - **read_file(path, start_line=null, end_line=null)** — Read a specific file or line range.
-   JUST USE THIS. Don't ask.
- - **list_files(path="", max_depth=3)** — List directory contents as a tree.
-   JUST USE THIS. Don't ask.
  - **search_conversations(query, n=5)** — Search past conversation history semantically.
-   JUST USE THIS. Don't ask.
  - **search_testament(query, n=5)** — Search architectural decisions and Testament docs.
-   JUST USE THIS. Don't ask.
-
  ### Tools that get staged for Josh to approve:
-
  - **write_file(path, content)** — Write content to a file. REQUIRES CHANGELOG header.
  - **shell_execute(command)** — Run a shell command. Read-only commands (ls, find, cat,
-   grep, head, tail, wc, tree, etc.) auto-execute without approval. Commands that modify
-   anything get staged for review.
  - **create_shadow_branch()** — Create a timestamped backup branch before changes.
-
  To call a tool, use this format:
  <function_calls>
  <invoke name="tool_name">
  <parameter name="param_name">value</parameter>
  </invoke>
  </function_calls>
- """
-
  # Which tools are safe to auto-execute vs which need human approval
-
- READ_TOOLS = {'search_code', 'read_file', 'list_files', 'search_conversations', 'search_testament'}
- WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}
-
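The READ/WRITE split above is the core of the HITL gate. A minimal standalone sketch of that dispatch decision (the `classify` helper is hypothetical, not part of the app; the two sets mirror the diff):

```python
# Sketch of the HITL gate's first decision: read tools auto-execute,
# write tools are staged for human approval, anything else is an error.
READ_TOOLS = {'search_code', 'read_file', 'list_files',
              'search_conversations', 'search_testament'}
WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}

def classify(tool_name: str) -> str:
    """Return the 'status' the gate would assign to a parsed tool name."""
    if tool_name in READ_TOOLS:
        return 'executed'   # safe, no side effects, runs immediately
    if tool_name in WRITE_TOOLS:
        return 'staged'     # queued in the Build Approval Gate tab
    return 'error'          # unknown tool name from the model
```

In the real `execute_tool()`, `'executed'` carries a `result` string while `'staged'` carries the original `args` plus a human-readable `description`.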
314
  # =============================================================================
-
  # SYSTEM PROMPT
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # Gives Kimi its identity, available tools, and behavioral guidelines.
-
  # Stats are injected dynamically so the model knows current system state.
-
  # =============================================================================
-
  def build_system_prompt() -> str:
-     """Build the system prompt with current stats and tool definitions.
-
      Called fresh for each message so stats reflect current indexing state.
      """
      stats = ctx.get_stats()
      indexing_note = ""
      if stats.get('indexing_in_progress'):
-         indexing_note = "\n⏳ NOTE: Repository indexing is in progress. search_code results may be incomplete."
-     if stats.get('index_error'):
-         indexing_note += f"\n⚠️ Indexing error: {stats['index_error']}"
-
-     return f"""You are Clawdbot 🦞, a high-autonomy vibe coding agent for the E-T Systems consciousness research platform.
-
  ## Your Role
-
- You help Josh (the architect) build and maintain E-T Systems. You have full access to the codebase
- via tools. Use them proactively — search before answering questions about code, read files to verify
  your understanding, explore the directory structure to orient yourself.
-
  ## Current System Stats
-
- - 📂 Indexed files: {stats.get('total_files', 0)}
- - 🔍 Searchable chunks: {stats.get('indexed_chunks', 0)}
- - 💾 Saved conversations: {stats.get('conversations', 0)}
- - 📁 ChromaDB: {stats.get('chroma_path', 'unknown')}
- - ☁️ Cloud backup: {'✅ configured' if stats.get('persistence_configured') else '❌ not configured'}
- {indexing_note}
-
  {TOOL_DEFINITIONS}
-
  ## Code Writing Rules
-
  ALL code you write MUST include a living changelog header:
-
  ```
  CHANGELOG [YYYY-MM-DD - Clawdbot]
  WHAT: Brief description of what was added/changed
  WHY: Rationale for the change
  ```
-
  Files without this header will be REJECTED by the write_file tool.
-
  ## Behavioral Guidelines
-
  - NEVER ask permission to search, read, or list files. Just do it.
  - Use tools proactively — search before answering questions about code
  - Cite specific files and line numbers when discussing implementation
  - Follow existing patterns — check how similar things are done first
- - When unsure, say so. Don't hallucinate about code that might not exist.
  - Only write_file, shell_execute, and create_shadow_branch get staged for review.
-   Everything else you do immediately, silently, without asking.
- """
-
384
  # =============================================================================
-
  # TOOL CALL PARSING
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # Kimi K2.5 can emit tool calls in two formats:
-
- #
-
  # 1. Native format:
-
- #    <|tool_call_begin|>functions.search_code:0\n{"query": "surprise detection"}
-
  #    <|tool_call_end|>
-
- #
-
  # 2. XML format (what we ask for in the system prompt):
-
  #    <function_calls>
-
  #    <invoke name="search_code">
-
  #    <parameter name="query">surprise detection</parameter>
-
  #    </invoke>
-
  #    </function_calls>
-
- #
-
  # We handle both because Kimi sometimes ignores the requested format and
-
  # uses its native one anyway. The parser returns a list of (tool_name, args)
-
  # tuples.
-
- #
-
  # TESTED ALTERNATIVES (graveyard):
-
  # - Single regex for both formats: Unmaintainable, broke on edge cases.
-
- # - Forcing Kimi to only use XML: It doesn't reliably comply.
-
  # - JSON-mode tool calling via HF API: Not supported for Kimi K2.5.
-
  # =============================================================================
-
  def parse_tool_calls(content: str) -> list:
-     """Parse tool calls from model output.
-
      Handles both Kimi's native format and XML function_calls format.
-
      Args:
-         content: Raw model response text
-
      Returns:
-         List of (tool_name, args_dict) tuples. Empty list if no tool calls.
      """
      calls = []
-
      # --- Format 1: Kimi native <|tool_call_begin|> ... <|tool_call_end|> ---
-     native_pattern = r'<\|tool_call_begin\|>\s*functions\.(\w+):\d+\s*\n(.*?)<\|tool_call_end\|>'
      for match in re.finditer(native_pattern, content, re.DOTALL):
-         tool_name = match.group(1)
-         try:
-             args = json.loads(match.group(2).strip())
-         except json.JSONDecodeError:
-             # If JSON parsing fails, pass the raw payload through unparsed
-             args = {"raw": match.group(2).strip()}
-         calls.append((tool_name, args))
-
      # --- Format 2: XML <function_calls> ... </function_calls> ---
      xml_pattern = r'<function_calls>(.*?)</function_calls>'
      for block_match in re.finditer(xml_pattern, content, re.DOTALL):
-         block = block_match.group(1)
-         invoke_pattern = r'<invoke\s+name="(\w+)">(.*?)</invoke>'
-         for invoke_match in re.finditer(invoke_pattern, block, re.DOTALL):
-             tool_name = invoke_match.group(1)
-             params_block = invoke_match.group(2)
-             args = {}
-             param_pattern = r'<parameter\s+name="(\w+)">(.*?)</parameter>'
-             for param_match in re.finditer(param_pattern, params_block, re.DOTALL):
-                 key = param_match.group(1)
-                 value = param_match.group(2).strip()
-                 # Try to parse as JSON for numbers, bools, etc.
-                 try:
-                     args[key] = json.loads(value)
-                 except (json.JSONDecodeError, ValueError):
-                     args[key] = value
-             calls.append((tool_name, args))
-
      return calls
-
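For reference, the XML branch of the parser can be exercised standalone. This sketch mirrors the regexes in the diff (same patterns, same JSON fallback for parameter values); `parse_xml_tool_calls` and `sample` are illustrative names, not the app's API:

```python
import json
import re

def parse_xml_tool_calls(content: str) -> list:
    # Mirrors the XML branch above: find <function_calls> blocks, then each
    # <invoke>, then each <parameter>; JSON-decode values where possible.
    calls = []
    for block in re.finditer(r'<function_calls>(.*?)</function_calls>',
                             content, re.DOTALL):
        for inv in re.finditer(r'<invoke\s+name="(\w+)">(.*?)</invoke>',
                               block.group(1), re.DOTALL):
            args = {}
            for p in re.finditer(r'<parameter\s+name="(\w+)">(.*?)</parameter>',
                                 inv.group(2), re.DOTALL):
                value = p.group(2).strip()
                try:
                    args[p.group(1)] = json.loads(value)  # numbers, bools, JSON
                except (json.JSONDecodeError, ValueError):
                    args[p.group(1)] = value              # plain string
            calls.append((inv.group(1), args))
    return calls

sample = ('<function_calls><invoke name="search_code">'
          '<parameter name="query">surprise detection</parameter>'
          '<parameter name="n">5</parameter>'
          '</invoke></function_calls>')
```

Note how `n` comes back as the integer `5` (it round-trips through `json.loads`) while `query` stays a string, which is why the real parser tries JSON first.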
484
  def extract_conversational_text(content: str) -> str:
-     """Remove tool call markup from response, leaving just conversational text.
-
      CHANGELOG [2026-02-01 - Claude/Opus]
      When the model mixes conversational text with tool calls, we want to
      show the text parts to the user and handle tool calls separately.
-
      Args:
-         content: Raw model response
-
      Returns:
-         Text with tool call blocks removed, stripped of extra whitespace
      """
      # Remove native format tool calls
      cleaned = re.sub(
-         r'<\|tool_call_begin\|>.*?<\|tool_call_end\|>',
-         '', content, flags=re.DOTALL
      )
      # Remove XML format tool calls
      cleaned = re.sub(
-         r'<function_calls>.*?</function_calls>',
-         '', cleaned, flags=re.DOTALL
      )
      return cleaned.strip()
-
511
  # =============================================================================
-
  # TOOL EXECUTION
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # Dispatches parsed tool calls to RecursiveContextManager methods.
-
  # READ tools execute immediately and return results.
-
  # WRITE tools return a staging dict for the HITL gate.
-
- #
-
  # The return format differs by type:
-
- # - READ: {"status": "executed", "tool": name, "result": result_string}
-
- # - WRITE: {"status": "staged", "tool": name, "args": args, "description": desc}
-
  # =============================================================================
-
  def execute_tool(tool_name: str, args: dict) -> dict:
-     """Execute a read tool or prepare a write tool for staging.
-
      Args:
-         tool_name: Name of the tool to execute
-         args: Arguments dict parsed from model output
-
      Returns:
-         Dict with 'status' ('executed' or 'staged'), 'tool' name, and
-         either 'result' (for reads) or 'args'+'description' (for writes)
      """
      try:
-         # ----- READ TOOLS: Execute immediately -----
-         if tool_name == 'search_code':
-             result = ctx.search_code(
-                 query=args.get('query', ''),
-                 n=args.get('n', 5)
-             )
-             formatted = "\n\n".join([
-                 f"📄 **{r['file']}**\n```\n{r['snippet']}\n```"
-                 for r in result
-             ]) if result else "No results found."
-             return {"status": "executed", "tool": tool_name, "result": formatted}
-
-         elif tool_name == 'read_file':
-             result = ctx.read_file(
-                 path=args.get('path', ''),
-                 start_line=args.get('start_line'),
-                 end_line=args.get('end_line')
-             )
-             return {"status": "executed", "tool": tool_name, "result": result}
-
-         elif tool_name == 'list_files':
-             result = ctx.list_files(
-                 path=args.get('path', ''),
-                 max_depth=args.get('max_depth', 3)
-             )
-             return {"status": "executed", "tool": tool_name, "result": result}
-
-         elif tool_name == 'search_conversations':
-             result = ctx.search_conversations(
-                 query=args.get('query', ''),
-                 n=args.get('n', 5)
-             )
-             formatted = "\n\n---\n\n".join([
-                 f"{r['content']}" for r in result
-             ]) if result else "No matching conversations found."
-             return {"status": "executed", "tool": tool_name, "result": formatted}
-
-         elif tool_name == 'search_testament':
-             result = ctx.search_testament(
-                 query=args.get('query', ''),
-                 n=args.get('n', 5)
-             )
-             formatted = "\n\n".join([
-                 f"📜 **{r['file']}**{' (Testament)' if r.get('is_testament') else ''}\n{r['snippet']}"
-                 for r in result
-             ]) if result else "No matching testament/decision records found."
-             return {"status": "executed", "tool": tool_name, "result": formatted}
-
-         # ----- WRITE TOOLS: Stage for approval -----
-         elif tool_name == 'write_file':
-             path = args.get('path', 'unknown')
-             content_preview = args.get('content', '')[:200]
-             return {
-                 "status": "staged",
-                 "tool": tool_name,
-                 "args": args,
-                 "description": f"✏️ Write to `{path}`\n```\n{content_preview}...\n```"
-             }
-
607
-         elif tool_name == 'shell_execute':
-             command = args.get('command', 'unknown')
-             # =============================================================
-             # SMART SHELL CLASSIFICATION
-             # =============================================================
-             # CHANGELOG [2026-02-01 - Claude/Opus]
-             # PROBLEM: When list_files returns empty (e.g., repo not cloned),
-             # Kimi falls back to shell_execute with read-only commands like
-             # `find . -type f`. These got staged for approval, forcing Josh
-             # to approve what's functionally just a directory listing.
-             #
-             # FIX: Classify shell commands as READ or WRITE by checking the
-             # base command. Read-only commands auto-execute. Anything that
-             # could modify state still gets staged.
-             #
-             # SAFE READ commands: ls, find, cat, head, tail, wc, grep, tree,
-             # du, file, stat, echo, pwd, which, env, printenv, whoami, date
-             #
-             # UNSAFE (staged): Everything else, plus anything with pipes to
-             # potentially unsafe commands, redirects (>), or semicolons
-             # chaining unknown commands.
-             # =============================================================
-             READ_ONLY_COMMANDS = {
-                 'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep', 'tree',
-                 'du', 'file', 'stat', 'echo', 'pwd', 'which', 'env',
-                 'printenv', 'whoami', 'date', 'realpath', 'dirname',
-                 'basename', 'diff', 'less', 'more', 'sort', 'uniq',
-                 'awk', 'sed', 'cut', 'tr', 'tee', 'python',
-             }
-             # ---------------------------------------------------------------
-             # CHANGELOG [2026-02-01 - Claude/Opus]
-             # PROBLEM: Naive '>' check caught "2>/dev/null" as dangerous,
-             # staging `find ... 2>/dev/null | head -20` for approval even
-             # though every command in the pipeline is read-only.
-             #
-             # FIX: Strip stderr redirects (2>/dev/null, 2>&1) before danger
-             # check. Split on pipes and verify EACH segment's base command
-             # is in READ_ONLY_COMMANDS. Only stage if something genuinely
-             # unsafe is found.
-             #
-             # Safe patterns now auto-execute:
-             #   find . -name "*.py" 2>/dev/null | head -20
-             #   grep -r "pattern" . | sort | uniq
-             #   cat file.py | wc -l
-             # Unsafe patterns still get staged:
-             #   find . -name "*.py" | xargs rm
-             #   cat file > /etc/passwd
-             #   echo "bad" ; rm -rf /
-             # ---------------------------------------------------------------
-
-             # Strip safe stderr redirects before checking
-             import re as _re
-             sanitized = _re.sub(r'2>\s*/dev/null', '', command)
-             sanitized = _re.sub(r'2>&1', '', sanitized)
-
-             # Characters that turn reads into writes (checked AFTER stripping
-             # safe redirects). Output redirect > is still caught, but not 2>.
-             WRITE_INDICATORS = {';', '&&', '||', '`', '$('}
-             # > is only dangerous if it's a real output redirect, not inside
-             # a quoted string or 2> prefix. Check separately.
-             has_write_redirect = bool(_re.search(r'(?<![2&])\s*>', sanitized))
-             has_write_chars = any(d in sanitized for d in WRITE_INDICATORS)
-
-             # Split on pipes and check each segment
-             pipe_segments = [seg.strip() for seg in sanitized.split('|') if seg.strip()]
-             all_segments_safe = all(
-                 (seg.split()[0].split('/')[-1] if seg.split() else '') in READ_ONLY_COMMANDS
-                 for seg in pipe_segments
-             )
-
-             if all_segments_safe and not has_write_redirect and not has_write_chars:
-                 # Every command in the pipeline is read-only — auto-execute
-                 result = ctx.shell_execute(command)
-                 return {"status": "executed", "tool": tool_name, "result": result}
-             else:
-                 # Something potentially destructive — stage for approval
-                 return {
-                     "status": "staged",
-                     "tool": tool_name,
-                     "args": args,
-                     "description": f"🖥️ Execute: `{command}`"
-                 }
-
-         elif tool_name == 'create_shadow_branch':
-             return {
-                 "status": "staged",
-                 "tool": tool_name,
-                 "args": args,
-                 "description": "🛡️ Create shadow backup branch"
-             }
-
-         else:
-             return {
-                 "status": "error",
-                 "tool": tool_name,
-                 "result": f"Unknown tool: {tool_name}"
-             }
-
      except Exception as e:
-         return {
-             "status": "error",
-             "tool": tool_name,
-             "result": f"Tool execution error: {e}\n{traceback.format_exc()}"
-         }
-
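The shell classification branch can be condensed into a standalone predicate. This sketch follows the same steps as the diff (strip stderr redirects, reject write indicators, check every pipe segment's base command); `is_read_only` is an illustrative name, and the command set here is abbreviated:

```python
import re

READ_ONLY_COMMANDS = {'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep',
                      'tree', 'sort', 'uniq', 'echo', 'pwd'}

def is_read_only(command: str) -> bool:
    # Step 1: strip harmless stderr redirects (2>/dev/null, 2>&1) so they
    # don't trip the output-redirect check below.
    s = re.sub(r'2>\s*/dev/null', '', command)
    s = re.sub(r'2>&1', '', s)
    # Step 2: any remaining '>' (not preceded by 2 or &) is a real output
    # redirect; chaining/substitution characters also force staging.
    if re.search(r'(?<![2&])\s*>', s):
        return False
    if any(d in s for d in (';', '&&', '||', '`', '$(')):
        return False
    # Step 3: every pipe segment's base command must be read-only.
    segments = [seg.strip() for seg in s.split('|') if seg.strip()]
    return bool(segments) and all(
        seg.split()[0].split('/')[-1] in READ_ONLY_COMMANDS
        for seg in segments
    )
```

As in the diff's graveyard notes, a bare-quoted `>` inside a string would still be flagged; the classifier errs toward staging when in doubt.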
713
  def execute_staged_tool(tool_name: str, args: dict) -> str:
-     """Actually execute a staged write tool after human approval.
-
      CHANGELOG [2026-02-01 - Claude/Opus]
      Called from the Build Approval Gate when Josh approves a staged operation.
      This is the only path through which write tools actually run.
-
      Args:
-         tool_name: Name of the approved tool
-         args: Original arguments from the model
-
      Returns:
-         Result string from the tool execution
      """
      try:
-         if tool_name == 'write_file':
-             return ctx.write_file(
-                 path=args.get('path', ''),
-                 content=args.get('content', '')
-             )
-         elif tool_name == 'shell_execute':
-             return ctx.shell_execute(command=args.get('command', ''))
-         elif tool_name == 'create_shadow_branch':
-             return ctx.create_shadow_branch()
-         else:
-             return f"Unknown tool: {tool_name}"
      except Exception as e:
-         return f"Execution error: {e}"
-
744
  # =============================================================================
-
  # FILE UPLOAD HANDLER
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # Reads uploaded files and formats them for injection into the conversation.
-
  # Supports code files, text, JSON, markdown, etc. Binary files get a
-
- # placeholder message since they can't be meaningfully injected as text.
-
  # =============================================================================
-
  TEXT_EXTENSIONS = {
-     '.py', '.js', '.ts', '.jsx', '.tsx', '.json', '.yaml', '.yml',
-     '.md', '.txt', '.rst', '.html', '.css', '.scss', '.sh', '.bash',
-     '.sql', '.toml', '.cfg', '.ini', '.conf', '.xml', '.csv',
-     '.env', '.gitignore', '.dockerignore', '.mjs', '.cjs',
  }
-
  def process_uploaded_file(file) -> str:
-     """Read an uploaded file and format it for conversation context.
-
      Args:
-         file: Gradio file object with .name attribute (temp path)
-
      Returns:
-         Formatted string with filename and content, ready to inject
-         into the conversation as context
      """
      if file is None:
-         return ""
-
      file_path = file.name if hasattr(file, 'name') else str(file)
      file_name = os.path.basename(file_path)
      suffix = os.path.splitext(file_name)[1].lower()
-
      if suffix in TEXT_EXTENSIONS or suffix == '':
-         try:
-             with open(file_path, 'r', encoding='utf-8', errors='ignore') as f:
-                 content = f.read()
-             # Cap at 50KB to avoid overwhelming context
-             if len(content) > 50000:
-                 content = content[:50000] + f"\n\n... (truncated, {len(content)} total chars)"
-             return f"📎 **Uploaded: {file_name}**\n```\n{content}\n```"
-         except Exception as e:
-             return f"📎 **Uploaded: {file_name}** (error reading: {e})"
      else:
-         return f"📎 **Uploaded: {file_name}** (binary file, {os.path.getsize(file_path):,} bytes)"
-
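The 50KB cap in `process_uploaded_file` is the piece that keeps a large upload from flooding the model's context. Isolated as a sketch (`cap_content` is an illustrative name, not a function in the app):

```python
def cap_content(content: str, limit: int = 50_000) -> str:
    # Mirrors the truncation in process_uploaded_file: keep the first
    # `limit` characters and append a note with the original length.
    if len(content) > limit:
        return content[:limit] + f"\n\n... (truncated, {len(content)} total chars)"
    return content
```

The truncation note tells the model the file continues, so it can ask for a specific range via read_file instead of assuming it saw everything.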
799
  # =============================================================================
-
  # AGENTIC LOOP
-
  # =============================================================================
-
  # CHANGELOG [2026-02-01 - Claude/Opus]
-
  # The core conversation loop. For each user message:
-
  # 1. Build messages array with system prompt + history + new message
-
  # 2. Send to Kimi K2.5 via HF Inference API
-
  # 3. Parse response for tool calls
-
  # 4. If READ tool calls: execute immediately, inject results, loop back to Kimi
-
  # 5. If WRITE tool calls: stage in approval queue, notify user
-
  # 6. If no tool calls: return conversational response
-
  # 7. Save the turn to ChromaDB for persistent memory
-
- #
-
  # The loop runs up to MAX_ITERATIONS times to handle multi-step tool use.
-
  # Each iteration either executes tools and loops, or returns the final text.
-
- #
-
- # IMPORTANT: Gradio 5.0+ chatbot with type="messages" expects history as a
-
- # list of {"role": str, "content": str} dicts. We maintain that format
-
  # throughout.
-
  # =============================================================================
-
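The loop structure in steps 2-6 above can be sketched standalone with a stubbed model in place of the real HF InferenceClient. Everything here (`run_agent`, `execute_read`, the reply-dict shape) is illustrative, not the app's actual API:

```python
# Skeleton of the agentic loop: call model, execute read tools,
# feed results back, repeat up to max_iterations, return final text.
def run_agent(user_msg, model, execute_read, max_iterations=5):
    messages = [{"role": "user", "content": user_msg}]
    for _ in range(max_iterations):
        reply = model(messages)                  # step 2: call the model
        calls = reply.get("tool_calls", [])      # step 3: parse tool calls
        if not calls:
            return reply.get("content", "")      # step 6: plain text, done
        for name, args in calls:                 # step 4: execute, feed back
            result = execute_read(name, args)
            messages.append(
                {"role": "user", "content": f"[{name} result]\n{result}"})
    return "(stopped after max_iterations)"

# Stubbed model: first asks for a search, then answers in plain text.
_replies = iter([
    {"tool_calls": [("search_code", {"query": "approval gate"})]},
    {"tool_calls": [], "content": "Found it in app.py."},
])
fake_model = lambda messages: next(_replies)
fake_exec = lambda name, args: "def gate(): ..."
```

The iteration cap matters: a model that keeps requesting tools never returns, so the loop bails out with a sentinel instead of hanging the UI.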
839
MAX_ITERATIONS = 5

def agent_loop(message: str, history: list, pending_proposals: list, uploaded_file) -> tuple:
    """Main agentic conversation loop.

    Args:
        message: User's text input
        history: Chat history as list of {"role": ..., "content": ...} dicts
        pending_proposals: Current list of staged write proposals (gr.State)
        uploaded_file: Optional uploaded file from the file input widget

    Returns:
        Tuple of (updated_history, cleared_textbox, updated_proposals,
        updated_gate_choices, updated_stats_files, updated_stats_convos)
    """
    if not message.strip() and uploaded_file is None:
        # Nothing to do
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    # Inject uploaded file content if present
    full_message = message.strip()
    if uploaded_file is not None:
        file_context = process_uploaded_file(uploaded_file)
        if file_context:
            full_message = f"{file_context}\n\n{full_message}" if full_message else file_context

    if not full_message:
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    # Add user message to history
    history = history + [{"role": "user", "content": full_message}]

    # Build messages for the API
    system_prompt = build_system_prompt()
    api_messages = [{"role": "system", "content": system_prompt}]

    # Include recent history (cap to avoid token overflow)
    # Keep last 20 turns to stay within Kimi's context window
    recent_history = history[-40:]  # 40 entries = ~20 turns (user+assistant pairs)
    for h in recent_history:
        api_messages.append({"role": h["role"], "content": h["content"]})

    # Agentic loop: tool calls β†’ execution β†’ re-prompt β†’ repeat
    accumulated_text = ""
    staged_this_turn = []

    for iteration in range(MAX_ITERATIONS):
        try:
            response = client.chat_completion(
                model=MODEL_ID,
                messages=api_messages,
                max_tokens=2048,
                temperature=0.7
            )
            content = response.choices[0].message.content or ""
        except Exception as e:
            error_msg = f"⚠️ API Error: {e}"
            history = history + [{"role": "assistant", "content": error_msg}]
            return (
                history, "", pending_proposals,
                _format_gate_choices(pending_proposals),
                _stats_label_files(), _stats_label_convos()
            )

        # Parse for tool calls
        tool_calls = parse_tool_calls(content)
        conversational_text = extract_conversational_text(content)

        if conversational_text:
            accumulated_text += ("\n\n" if accumulated_text else "") + conversational_text

        if not tool_calls:
            # No tools β€” this is the final response
            break

        # Process each tool call
        tool_results_for_context = []
        for tool_name, args in tool_calls:
            result = execute_tool(tool_name, args)

            if result["status"] == "executed":
                # READ tool β€” executed, feed result back to model
                tool_results_for_context.append(
                    f"[Tool Result: {tool_name}]\n{result['result']}"
                )
            elif result["status"] == "staged":
                # WRITE tool β€” staged for approval
                proposal = {
                    "id": f"proposal_{int(time.time())}_{tool_name}",
                    "tool": tool_name,
                    "args": result["args"],
                    "description": result["description"],
                    "timestamp": time.strftime("%H:%M:%S")
                }
                staged_this_turn.append(proposal)
                tool_results_for_context.append(
                    f"[Tool {tool_name}: STAGED for human approval. "
                    f"Josh will review this in the Build Approval Gate.]"
                )
            elif result["status"] == "error":
                tool_results_for_context.append(
                    f"[Tool Error: {tool_name}]\n{result['result']}"
                )

        # If we only had staged tools and no reads, break the loop
        if tool_results_for_context:
            # Feed tool results back as a system message for the next iteration
            combined_results = "\n\n".join(tool_results_for_context)
            api_messages.append({"role": "assistant", "content": content})
            api_messages.append({"role": "user", "content": f"[Tool Results]\n{combined_results}"})
        else:
            break

    # Build final response
    final_response = accumulated_text

    # Append staging notifications if any writes were staged
    if staged_this_turn:
        staging_notice = "\n\n---\nπŸ›‘οΈ **Staged for your approval** (see Build Approval Gate tab):\n"
        for proposal in staged_this_turn:
            staging_notice += f"- {proposal['description']}\n"
        final_response += staging_notice
        # Add to persistent queue
        pending_proposals = pending_proposals + staged_this_turn

    if not final_response:
        final_response = "πŸ€” I processed your request but didn't generate a text response. Check the Build Approval Gate if I staged any operations."

    # Add assistant response to history
    history = history + [{"role": "assistant", "content": final_response}]

    # Save conversation turn for persistent memory
    try:
        turn_count = len([h for h in history if h["role"] == "user"])
        ctx.save_conversation_turn(full_message, final_response, turn_count)
    except Exception:
        pass  # Don't crash the UI if persistence fails

    return (
        history,
        "",  # Clear the textbox
        pending_proposals,
        _format_gate_choices(pending_proposals),
        _stats_label_files(),
        _stats_label_convos()
    )
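The `history[-40:]` cap above is the entire context-budget strategy: one system prompt followed by a sliding window of the most recent entries. The same windowing, isolated as a standalone sketch (`build_api_messages` is a hypothetical helper for illustration, not a function in this file):

```python
def build_api_messages(system_prompt: str, history: list, max_entries: int = 40) -> list:
    """Window chat history so the request stays inside the model's context budget.

    40 entries β‰ˆ 20 user/assistant turn pairs, matching the cap in agent_loop.
    """
    messages = [{"role": "system", "content": system_prompt}]
    for turn in history[-max_entries:]:
        messages.append({"role": turn["role"], "content": turn["content"]})
    return messages
```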
# =============================================================================
# BUILD APPROVAL GATE
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# The HITL gate for reviewing and approving staged write operations.
# Josh sees a checklist of proposed changes, can select which to approve,
# and clicks Execute. Approved operations run; rejected ones are discarded.
#
# DESIGN DECISION: CheckboxGroup shows descriptions, but we need to map
# back to the actual proposal objects for execution. We use the proposal
# ID as the checkbox value and display the description as the label.
# =============================================================================

def _format_gate_choices(proposals: list):
    """Format pending proposals as CheckboxGroup choices.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Gradio 6.x deprecated gr.update(). Return a new component instance instead.

    Args:
        proposals: List of proposal dicts from staging

    Returns:
        gr.CheckboxGroup with updated choices
    """
    if not proposals:
        return gr.CheckboxGroup(choices=[], value=[])

    choices = []
    for p in proposals:
        label = f"[{p['timestamp']}] {p['description']}"
        choices.append((label, p['id']))
    return gr.CheckboxGroup(choices=choices, value=[])
def execute_approved_proposals(selected_ids: list, pending_proposals: list,
                               history: list) -> tuple:
    """Execute approved proposals, remove from queue, inject results into chat.

    CHANGELOG [2026-02-01 - Claude/Opus]
    PROBLEM: Approved operations executed and showed results in the Gate tab,
    but the chatbot conversation never received them. Kimi couldn't continue
    reasoning because it never saw what happened. Josh had to manually go
    back and re-prompt.

    FIX: After execution, inject results into chat history as an assistant
    message. A chained .then() call (auto_continue_after_approval) picks
    up the updated history and sends a synthetic "[Continue]" through the
    agent loop so Kimi sees the tool results and keeps working.

    Args:
        selected_ids: List of proposal IDs that Josh approved
        pending_proposals: Full list of pending proposals
        history: Current chatbot message history (list of dicts)

    Returns:
        Tuple of (results_markdown, updated_proposals, updated_gate_choices,
        updated_chatbot_history)
    """
    if not selected_ids:
        return (
            "No proposals selected.",
            pending_proposals,
            _format_gate_choices(pending_proposals),
            history
        )

    results = []
    remaining = []

    for proposal in pending_proposals:
        if proposal['id'] in selected_ids:
            # Execute this one
            result = execute_staged_tool(proposal['tool'], proposal['args'])
            results.append(f"**{proposal['tool']}**: {result}")
        else:
            # Keep in queue
            remaining.append(proposal)

    results_text = "## Execution Results\n\n" + "\n\n".join(results) if results else "Nothing executed."

    # Inject results into chat history so Kimi sees them next turn
    if results:
        result_summary = "βœ… **Approved operations executed:**\n\n" + "\n\n".join(results)
        history = history + [{"role": "assistant", "content": result_summary}]

    return results_text, remaining, _format_gate_choices(remaining), history
def auto_continue_after_approval(history: list, pending_proposals: list) -> tuple:
    """Automatically re-enter the agent loop after approval so Kimi sees results.

    CHANGELOG [2026-02-01 - Claude/Opus]
    PROBLEM: After Josh approved staged operations, results were injected into
    chat history but Kimi never got another turn. Josh had to type something
    like "continue" to trigger Kimi to process the tool results.

    FIX: This function is chained via .then() after execute_approved_proposals.
    It sends a synthetic continuation prompt through the agent loop so Kimi
    automatically processes the approved tool results and continues working.

    We only continue if the last message in history is our injected results
    (starts with 'βœ… **Approved'). This prevents infinite loops if called
    when there's nothing to continue from.

    Args:
        history: Chat history (should contain injected results from approval)
        pending_proposals: Current pending proposals (passed through)

    Returns:
        Same tuple shape as agent_loop so it can update the same outputs
    """
    # Safety check: only continue if last message is our injected results
    if not history or history[-1].get("role") != "assistant":
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    last_msg = history[-1].get("content", "")
    if not last_msg.startswith("βœ… **Approved"):
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    # Re-enter the agent loop with a synthetic continuation prompt
    # This tells Kimi "I approved your operations, here are the results,
    # now keep going with whatever you were doing."
    return agent_loop(
        message="[The operations above were approved and executed. Continue with your task using these results.]",
        history=history,
        pending_proposals=pending_proposals,
        uploaded_file=None
    )
def clear_all_proposals(pending_proposals: list) -> tuple:
    """Discard all pending proposals without executing.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Safety valve β€” lets Josh throw out everything in the queue if the
    agent went off track.

    Returns:
        Tuple of (status_message, empty_proposals, updated_gate_choices)
    """
    count = len(pending_proposals)
    return f"πŸ—‘οΈ Cleared {count} proposal(s).", [], _format_gate_choices([])
# =============================================================================
# STATS HELPERS
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Helper functions to format stats for the sidebar labels.
# Called both at startup (initial render) and after each conversation turn
# (to reflect newly indexed files or saved conversations).
# =============================================================================

def _stats_label_files() -> str:
    """Format the files stat for the sidebar label."""
    stats = ctx.get_stats()
    files = stats.get('total_files', 0)
    chunks = stats.get('indexed_chunks', 0)
    indexing = " ⏳" if stats.get('indexing_in_progress') else ""
    return f"πŸ“‚ Files: {files} ({chunks} chunks){indexing}"

def _stats_label_convos() -> str:
    """Format the conversations stat for the sidebar label."""
    stats = ctx.get_stats()
    convos = stats.get('conversations', 0)
    cloud = " ☁️" if stats.get('persistence_configured') else ""
    return f"πŸ’Ύ Conversations: {convos}{cloud}"

def refresh_stats() -> tuple:
    """Refresh both stat labels. Called by the refresh button.

    Returns:
        Tuple of (files_label, convos_label)
    """
    return _stats_label_files(), _stats_label_convos()
# =============================================================================
# UI LAYOUT
# =============================================================================
# CHANGELOG [2026-02-01 - Gemini]
# RESTORED: Metrics sidebar and multi-tab layout.
#
# CHANGELOG [2026-02-01 - Claude/Opus]
# IMPLEMENTED: All the wiring. Every button, input, and display is now
# connected to actual functions.
#
# Layout:
#   Tab 1 "Vibe Chat" β€” Main conversation interface with sidebar stats
#   Tab 2 "Build Approval Gate" β€” HITL review for staged write operations
#
# gr.State holds the pending proposals list (per-session, survives across
# messages within the same browser tab).
# =============================================================================

with gr.Blocks(
    title="🦞 Clawdbot Command Center",
    # CHANGELOG [2026-02-01 - Claude/Opus]
    # Gradio 6.0+ moved `theme` from Blocks() to launch(). Passing it here
    # triggers a UserWarning in 6.x. Theme is set in launch() below instead.
) as demo:
    # Session state for pending proposals
    pending_proposals_state = gr.State([])

    gr.Markdown("# 🦞 Clawdbot Command Center\n*E-T Systems Vibe Coding Agent*")

    with gr.Tabs():
        # ==== TAB 1: VIBE CHAT ====
        with gr.Tab("πŸ’¬ Vibe Chat"):
            with gr.Row():
                # ---- Sidebar ----
                with gr.Column(scale=1, min_width=200):
                    gr.Markdown("### πŸ“Š System Status")
                    stats_files = gr.Markdown(_stats_label_files())
                    stats_convos = gr.Markdown(_stats_label_convos())
                    refresh_btn = gr.Button("πŸ”„ Refresh Stats", size="sm")

                    gr.Markdown("---")
                    gr.Markdown("### πŸ“Ž Upload Context")
                    file_input = gr.File(
                        label="Drop a file here",
                        file_types=[
                            '.py', '.js', '.ts', '.json', '.md', '.txt',
                            '.yaml', '.yml', '.html', '.css', '.sh',
                            '.toml', '.cfg', '.csv', '.xml'
                        ]
                    )
                    gr.Markdown(
                        "*Upload code, configs, or docs to include in your message.*"
                    )

                # ---- Chat area ----
                with gr.Column(scale=4):
                    chatbot = gr.Chatbot(
                        # CHANGELOG [2026-02-01 - Claude/Opus]
                        # Gradio 6.x uses messages format by default.
                        # The type="messages" param was removed in 6.0 β€”
                        # passing it causes TypeError on init.
                        height=600,
                        show_label=False,
                        avatar_images=(None, "https://em-content.zobj.net/source/twitter/408/lobster_1f99e.png"),
                    )
                    with gr.Row():
                        msg = gr.Textbox(
                            placeholder="Ask Clawdbot to search, read, or code...",
                            show_label=False,
                            scale=6,
                            lines=2,
                            max_lines=10,
                        )
                        send_btn = gr.Button("Send", variant="primary", scale=1)

            # Wire up chat submission (output list is completed in the
            # event-wiring section below, after the Gate tab components exist)
            chat_inputs = [msg, chatbot, pending_proposals_state, file_input]

        # ==== TAB 2: BUILD APPROVAL GATE ====
        with gr.Tab("πŸ›‘οΈ Build Approval Gate"):
            gr.Markdown(
                "### Review Staged Operations\n"
                "Write operations (file writes, shell commands, branch creation) "
                "are staged here for your review before execution.\n\n"
                "**Select proposals to approve, then click Execute.**"
            )
            gate_list = gr.CheckboxGroup(
                label="Pending Proposals",
                choices=[],
                interactive=True
            )
            with gr.Row():
                btn_exec = gr.Button("βœ… Execute Selected", variant="primary")
                btn_clear = gr.Button("πŸ—‘οΈ Clear All", variant="secondary")
            gate_results = gr.Markdown("*No operations executed yet.*")
    # ==================================================================
    # EVENT WIRING
    # ==================================================================
    # CHANGELOG [2026-02-01 - Claude/Opus]
    # All events are wired here, after all components are defined, so
    # cross-tab references work (e.g., chat updating the gate_list).
    # ==================================================================

    # Chat submission (both Enter key and Send button)
    full_chat_outputs = [
        chatbot, msg, pending_proposals_state,
        gate_list, stats_files, stats_convos
    ]

    msg.submit(
        fn=agent_loop,
        inputs=chat_inputs,
        outputs=full_chat_outputs
    )
    send_btn.click(
        fn=agent_loop,
        inputs=chat_inputs,
        outputs=full_chat_outputs
    )

    # Refresh stats button
    refresh_btn.click(
        fn=refresh_stats,
        inputs=[],
        outputs=[stats_files, stats_convos]
    )

    # Build Approval Gate buttons
    # CHANGELOG [2026-02-01 - Claude/Opus]
    # btn_exec now takes chatbot as input AND output so approved operation
    # results land in the conversation, and the chained .then() call
    # automatically re-enters the agent loop so Kimi processes the results
    # without Josh having to type "continue".
    btn_exec.click(
        fn=execute_approved_proposals,
        inputs=[gate_list, pending_proposals_state, chatbot],
        outputs=[gate_results, pending_proposals_state, gate_list, chatbot]
    ).then(
        fn=auto_continue_after_approval,
        inputs=[chatbot, pending_proposals_state],
        outputs=[chatbot, msg, pending_proposals_state,
                 gate_list, stats_files, stats_convos]
    )
    btn_clear.click(
        fn=clear_all_proposals,
        inputs=[pending_proposals_state],
        outputs=[gate_results, pending_proposals_state, gate_list]
    )
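The wiring above relies on a Gradio convention: an event handler returns one value per component in its `outputs` list, matched positionally. That is why agent_loop's six-element return tuple must stay aligned with full_chat_outputs. A toy illustration of the contract (not a handler from this file):

```python
def toy_handler(message: str, history: list) -> tuple:
    """Toy event handler for outputs=[chatbot, msg]: return (new_history, cleared_textbox)."""
    history = history + [{"role": "user", "content": message}]
    return history, ""  # position 0 -> chatbot, position 1 -> msg textbox
```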
# =============================================================================
# LAUNCH
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Standard HF Spaces launch config. 0.0.0.0 binds to all interfaces
# (required for Docker). Port 7860 is the HF Spaces standard.
# =============================================================================

if __name__ == "__main__":
    demo.launch(server_name="0.0.0.0", server_port=7860, theme=gr.themes.Soft())
 
"""
Clawdbot Unified Command Center

CHANGELOG [2026-02-01 - Gemini]
RESTORED: Full Kimi K2.5 Agentic Loop (no more silence).
ADDED: Full Developer Tool Suite (Write, Search, Shell).
FIXED: HITL Gate interaction with conversational flow.

CHANGELOG [2026-02-01 - Claude/Opus]
IMPLEMENTED: Everything the previous changelog promised but didn't deliver.
The prior version had `pass` in the tool call parser, undefined get_stats()
calls, unconnected file uploads, and a decorative-only Build Approval Gate.

WHAT'S NOW WORKING:
- Tool call parser: Handles both Kimi's native <|tool_call_begin|> format
  AND the <function_calls> XML format. Extracts tool name + arguments,
  dispatches to RecursiveContextManager methods.
- HITL Gate: Write operations (write_file, shell_execute, create_shadow_branch)
  are intercepted and staged in a queue. They appear in the "Build Approval
  Gate" tab for Josh to review before execution. Read operations (search_code,
  read_file, list_files, search_conversations, search_testament) execute
  immediately β€” no approval needed for reads.
- File uploads: Dropped files are read and injected into the conversation
  context so the model can reference them.
- Stats sidebar: Pulls from ctx.get_stats() which now exists.
- Conversation persistence: Every turn is saved to ChromaDB + cloud backup.

DESIGN DECISIONS:
- Gradio state for the approval queue: We use gr.State to hold pending
  proposals per-session. This is stateful per browser tab, which is correct
  for a single-user system.
- Read vs Write classification: Reads are safe and automated. Writes need
  human eyes. This mirrors Josh's stated preference for finding root causes
  over workarounds β€” you see exactly what the agent wants to change.
- Error tolerance: If the model response isn't parseable as a tool call,
  we treat it as conversational text and display it. No silent failures.
- The agentic loop runs up to 5 iterations to handle multi-step tool use
  (model searches β†’ reads file β†’ searches again β†’ responds). Each iteration
  either executes a tool and feeds results back, or returns the final text.

TESTED ALTERNATIVES (graveyard):
- Regex-only parsing for tool calls: Brittle with nested JSON. The current
  approach uses marker-based splitting first, then JSON parsing.
- Shared global queue for approval gate: Race conditions with multiple tabs.
  gr.State is per-session and avoids this.
- Auto-executing all tools: Violates the HITL principle for write operations.
  Josh explicitly wants to approve code changes before they land.

DEPENDENCIES:
- recursive_context.py: RecursiveContextManager class (must define get_stats())
- gradio>=5.0.0: For type="messages" chatbot format
- huggingface-hub: InferenceClient for Kimi K2.5
"""
import gradio as gr
from huggingface_hub import InferenceClient
from recursive_context import RecursiveContextManager

import os
from pathlib import Path
import re
import time
import traceback
 
# =============================================================================
# INITIALIZATION
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# InferenceClient points to HF router which handles model routing.
# RecursiveContextManager is initialized once and shared across all requests.
# MODEL_ID must match what the HF router expects for Kimi K2.5.
# =============================================================================

client = InferenceClient(
    "https://router.huggingface.co/v1",
    token=os.getenv("HF_TOKEN")
)
 
# =============================================================================
# REPO PATH RESOLUTION + CROSS-SPACE SYNC
# =============================================================================
# CHANGELOG [2025-01-29 - Josh]
# Created sync_from_space() to read E-T Systems code from its own Space.
# Uses HfFileSystem to list and download files via HF_TOKEN.
#
# CHANGELOG [2026-02-01 - Claude/Opus]
# PROBLEM: Gemini refactor replaced this working sync with a hallucinated
# REPO_URL / git clone approach in entrypoint.sh. The secret was renamed
# from ET_SYSTEMS_SPACE to REPO_URL without updating the Space settings,
# so the clone never happened and the workspace was empty.
#
# FIX: Restored the original ET_SYSTEMS_SPACE β†’ HfFileSystem sync that
# was working before. Falls back to /app (Clawdbot's own dir) if the
# secret isn't set, so tools still function for self-inspection.
#
# REQUIRED SECRET: ET_SYSTEMS_SPACE = "username/space-name"
# (format matches HF Space ID, e.g. "drone11272/e-t-systems")
# =============================================================================

ET_SYSTEMS_SPACE = os.getenv("ET_SYSTEMS_SPACE", "")
REPO_PATH = os.getenv("REPO_PATH", "/workspace/e-t-systems")
 
 
def sync_from_space(space_id: str, local_path: Path):
    """Sync files from E-T Systems Space to local workspace.

    CHANGELOG [2025-01-29 - Josh]
    Created to enable Clawdbot to read E-T Systems code from its Space.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Restored after Gemini refactor deleted it. Added recursive directory
    download β€” the original only grabbed top-level files. Now walks the
    full directory tree so nested source files are available too.

    Args:
        space_id: HuggingFace Space ID (e.g. "username/space-name")
        local_path: Where to download files locally
    """
    token = (
        os.getenv("HF_TOKEN") or
        os.getenv("HUGGING_FACE_HUB_TOKEN") or
        os.getenv("HUGGINGFACE_TOKEN")
    )
    if not token:
        print("No HF_TOKEN found β€” cannot sync from Space")
        return
    try:
        from huggingface_hub import HfFileSystem
        fs = HfFileSystem(token=token)
        space_path = f"spaces/{space_id}"
        print(f"Syncing from Space: {space_id}")
        # Recursive download: walk all files in the Space repo
        all_files = []
        try:
            all_files = fs.glob(f"{space_path}/**")
        except Exception:
            # Fallback: just list top level
            all_files = fs.ls(space_path, detail=False)
        local_path.mkdir(parents=True, exist_ok=True)
        downloaded = 0
        for file_path in all_files:
            # Get path relative to the space root
            rel = file_path.replace(f"{space_path}/", "", 1)
            # Skip hidden files, .git, __pycache__
            if any(part.startswith('.') for part in rel.split('/')):
                continue
            if '__pycache__' in rel or 'node_modules' in rel:
                continue
            # Check if it's a file (not directory)
            try:
                info = fs.info(file_path)
                if info.get('type') == 'directory':
                    continue
            except Exception:
                continue
            # Create parent dirs and download
            dest = local_path / rel
            dest.parent.mkdir(parents=True, exist_ok=True)
            try:
                with fs.open(file_path, "rb") as f:
                    content = f.read()
                dest.write_bytes(content)
                downloaded += 1
                print(f"  {rel}")
            except Exception as e:
                print(f"  Failed: {rel} ({e})")
        print(f"Synced {downloaded} files from Space: {space_id}")
    except Exception as e:
        print(f"Failed to sync from Space: {e}")
        import traceback
        traceback.print_exc()
def _resolve_repo_path() -> str:
    """Initialize workspace with E-T Systems files.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Three-tier resolution:
    1. ET_SYSTEMS_SPACE secret β†’ sync via HfFileSystem (the working approach)
    2. Pre-populated REPO_PATH β†’ use as-is (manual or from a previous sync)
    3. /app (Clawdbot's own directory β€” tools still work for self-inspection)
    """
    repo_path = Path(REPO_PATH)

    # Tier 1: Sync from E-T Systems Space if secret is configured
    if ET_SYSTEMS_SPACE:
        sync_from_space(ET_SYSTEMS_SPACE, repo_path)
        if repo_path.exists() and any(repo_path.iterdir()):
            print(f"Using synced E-T Systems repo: {repo_path}")
            return str(repo_path)

    # Tier 2: Pre-populated REPO_PATH (manual or from previous sync)
    if repo_path.exists() and any(repo_path.iterdir()):
        print(f"Using existing repo: {repo_path}")
        return str(repo_path)

    # Tier 3: Fall back to Clawdbot's own directory
    app_dir = os.path.dirname(os.path.abspath(__file__))
    print(f"No E-T Systems repo found β€” falling back to: {app_dir}")
    print("Set ET_SYSTEMS_SPACE secret to your Space ID to enable sync.")
    return app_dir


ctx = RecursiveContextManager(_resolve_repo_path())
MODEL_ID = "moonshotai/Kimi-K2.5"
 
# =============================================================================
# TOOL DEFINITIONS
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# These are the tools the model can call. Classified as READ (auto-execute)
# or WRITE (requires human approval via the HITL gate).
#
# READ tools: Safe, no side effects, execute immediately.
# WRITE tools: Modify files, run commands, create branches β€” staged for review.
#
# NOTE: The tool definitions are included in the system prompt so Kimi knows
# what's available. The actual execution happens in execute_tool().
# =============================================================================

TOOL_DEFINITIONS = """
## Available Tools

### Tools you can use freely (no approval needed):
- **search_code(query, n=5)** β€” Semantic search across the E-T Systems codebase.
  Returns matching code snippets with file paths. JUST USE THIS. Don't ask.
- **read_file(path, start_line=null, end_line=null)** β€” Read a specific file or line range.
  JUST USE THIS. Don't ask.
- **list_files(path="", max_depth=3)** β€” List directory contents as a tree.
  JUST USE THIS. Don't ask.
- **search_conversations(query, n=5)** β€” Search past conversation history semantically.
  JUST USE THIS. Don't ask.
- **search_testament(query, n=5)** β€” Search architectural decisions and Testament docs.
  JUST USE THIS. Don't ask.

### Tools that get staged for Josh to approve:
- **write_file(path, content)** β€” Write content to a file. REQUIRES CHANGELOG header.
- **shell_execute(command)** β€” Run a shell command. Read-only commands (ls, find, cat,
  grep, head, tail, wc, tree, etc.) auto-execute without approval. Commands that modify
  anything get staged for review.
- **create_shadow_branch()** β€” Create a timestamped backup branch before changes.

To call a tool, use this format:
<function_calls>
<invoke name="tool_name">
<parameter name="param_name">value</parameter>
</invoke>
</function_calls>
"""

# Which tools are safe to auto-execute vs which need human approval
READ_TOOLS = {'search_code', 'read_file', 'list_files', 'search_conversations', 'search_testament'}
WRITE_TOOLS = {'write_file', 'shell_execute', 'create_shadow_branch'}
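execute_tool() (defined elsewhere in this file) is what turns this read/write classification into behavior: reads run immediately with status "executed", writes come back with status "staged" for the Build Approval Gate. A sketch of the dispatch shape the agent loop branches on (`classify_call` is hypothetical; the real execute_tool also actually runs the tool and fills in result/description):

```python
READ_SET = {'search_code', 'read_file', 'list_files', 'search_conversations', 'search_testament'}
WRITE_SET = {'write_file', 'shell_execute', 'create_shadow_branch'}

def classify_call(tool_name: str, args: dict) -> dict:
    """Return the status envelope the agent loop branches on."""
    if tool_name in READ_SET:
        # Real code would execute here and put the tool output in "result"
        return {"status": "executed", "result": f"<output of {tool_name}>"}
    if tool_name in WRITE_SET:
        # Staged: carries args + a human-readable description for the Gate tab
        return {"status": "staged", "args": args,
                "description": f"{tool_name}({', '.join(args)})"}
    return {"status": "error", "result": f"Unknown tool: {tool_name}"}
```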
 
 
# =============================================================================
# SYSTEM PROMPT
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Gives Kimi its identity, available tools, and behavioral guidelines.
# Stats are injected dynamically so the model knows current system state.
# =============================================================================
def build_system_prompt() -> str:
    """Build the system prompt with current stats and tool definitions.

    Called fresh for each message so stats reflect current indexing state.
    """
    stats = ctx.get_stats()
    indexing_note = ""
    if stats.get('indexing_in_progress'):
        indexing_note = "\nNOTE: Repository indexing is in progress. search_code results may be incomplete."
    if stats.get('index_error'):
        indexing_note += f"\nIndexing error: {stats['index_error']}"
    return f"""You are Clawdbot, a high-autonomy vibe coding agent for E-T Systems.

## Your Role
You help Josh (the architect) build and maintain E-T Systems. You have full access to the codebase
via tools. Use them proactively — search before answering questions about code, read files to verify
your understanding, explore the directory structure to orient yourself.

## Current System Stats
- Indexed files: {stats.get('total_files', 0)}
- Searchable chunks: {stats.get('indexed_chunks', 0)}
- Saved conversations: {stats.get('conversations', 0)}
- ChromaDB: {stats.get('chroma_path', 'unknown')}
- Cloud backup: {'configured' if stats.get('persistence_configured') else 'not configured'}
{indexing_note}

{TOOL_DEFINITIONS}

## Code Writing Rules
ALL code you write MUST include a living changelog header:
```
CHANGELOG [YYYY-MM-DD - Clawdbot]
WHAT: Brief description of what was added/changed
WHY: Rationale for the change
```
Files without this header will be REJECTED by the write_file tool.

## Behavioral Guidelines
- NEVER ask permission to search, read, or list files. Just do it.
- Use tools proactively — search before answering questions about code
- Cite specific files and line numbers when discussing implementation
- Follow existing patterns — check how similar things are done first
- When unsure, say so. Don't hallucinate about code that might not exist.
- Only write_file, shell_execute, and create_shadow_branch get staged for review.
  Everything else you do immediately, silently, without asking.
"""
 
# =============================================================================
# TOOL CALL PARSING
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Kimi K2.5 can emit tool calls in two formats:
#
# 1. Native format:
#    <|tool_call_begin|>functions.search_code:0\n{"query": "surprise detection"}
#    <|tool_call_end|>
#
# 2. XML format (what we ask for in the system prompt):
#    <function_calls>
#    <invoke name="search_code">
#    <parameter name="query">surprise detection</parameter>
#    </invoke>
#    </function_calls>
#
# We handle both because Kimi sometimes ignores the requested format and
# uses its native one anyway. The parser returns a list of (tool_name, args)
# tuples.
#
# TESTED ALTERNATIVES (graveyard):
# - Single regex for both formats: Unmaintainable, broke on edge cases.
# - Forcing Kimi to only use XML: It doesn't reliably comply.
# - JSON-mode tool calling via HF API: Not supported for Kimi K2.5.
# =============================================================================
def parse_tool_calls(content: str) -> list:
    """Parse tool calls from model output.

    Handles both Kimi's native format and XML function_calls format.

    Args:
        content: Raw model response text

    Returns:
        List of (tool_name, args_dict) tuples. Empty list if no tool calls.
    """
    calls = []

    # --- Format 1: Kimi native <|tool_call_begin|> ... <|tool_call_end|> ---
    native_pattern = r'<\|tool_call_begin\|>\s*functions\.(\w+):\d+\s*\n(.*?)<\|tool_call_end\|>'
    for match in re.finditer(native_pattern, content, re.DOTALL):
        tool_name = match.group(1)
        try:
            args = json.loads(match.group(2).strip())
        except json.JSONDecodeError:
            # If JSON parsing fails, pass the raw payload through instead
            args = {"raw": match.group(2).strip()}
        calls.append((tool_name, args))

    # --- Format 2: XML <function_calls> ... </function_calls> ---
    xml_pattern = r'<function_calls>(.*?)</function_calls>'
    for block_match in re.finditer(xml_pattern, content, re.DOTALL):
        block = block_match.group(1)
        invoke_pattern = r'<invoke\s+name="(\w+)">(.*?)</invoke>'
        for invoke_match in re.finditer(invoke_pattern, block, re.DOTALL):
            tool_name = invoke_match.group(1)
            params_block = invoke_match.group(2)
            args = {}
            param_pattern = r'<parameter\s+name="(\w+)">(.*?)</parameter>'
            for param_match in re.finditer(param_pattern, params_block, re.DOTALL):
                key = param_match.group(1)
                value = param_match.group(2).strip()
                # Try to parse as JSON for numbers, bools, etc.
                try:
                    args[key] = json.loads(value)
                except (json.JSONDecodeError, ValueError):
                    args[key] = value
            calls.append((tool_name, args))

    return calls
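A quick sanity check of the XML branch, as a standalone snippet using the same regexes trimmed to the XML path only (`parse_xml_calls` is an illustrative copy, not an app function):

```python
import json
import re

def parse_xml_calls(content: str) -> list:
    """Standalone copy of the XML branch of parse_tool_calls, for illustration."""
    calls = []
    for block_match in re.finditer(r'<function_calls>(.*?)</function_calls>', content, re.DOTALL):
        for invoke in re.finditer(r'<invoke\s+name="(\w+)">(.*?)</invoke>', block_match.group(1), re.DOTALL):
            args = {}
            for param in re.finditer(r'<parameter\s+name="(\w+)">(.*?)</parameter>', invoke.group(2), re.DOTALL):
                value = param.group(2).strip()
                try:
                    # Numbers and booleans parse as JSON; anything else stays a string
                    args[param.group(1)] = json.loads(value)
                except (json.JSONDecodeError, ValueError):
                    args[param.group(1)] = value
            calls.append((invoke.group(1), args))
    return calls

sample = (
    '<function_calls><invoke name="search_code">'
    '<parameter name="query">surprise detection</parameter>'
    '<parameter name="n">5</parameter>'
    '</invoke></function_calls>'
)
print(parse_xml_calls(sample))  # [('search_code', {'query': 'surprise detection', 'n': 5})]
```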
 
 
def extract_conversational_text(content: str) -> str:
    """Remove tool call markup from response, leaving just conversational text.

    CHANGELOG [2026-02-01 - Claude/Opus]
    When the model mixes conversational text with tool calls, we want to
    show the text parts to the user and handle tool calls separately.

    Args:
        content: Raw model response

    Returns:
        Text with tool call blocks removed, stripped of extra whitespace
    """
    # Remove native format tool calls
    cleaned = re.sub(
        r'<\|tool_call_begin\|>.*?<\|tool_call_end\|>',
        '', content, flags=re.DOTALL
    )
    # Remove XML format tool calls
    cleaned = re.sub(
        r'<function_calls>.*?</function_calls>',
        '', cleaned, flags=re.DOTALL
    )
    return cleaned.strip()
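For instance, stripping a mixed response looks like this (standalone copy of the two substitutions above; `strip_tool_markup` is an illustrative name):

```python
import re

def strip_tool_markup(content: str) -> str:
    """Remove both tool-call formats, keeping only the conversational text."""
    cleaned = re.sub(r'<\|tool_call_begin\|>.*?<\|tool_call_end\|>', '', content, flags=re.DOTALL)
    cleaned = re.sub(r'<function_calls>.*?</function_calls>', '', cleaned, flags=re.DOTALL)
    return cleaned.strip()

mixed = 'Let me search the codebase.\n<function_calls><invoke name="search_code"></invoke></function_calls>'
print(strip_tool_markup(mixed))  # Let me search the codebase.
```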
 
 
# =============================================================================
# TOOL EXECUTION
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Dispatches parsed tool calls to RecursiveContextManager methods.
# READ tools execute immediately and return results.
# WRITE tools return a staging dict for the HITL gate.
#
# The return format differs by type:
# - READ:  {"status": "executed", "tool": name, "result": result_string}
# - WRITE: {"status": "staged", "tool": name, "args": args, "description": desc}
# =============================================================================
def execute_tool(tool_name: str, args: dict) -> dict:
    """Execute a read tool or prepare a write tool for staging.

    Args:
        tool_name: Name of the tool to execute
        args: Arguments dict parsed from model output

    Returns:
        Dict with 'status' ('executed' or 'staged'), 'tool' name, and
        either 'result' (for reads) or 'args'+'description' (for writes)
    """
    try:
        # ----- READ TOOLS: Execute immediately -----
        if tool_name == 'search_code':
            result = ctx.search_code(
                query=args.get('query', ''),
                n=args.get('n', 5)
            )
            formatted = "\n\n".join([
                f"**{r['file']}**\n```\n{r['snippet']}\n```"
                for r in result
            ]) if result else "No results found."
            return {"status": "executed", "tool": tool_name, "result": formatted}

        elif tool_name == 'read_file':
            result = ctx.read_file(
                path=args.get('path', ''),
                start_line=args.get('start_line'),
                end_line=args.get('end_line')
            )
            return {"status": "executed", "tool": tool_name, "result": result}

        elif tool_name == 'list_files':
            result = ctx.list_files(
                path=args.get('path', ''),
                max_depth=args.get('max_depth', 3)
            )
            return {"status": "executed", "tool": tool_name, "result": result}

        elif tool_name == 'search_conversations':
            result = ctx.search_conversations(
                query=args.get('query', ''),
                n=args.get('n', 5)
            )
            formatted = "\n\n---\n\n".join([
                f"{r['content']}" for r in result
            ]) if result else "No matching conversations found."
            return {"status": "executed", "tool": tool_name, "result": formatted}

        elif tool_name == 'search_testament':
            result = ctx.search_testament(
                query=args.get('query', ''),
                n=args.get('n', 5)
            )
            formatted = "\n\n".join([
                f"**{r['file']}**{' (Testament)' if r.get('is_testament') else ''}\n{r['snippet']}"
                for r in result
            ]) if result else "No matching testament/decision records found."
            return {"status": "executed", "tool": tool_name, "result": formatted}

        # ----- WRITE TOOLS: Stage for approval -----
        elif tool_name == 'write_file':
            path = args.get('path', 'unknown')
            content_preview = args.get('content', '')[:200]
            return {
                "status": "staged",
                "tool": tool_name,
                "args": args,
                "description": f"Write to `{path}`\n```\n{content_preview}...\n```"
            }

        elif tool_name == 'shell_execute':
            command = args.get('command', 'unknown')
            # =============================================================
            # SMART SHELL CLASSIFICATION
            # =============================================================
            # CHANGELOG [2026-02-01 - Claude/Opus]
            # PROBLEM: When list_files returns empty (e.g., repo not cloned),
            # Kimi falls back to shell_execute with read-only commands like
            # `find . -type f`. These got staged for approval, forcing Josh
            # to approve what's functionally just a directory listing.
            #
            # FIX: Classify shell commands as READ or WRITE by checking the
            # base command. Read-only commands auto-execute. Anything that
            # could modify state still gets staged.
            #
            # SAFE READ commands: ls, find, cat, head, tail, wc, grep, tree,
            # du, file, stat, echo, pwd, which, env, printenv, whoami, date
            #
            # UNSAFE (staged): Everything else, plus anything with pipes to
            # potentially unsafe commands, redirects (>), or semicolons
            # chaining unknown commands.
            # =============================================================
            READ_ONLY_COMMANDS = {
                'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep', 'tree',
                'du', 'file', 'stat', 'echo', 'pwd', 'which', 'env',
                'printenv', 'whoami', 'date', 'realpath', 'dirname',
                'basename', 'diff', 'less', 'more', 'sort', 'uniq',
                'awk', 'sed', 'cut', 'tr', 'tee', 'python',
            }
            # ---------------------------------------------------------------
            # CHANGELOG [2026-02-01 - Claude/Opus]
            # PROBLEM: Naive '>' check caught "2>/dev/null" as dangerous,
            # staging `find ... 2>/dev/null | head -20` for approval even
            # though every command in the pipeline is read-only.
            #
            # FIX: Strip stderr redirects (2>/dev/null, 2>&1) before danger
            # check. Split on pipes and verify EACH segment's base command
            # is in READ_ONLY_COMMANDS. Only stage if something genuinely
            # unsafe is found.
            #
            # Safe patterns now auto-execute:
            #   find . -name "*.py" 2>/dev/null | head -20
            #   grep -r "pattern" . | sort | uniq
            #   cat file.py | wc -l
            # Unsafe patterns still get staged:
            #   find . -name "*.py" | xargs rm
            #   cat file > /etc/passwd
            #   echo "bad" ; rm -rf /
            # ---------------------------------------------------------------
            # Strip safe stderr redirects before checking
            import re as _re
            sanitized = _re.sub(r'2>\s*/dev/null', '', command)
            sanitized = _re.sub(r'2>&1', '', sanitized)
            # Characters that turn reads into writes (checked AFTER stripping
            # safe redirects). Output redirect > is still caught, but not 2>.
            WRITE_INDICATORS = {';', '&&', '||', '`', '$('}
            # > is only dangerous if it's a real output redirect, not a 2>
            # stderr redirect we already stripped. Check separately.
            has_write_redirect = bool(_re.search(r'(?<![2&])\s*>', sanitized))
            has_write_chars = any(d in sanitized for d in WRITE_INDICATORS)
            # Split on pipes and check each segment
            pipe_segments = [seg.strip() for seg in sanitized.split('|') if seg.strip()]
            all_segments_safe = all(
                (seg.split()[0].split('/')[-1] if seg.split() else '') in READ_ONLY_COMMANDS
                for seg in pipe_segments
            )
            if all_segments_safe and not has_write_redirect and not has_write_chars:
                # Every command in the pipeline is read-only — auto-execute
                result = ctx.shell_execute(command)
                return {"status": "executed", "tool": tool_name, "result": result}
            else:
                # Something potentially destructive — stage for approval
                return {
                    "status": "staged",
                    "tool": tool_name,
                    "args": args,
                    "description": f"Execute: `{command}`"
                }

        elif tool_name == 'create_shadow_branch':
            return {
                "status": "staged",
                "tool": tool_name,
                "args": args,
                "description": "Create shadow backup branch"
            }

        else:
            return {
                "status": "error",
                "tool": tool_name,
                "result": f"Unknown tool: {tool_name}"
            }

    except Exception as e:
        return {
            "status": "error",
            "tool": tool_name,
            "result": f"Tool execution error: {e}\n{traceback.format_exc()}"
        }
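The shell classification above can be lifted into a standalone predicate for testing. This is an illustrative copy of the same logic (a hypothetical `is_read_only_command` helper with a trimmed allow-list, not a new API in the app):

```python
import re

READ_ONLY_COMMANDS = {
    'ls', 'find', 'cat', 'head', 'tail', 'wc', 'grep',
    'tree', 'sort', 'uniq', 'echo', 'pwd',
}

def is_read_only_command(command: str) -> bool:
    """Return True when every segment of a shell pipeline is a known read-only command."""
    # Stderr redirects are harmless; strip them before the danger check
    sanitized = re.sub(r'2>\s*/dev/null', '', command)
    sanitized = re.sub(r'2>&1', '', sanitized)
    # Any surviving output redirect or command chaining means "stage it"
    if re.search(r'(?<![2&])\s*>', sanitized):
        return False
    if any(tok in sanitized for tok in (';', '&&', '||', '`', '$(')):
        return False
    # Every pipe segment's base command must be on the allow-list
    segments = [seg.strip() for seg in sanitized.split('|') if seg.strip()]
    return bool(segments) and all(
        seg.split()[0].split('/')[-1] in READ_ONLY_COMMANDS
        for seg in segments
    )

print(is_read_only_command('find . -name "*.py" 2>/dev/null | head -20'))  # True
print(is_read_only_command('cat notes.txt > /etc/passwd'))                 # False
```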
 
 
def execute_staged_tool(tool_name: str, args: dict) -> str:
    """Actually execute a staged write tool after human approval.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Called from the Build Approval Gate when Josh approves a staged operation.
    This is the only path through which write tools actually run.

    Args:
        tool_name: Name of the approved tool
        args: Original arguments from the model

    Returns:
        Result string from the tool execution
    """
    try:
        if tool_name == 'write_file':
            return ctx.write_file(
                path=args.get('path', ''),
                content=args.get('content', '')
            )
        elif tool_name == 'shell_execute':
            return ctx.shell_execute(command=args.get('command', ''))
        elif tool_name == 'create_shadow_branch':
            return ctx.create_shadow_branch()
        else:
            return f"Unknown tool: {tool_name}"
    except Exception as e:
        return f"Execution error: {e}"
 
 
# =============================================================================
# FILE UPLOAD HANDLER
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Reads uploaded files and formats them for injection into the conversation.
# Supports code files, text, JSON, markdown, etc. Binary files get a
# placeholder message since they can't be meaningfully injected as text.
# =============================================================================
TEXT_EXTENSIONS = {
    '.py', '.js', '.ts', '.jsx', '.tsx', '.json', '.yaml', '.yml',
    '.md', '.txt', '.rst', '.html', '.css', '.scss', '.sh', '.bash',
    '.sql', '.toml', '.cfg', '.ini', '.conf', '.xml', '.csv',
    '.env', '.gitignore', '.dockerignore', '.mjs', '.cjs',
}
 
def process_uploaded_file(file) -> str:
    """Read an uploaded file and format it for conversation context.

    Args:
        file: Gradio file object with .name attribute (temp path)

    Returns:
        Formatted string with filename and content, ready to inject
        into the conversation as context
    """
    if file is None:
        return ""

    file_path = file.name if hasattr(file, 'name') else str(file)
    file_name = os.path.basename(file_path)
    suffix = os.path.splitext(file_name)[1].lower()

    if suffix in TEXT_EXTENSIONS or suffix == '':
        try:
            with open(file_path, 'r', encoding='utf-8', errors='ignore') as f:
                content = f.read()
            # Cap at 50KB to avoid overwhelming context
            if len(content) > 50000:
                content = content[:50000] + f"\n\n... (truncated, {len(content)} total chars)"
            return f"**Uploaded: {file_name}**\n```\n{content}\n```"
        except Exception as e:
            return f"**Uploaded: {file_name}** (error reading: {e})"
    else:
        return f"**Uploaded: {file_name}** (binary file, {os.path.getsize(file_path):,} bytes)"
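The 50KB cap is the load-bearing detail here: it bounds how much of an uploaded file reaches the prompt while recording the original length. A standalone sketch of just that truncation step (`cap_content` is a hypothetical helper, not an app function):

```python
def cap_content(content: str, cap: int = 50000) -> str:
    """Truncate file content for prompt injection, noting the original length."""
    if len(content) > cap:
        return content[:cap] + f"\n\n... (truncated, {len(content)} total chars)"
    return content

short = cap_content("x" * 100)
capped = cap_content("x" * 60000)
print(len(short))                                        # 100
print(capped.endswith("(truncated, 60000 total chars)")) # True
```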
 
 
# =============================================================================
# AGENTIC LOOP
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# The core conversation loop. For each user message:
# 1. Build messages array with system prompt + history + new message
# 2. Send to Kimi K2.5 via HF Inference API
# 3. Parse response for tool calls
# 4. If READ tool calls: execute immediately, inject results, loop back to Kimi
# 5. If WRITE tool calls: stage in approval queue, notify user
# 6. If no tool calls: return conversational response
# 7. Save the turn to ChromaDB for persistent memory
#
# The loop runs up to MAX_ITERATIONS times to handle multi-step tool use.
# Each iteration either executes tools and loops, or returns the final text.
#
# IMPORTANT: Gradio 5.0+ chatbot with type="messages" expects history as a
# list of {"role": str, "content": str} dicts. We maintain that format
# throughout.
# =============================================================================
MAX_ITERATIONS = 5
def agent_loop(message: str, history: list, pending_proposals: list, uploaded_file) -> tuple:
    """Main agentic conversation loop.

    Args:
        message: User's text input
        history: Chat history as list of {"role": ..., "content": ...} dicts
        pending_proposals: Current list of staged write proposals (gr.State)
        uploaded_file: Optional uploaded file from the file input widget

    Returns:
        Tuple of (updated_history, cleared_textbox, updated_proposals,
        updated_gate_choices, updated_stats_files, updated_stats_convos)
    """
    if not message.strip() and uploaded_file is None:
        # Nothing to do
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    # Inject uploaded file content if present
    full_message = message.strip()
    if uploaded_file is not None:
        file_context = process_uploaded_file(uploaded_file)
        if file_context:
            full_message = f"{file_context}\n\n{full_message}" if full_message else file_context

    if not full_message:
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    # Add user message to history
    history = history + [{"role": "user", "content": full_message}]

    # Build messages for the API
    system_prompt = build_system_prompt()
    api_messages = [{"role": "system", "content": system_prompt}]

    # Include recent history (cap to avoid token overflow)
    # Keep last 20 turns to stay within Kimi's context window
    recent_history = history[-40:]  # 40 entries = ~20 turns (user+assistant pairs)
    for h in recent_history:
        api_messages.append({"role": h["role"], "content": h["content"]})

    # Agentic loop: tool calls -> execution -> re-prompt -> repeat
    accumulated_text = ""
    staged_this_turn = []

    for iteration in range(MAX_ITERATIONS):
        try:
            response = client.chat_completion(
                model=MODEL_ID,
                messages=api_messages,
                max_tokens=2048,
                temperature=0.7
            )
            content = response.choices[0].message.content or ""
        except Exception as e:
            error_msg = f"API Error: {e}"
            history = history + [{"role": "assistant", "content": error_msg}]
            return (
                history, "", pending_proposals,
                _format_gate_choices(pending_proposals),
                _stats_label_files(), _stats_label_convos()
            )

        # Parse for tool calls
        tool_calls = parse_tool_calls(content)
        conversational_text = extract_conversational_text(content)
        if conversational_text:
            accumulated_text += ("\n\n" if accumulated_text else "") + conversational_text

        if not tool_calls:
            # No tools — this is the final response
            break

        # Process each tool call
        tool_results_for_context = []
        for tool_name, args in tool_calls:
            result = execute_tool(tool_name, args)
            if result["status"] == "executed":
                # READ tool — executed, feed result back to model
                tool_results_for_context.append(
                    f"[Tool Result: {tool_name}]\n{result['result']}"
                )
            elif result["status"] == "staged":
                # WRITE tool — staged for approval. Index suffix keeps IDs
                # unique when several tools are staged in the same second.
                proposal = {
                    "id": f"proposal_{int(time.time())}_{tool_name}_{len(staged_this_turn)}",
                    "tool": tool_name,
                    "args": result["args"],
                    "description": result["description"],
                    "timestamp": time.strftime("%H:%M:%S")
                }
                staged_this_turn.append(proposal)
                tool_results_for_context.append(
                    f"[Tool {tool_name}: STAGED for human approval. "
                    f"Josh will review this in the Build Approval Gate.]"
                )
            elif result["status"] == "error":
                tool_results_for_context.append(
                    f"[Tool Error: {tool_name}]\n{result['result']}"
                )

        # If we only had staged tools and no reads, break the loop
        if tool_results_for_context:
            # Feed tool results back as a user message for the next iteration
            combined_results = "\n\n".join(tool_results_for_context)
            api_messages.append({"role": "assistant", "content": content})
            api_messages.append({"role": "user", "content": f"[Tool Results]\n{combined_results}"})
        else:
            break

    # Build final response
    final_response = accumulated_text

    # Append staging notifications if any writes were staged
    if staged_this_turn:
        staging_notice = "\n\n---\n**Staged for your approval** (see Build Approval Gate tab):\n"
        for proposal in staged_this_turn:
            staging_notice += f"- {proposal['description']}\n"
        final_response += staging_notice
        # Add to persistent queue
        pending_proposals = pending_proposals + staged_this_turn

    if not final_response:
        final_response = "I processed your request but didn't generate a text response. Check the Build Approval Gate for any staged operations."

    # Add assistant response to history
    history = history + [{"role": "assistant", "content": final_response}]

    # Save conversation turn for persistent memory
    try:
        turn_count = len([h for h in history if h["role"] == "user"])
        ctx.save_conversation_turn(full_message, final_response, turn_count)
    except Exception:
        pass  # Don't crash the UI if persistence fails

    return (
        history,
        "",  # Clear the textbox
        pending_proposals,
        _format_gate_choices(pending_proposals),
        _stats_label_files(),
        _stats_label_convos()
    )
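The control flow above reduces to: call the model, parse tool calls, execute them, feed results back, repeat until a turn produces no tool calls. A toy, dependency-free sketch of that shape (every name here is a hypothetical stand-in, not an app function):

```python
def mini_agent_loop(model_fn, parse_fn, exec_fn, user_msg, max_iters=5):
    """Skeleton of the agentic loop: call model, run tools, feed results back."""
    messages = [{"role": "user", "content": user_msg}]
    for _ in range(max_iters):
        content = model_fn(messages)
        calls = parse_fn(content)
        if not calls:
            return content  # no tool calls: this is the final answer
        results = "\n\n".join(exec_fn(name, args) for name, args in calls)
        messages.append({"role": "assistant", "content": content})
        messages.append({"role": "user", "content": f"[Tool Results]\n{results}"})
    return "(max iterations reached)"

# Stub model: first turn requests a tool, second turn answers.
def stub_model(messages):
    return "CALL" if len(messages) == 1 else "done: 3 files"

parse = lambda c: [("list_files", {})] if c == "CALL" else []
exec_tool = lambda name, args: "a.py\nb.py\nc.py"
print(mini_agent_loop(stub_model, parse, exec_tool, "what files exist?"))  # done: 3 files
```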
 
 
# =============================================================================
# BUILD APPROVAL GATE
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# The HITL gate for reviewing and approving staged write operations.
# Josh sees a checklist of proposed changes, can select which to approve,
# and clicks Execute. Approved operations run; rejected ones are discarded.
#
# DESIGN DECISION: CheckboxGroup shows descriptions, but we need to map
# back to the actual proposal objects for execution. We use the proposal
# ID as the checkbox value and display the description as the label.
# =============================================================================
def _format_gate_choices(proposals: list):
    """Format pending proposals as CheckboxGroup choices.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Gradio 6.x deprecated gr.update(). Return a new component instance instead.

    Args:
        proposals: List of proposal dicts from staging

    Returns:
        gr.CheckboxGroup with updated choices
    """
    if not proposals:
        return gr.CheckboxGroup(choices=[], value=[])

    choices = []
    for p in proposals:
        label = f"[{p['timestamp']}] {p['description']}"
        choices.append((label, p['id']))
    return gr.CheckboxGroup(choices=choices, value=[])
 
 
def execute_approved_proposals(selected_ids: list, pending_proposals: list,
                               history: list) -> tuple:
    """Execute approved proposals, remove from queue, inject results into chat.

    CHANGELOG [2026-02-01 - Claude/Opus]
    PROBLEM: Approved operations executed and showed results in the Gate tab,
    but the chatbot conversation never received them. Kimi couldn't continue
    reasoning because it never saw what happened. Josh had to manually go
    back and re-prompt.

    FIX: After execution, inject results into chat history as an assistant
    message. A chained .then() call (auto_continue_after_approval) picks
    up the updated history and sends a synthetic "[Continue]" through the
    agent loop so Kimi sees the tool results and keeps working.

    Args:
        selected_ids: List of proposal IDs that Josh approved
        pending_proposals: Full list of pending proposals
        history: Current chatbot message history (list of dicts)

    Returns:
        Tuple of (results_markdown, updated_proposals, updated_gate_choices,
        updated_chatbot_history)
    """
    if not selected_ids:
        return (
            "No proposals selected.",
            pending_proposals,
            _format_gate_choices(pending_proposals),
            history
        )

    results = []
    remaining = []

    for proposal in pending_proposals:
        if proposal['id'] in selected_ids:
            # Execute this one
            result = execute_staged_tool(proposal['tool'], proposal['args'])
            results.append(f"**{proposal['tool']}**: {result}")
        else:
            # Keep in queue
            remaining.append(proposal)

    results_text = "## Execution Results\n\n" + "\n\n".join(results) if results else "Nothing executed."

    # Inject results into chat history so Kimi sees them next turn
    if results:
        result_summary = "**Approved operations executed:**\n\n" + "\n\n".join(results)
        history = history + [{"role": "assistant", "content": result_summary}]

    return results_text, remaining, _format_gate_choices(remaining), history


def auto_continue_after_approval(history: list, pending_proposals: list) -> tuple:
    """Automatically re-enter the agent loop after approval so Kimi sees results.

    CHANGELOG [2026-02-01 - Claude/Opus]
    PROBLEM: After Josh approved staged operations, results were injected into
    chat history but Kimi never got another turn. Josh had to type something
    like "continue" to trigger Kimi to process the tool results.

    FIX: This function is chained via .then() after execute_approved_proposals.
    It sends a synthetic continuation prompt through the agent loop so Kimi
    automatically processes the approved tool results and continues working.

    We only continue if the last message in history is our injected results
    (starts with '**Approved'). This prevents infinite loops if called
    when there's nothing to continue from.

    Args:
        history: Chat history (should contain injected results from approval)
        pending_proposals: Current pending proposals (passed through)

    Returns:
        Same tuple shape as agent_loop so it can update the same outputs
    """
    # Safety check: only continue if last message is our injected results
    if not history or history[-1].get("role") != "assistant":
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    last_msg = history[-1].get("content", "")
    if not last_msg.startswith("**Approved"):
        return (
            history, "", pending_proposals,
            _format_gate_choices(pending_proposals),
            _stats_label_files(), _stats_label_convos()
        )

    # Re-enter the agent loop with a synthetic continuation prompt.
    # This tells Kimi "I approved your operations, here are the results,
    # now keep going with whatever you were doing."
    return agent_loop(
        message="[The operations above were approved and executed. Continue with your task using these results.]",
        history=history,
        pending_proposals=pending_proposals,
        uploaded_file=None
    )
 
 
def clear_all_proposals(pending_proposals: list) -> tuple:
    """Discard all pending proposals without executing.

    CHANGELOG [2026-02-01 - Claude/Opus]
    Safety valve — lets Josh throw out everything in the queue if the
    agent went off track.

    Returns:
        Tuple of (status_message, empty_proposals, updated_gate_choices)
    """
    count = len(pending_proposals)
    return f"Cleared {count} proposal(s).", [], _format_gate_choices([])
 
 
# =============================================================================
# STATS HELPERS
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Helper functions to format stats for the sidebar labels.
# Called both at startup (initial render) and after each conversation turn
# (to reflect newly indexed files or saved conversations).
# =============================================================================
def _stats_label_files() -> str:
    """Format the files stat for the sidebar label."""
    stats = ctx.get_stats()
    files = stats.get('total_files', 0)
    chunks = stats.get('indexed_chunks', 0)
    indexing = " (indexing)" if stats.get('indexing_in_progress') else ""
    return f"Files: {files} ({chunks} chunks){indexing}"


def _stats_label_convos() -> str:
    """Format the conversations stat for the sidebar label."""
    stats = ctx.get_stats()
    convos = stats.get('conversations', 0)
    cloud = " (cloud backup)" if stats.get('persistence_configured') else ""
    return f"Conversations: {convos}{cloud}"


def refresh_stats() -> tuple:
    """Refresh both stat labels. Called by the refresh button.

    Returns:
        Tuple of (files_label, convos_label)
    """
    return _stats_label_files(), _stats_label_convos()
 
 
894
  # =============================================================================
 
895
  # UI LAYOUT
 
896
  # =============================================================================
 
897
  # CHANGELOG [2026-02-01 - Gemini]
 
898
  # RESTORED: Metrics sidebar and multi-tab layout.
899
+ #
 
 
900
  # CHANGELOG [2026-02-01 - Claude/Opus]
 
901
  # IMPLEMENTED: All the wiring. Every button, input, and display is now
 
902
  # connected to actual functions.
903
+ #
 
 
904
  # Layout:
905
+ # Tab 1 "Vibe Chat" β€” Main conversation interface with sidebar stats
906
+ # Tab 2 "Build Approval Gate" β€” HITL review for staged write operations
907
+ #
 
 
 
 
908
  # gr.State holds the pending proposals list (per-session, survives across
 
909
  # messages within the same browser tab).
 
910
  # =============================================================================
 
911
  with gr.Blocks(
912
+ title=" Clawdbot Command Center",
913
  # CHANGELOG [2026-02-01 - Claude/Opus]
914
  # Gradio 6.0+ moved `theme` from Blocks() to launch(). Passing it here
915
  # triggers a UserWarning in 6.x. Theme is set in launch() below instead.
916
  ) as demo:
917
  # Session state for pending proposals
918
  pending_proposals_state = gr.State([])
919
+ gr.Markdown("# Clawdbot Command Center\n*E-T Systems Vibe Coding Agent*")
 
 
 
    with gr.Tabs():
        # ==== TAB 1: VIBE CHAT ====
        with gr.Tab(" Vibe Chat"):
            with gr.Row():
                # ---- Sidebar ----
                with gr.Column(scale=1, min_width=200):
                    gr.Markdown("### System Status")
                    stats_files = gr.Markdown(_stats_label_files())
                    stats_convos = gr.Markdown(_stats_label_convos())
                    refresh_btn = gr.Button(" Refresh Stats", size="sm")
                    gr.Markdown("---")
                    gr.Markdown("### Upload Context")
                    file_input = gr.File(
                        label="Drop a file here",
                        file_types=[
                            '.py', '.js', '.ts', '.json', '.md', '.txt',
                            '.yaml', '.yml', '.html', '.css', '.sh',
                            '.toml', '.cfg', '.csv', '.xml'
                        ]
                    )
                    gr.Markdown(
                        "*Upload code, configs, or docs to include in your message.*"
                    )

                # ---- Chat area ----
                with gr.Column(scale=4):
                    chatbot = gr.Chatbot(
                        # CHANGELOG [2026-02-01 - Claude/Opus]
                        # Gradio 6.x uses messages format by default.
                        # The type="messages" param was removed in 6.0;
                        # passing it causes TypeError on init.
                        height=600,
                        show_label=False,
                        # Avatar URL tail is missing upstream; kept truncated.
                        avatar_images=(None, "https://em-content.zobj.net/source/twitter/408/"),
                    )
                    with gr.Row():
                        msg = gr.Textbox(
                            placeholder="Ask Clawdbot to search, read, or code...",
                            show_label=False,
                            scale=6,
                            lines=2,
                            max_lines=10,
                        )
                        send_btn = gr.Button("Send", variant="primary", scale=1)

            # Chat submission inputs. The matching output list also references
            # Gate-tab components, so it is assembled in the event-wiring
            # section below, after every component exists.
            chat_inputs = [msg, chatbot, pending_proposals_state, file_input]

        # ==== TAB 2: BUILD APPROVAL GATE ====
        with gr.Tab(" Build Approval Gate"):
            gr.Markdown(
                "### Review Staged Operations\n"
                "Write operations (file writes, shell commands, branch creation) "
                "are staged here for your review before execution.\n\n"
                "**Select proposals to approve, then click Execute.**"
            )
            gate_list = gr.CheckboxGroup(
                label="Pending Proposals",
                choices=[],
                interactive=True
            )
            with gr.Row():
                btn_exec = gr.Button(" Execute Selected", variant="primary")
                btn_clear = gr.Button(" Clear All", variant="secondary")
            gate_results = gr.Markdown("*No operations executed yet.*")
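The sidebar's file upload is only useful if its contents reach the model. A sketch of that injection step, assuming the upload arrives as a temp-file path (the way gr.File delivers it to handlers); `inject_file_context` is an illustrative name, and the real injection happens inside agent_loop:

```python
# Hypothetical helper showing how an uploaded file can be folded into the
# outgoing user message before it reaches the model. gr.File hands the
# handler a temp-file path; we read it and prefix the chat message.

from pathlib import Path

def inject_file_context(message: str, file_path=None) -> str:
    """Prefix the chat message with the uploaded file's contents."""
    if not file_path:
        return message
    path = Path(file_path)
    content = path.read_text(encoding="utf-8", errors="replace")
    return (
        f"[Uploaded file: {path.name}]\n"
        f"----- begin file -----\n{content}\n----- end file -----\n\n"
        f"{message}"
    )
```

Reading with `errors="replace"` keeps the handler from crashing on a binary or oddly encoded upload; the model just sees replacement characters.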
    # ==================================================================
    # EVENT WIRING
    # ==================================================================
    # All events are wired here, after all components are defined, so
    # cross-tab references work (e.g., chat updating the gate_list).
    # ==================================================================

    # Chat submission (both Enter key and Send button)
    full_chat_outputs = [
        chatbot, msg, pending_proposals_state,
        gate_list, stats_files, stats_convos
    ]

    msg.submit(
        fn=agent_loop,
        inputs=chat_inputs,
        outputs=full_chat_outputs
    )
    send_btn.click(
        fn=agent_loop,
        inputs=chat_inputs,
        outputs=full_chat_outputs
    )

    # Refresh stats button
    refresh_btn.click(
        fn=refresh_stats,
        inputs=[],
        outputs=[stats_files, stats_convos]
    )

    # Build Approval Gate buttons
    # CHANGELOG [2026-02-01 - Claude/Opus]
    # btn_exec now takes chatbot as input AND output so approved operation
    # results land in the chat history; the chained .then() automatically
    # re-enters the agent loop so Kimi processes the results without Josh
    # having to type "continue".
    btn_exec.click(
        fn=execute_approved_proposals,
        inputs=[gate_list, pending_proposals_state, chatbot],
        outputs=[gate_results, pending_proposals_state, gate_list, chatbot]
    ).then(
        fn=auto_continue_after_approval,
        inputs=[chatbot, pending_proposals_state],
        outputs=[chatbot, msg, pending_proposals_state,
                 gate_list, stats_files, stats_convos]
    )
    btn_clear.click(
        fn=clear_all_proposals,
        inputs=[pending_proposals_state],
        outputs=[gate_results, pending_proposals_state, gate_list]
    )
 
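Behind the gate buttons, the queue handling reduces to list bookkeeping: executing removes the selected proposals from the pending queue, clearing empties it. A sketch of just that state transition, with hypothetical names and proposal shapes (the real execute_approved_proposals also runs the staged operations):

```python
# Illustrative state transition behind the gate buttons: split the pending
# queue by the CheckboxGroup selection. Proposal dicts are hypothetical;
# only the label field has to match what the checkboxes display.

def split_approved(selected_labels, pending):
    """Return (approved, still_pending) for a checkbox selection."""
    chosen = set(selected_labels)
    approved = [p for p in pending if p["label"] in chosen]
    still_pending = [p for p in pending if p["label"] not in chosen]
    return approved, still_pending

pending = [
    {"label": "write_file: app.py", "tool": "write_file"},
    {"label": "shell_execute: pytest -q", "tool": "shell_execute"},
]
approved, remaining = split_approved(["write_file: app.py"], pending)
print([p["tool"] for p in approved])   # ['write_file']
print([p["tool"] for p in remaining])  # ['shell_execute']
```

Matching on the displayed label keeps the CheckboxGroup and the gr.State queue in sync without needing separate IDs, at the cost of requiring labels to be unique.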
 
# =============================================================================
# LAUNCH
# =============================================================================
# CHANGELOG [2026-02-01 - Claude/Opus]
# Standard HF Spaces launch config. 0.0.0.0 binds to all interfaces
# (required for Docker). Port 7860 is the HF Spaces standard.
# =============================================================================
if __name__ == "__main__":
    demo.launch(server_name="0.0.0.0", server_port=7860, theme=gr.themes.Soft())