Spaces:

deebee7
/

moltbot-hybrid-engine

Running

App Files Files Community

dboa9 commited on 26 days ago

Commit

e787148

1 Parent(s): 9a86887

update

Browse files

Files changed (5) hide show

Dockerfile +21 -5
README.md +13 -2
app.py +66 -22
requirements.txt +2 -1
start.sh +18 -1

Dockerfile CHANGED Viewed

@@ -1,8 +1,7 @@
 # Moltbot Hybrid Engine - Multi-service Dockerfile
-# Runs: FastAPI (port 7860) + Ollama (optional, background)
-# Build: 2026-02-08 v6.0
 # FIX v6: Dual LLM backend - Ollama (if available) + HF Inference API fallback
-# HF Inference API works on Free tier without GPU/Ollama
 FROM python:3.11-slim
@@ -15,9 +14,22 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
     git \
     git-lfs \
     file \
     && apt-get clean \
     && rm -rf /var/lib/apt/lists/*
 # Install Ollama AS ROOT - pinned version, force amd64
 # Using pinned version URL to avoid redirect issues during Docker build
 # Mark as OPTIONAL - app works without it via HF Inference API fallback
@@ -52,14 +64,18 @@ RUN pip install --no-cache-dir --upgrade pip
 # Copy all files with correct ownership
 COPY --chown=user . /app
 # Install Python dependencies (includes huggingface_hub for Inference API)
 RUN pip install --no-cache-dir -r requirements.txt
 # Make start script executable
 RUN chmod +x start.sh
-# Expose HF Spaces port
-EXPOSE 7860
 # CMD required (not ENTRYPOINT) for dev mode compatibility
 CMD ["./start.sh"]

 # Moltbot Hybrid Engine - Multi-service Dockerfile
+# Runs: FastAPI (port 7860) + Ollama (optional) + OpenClaw/Clawdbot gateway (port 18789)
+# Build: 2026-02-14 v7.0 — Add Clawdbot (OpenClaw) for autonomous agent in HF Space
 # FIX v6: Dual LLM backend - Ollama (if available) + HF Inference API fallback
 FROM python:3.11-slim
     git \
     git-lfs \
     file \
+    ca-certificates \
     && apt-get clean \
     && rm -rf /var/lib/apt/lists/*
+# Install Node.js 22 (required for OpenClaw/Clawdbot)
+RUN curl -fsSL https://deb.nodesource.com/setup_22.x | bash - \
+    && apt-get install -y nodejs \
+    && apt-get clean \
+    && rm -rf /var/lib/apt/lists/* \
+    && node -v \
+    && npm -v
+# Install OpenClaw (Clawdbot) globally so gateway can run in Space
+RUN npm install -g openclaw@latest \
+    && (command -v openclaw || true)
 # Install Ollama AS ROOT - pinned version, force amd64
 # Using pinned version URL to avoid redirect issues during Docker build
 # Mark as OPTIONAL - app works without it via HF Inference API fallback
 # Copy all files with correct ownership
 COPY --chown=user . /app
+# OpenClaw/Clawdbot: minimal config so gateway starts without interactive onboarding
+RUN mkdir -p /home/user/.openclaw/workspace
+COPY --chown=user openclaw.json /home/user/.openclaw/openclaw.json
 # Install Python dependencies (includes huggingface_hub for Inference API)
 RUN pip install --no-cache-dir -r requirements.txt
 # Make start script executable
 RUN chmod +x start.sh
+# Expose HF Spaces port (7860) and OpenClaw gateway (18789, internal)
+EXPOSE 7860 18789
 # CMD required (not ENTRYPOINT) for dev mode compatibility
 CMD ["./start.sh"]

README.md CHANGED Viewed

@@ -9,9 +9,9 @@ app_port: 7860
 ---
 # Moltbot Hybrid Engine
-Safe AI agent for legal document processing - Dual LLM backend + file matching + analysis.
-**Version 6.0.0** - Last updated: 2026-02-08
 ## Required Space Secrets
@@ -26,3 +26,14 @@ Set these in Space Settings > Repository secrets:
 1. **Ollama** (local in container) - runs qwen2.5:1.5b if binary installs correctly
 2. **HF Inference API** (fallback) - uses Qwen/Qwen2.5-7B-Instruct hosted by HuggingFace (requires HF_TOKEN)

 ---
 # Moltbot Hybrid Engine
+Safe AI agent for legal document processing - Dual LLM backend + file matching + **Clawdbot (OpenClaw)** in the Space.
+**Version 7.0.0** - Last updated: 2026-02-14 — Clawdbot installed; gateway runs on port 18789, proxied at `/gateway`
 ## Required Space Secrets
 1. **Ollama** (local in container) - runs qwen2.5:1.5b if binary installs correctly
 2. **HF Inference API** (fallback) - uses Qwen/Qwen2.5-7B-Instruct hosted by HuggingFace (requires HF_TOKEN)
+## Clawdbot (OpenClaw) in this Space
+OpenClaw/Clawdbot is **installed and started** in this Space so you can use the autonomous agent with Qwen and Claude.
+- **Control UI / WebChat:** After the Space is running, open: **`https://<your-space>.hf.space/gateway`**
+- The gateway runs internally on port 18789; the FastAPI app proxies `/gateway` and `/gateway/*` to it so a single Space port (7860) serves both the API and Clawdbot.
+- **To get autonomous behaviour** (Clawdbot working on court bundle changes with Qwen/Claude):
+  1. Open the Control UI at `/gateway` and complete pairing/setup (model, API keys).
+  2. Add a **skill** that triggers your pipeline (e.g. run the web editor flow, call this Engine, run the bundler). Skills live in `~/.openclaw/workspace/skills/`; you can add a custom skill that calls your court bundle project endpoints or scripts.
+  3. Wire that skill to the same Qwen/Claude endpoints the editor uses (this Space’s `/api/generate` and your Claude API).

app.py CHANGED Viewed

@@ -1,24 +1,16 @@
 """
-Moltbot Hybrid Engine - Production v6.0.0
-Multi-service: FastAPI endpoints + Dual LLM backend (Ollama + HF Inference API)
 Runs on Hugging Face Spaces
-Build: 2026-02-08
-LLM Strategy:
-  1. Try Ollama (local, if installed and running)
-  2. Fallback to HuggingFace Inference API (always available, no GPU needed)
 Endpoints:
   GET  /              - Health check
   GET  /health        - Detailed health status
-  GET  /security      - Security posture info
-  POST /api/generate  - LLM text generation (Ollama -> HF Inference API fallback)
-  POST /api/search    - Fuzzy file matching
-  POST /api/analyze   - Report analysis (JSON body)
-  POST /api/extract_date - Date extraction from filenames
-  POST /tools/analyze_report - Report analysis via file upload
-  POST /v1/chat/completions - OpenAI-compatible endpoint (for Cursor IDE)
-  GET  /v1/models     - OpenAI-compatible model listing
 """
 import os
 import re
@@ -26,8 +18,8 @@ import json
 import subprocess
 import logging
 from pathlib import Path
-from fastapi import FastAPI, HTTPException, Header, UploadFile, File
-from fastapi.responses import StreamingResponse
 from pydantic import BaseModel
 from typing import List, Optional, Dict, Any, Union
@@ -37,8 +29,8 @@ logger = logging.getLogger("moltbot-engine")
 # Initialize App
 app = FastAPI(
     title="Moltbot Hybrid Engine",
-    description="AI agent for legal document processing - Dual LLM + file matching + analysis",
-    version="6.0.0"
 )
 # API Key for authentication
@@ -255,8 +247,9 @@ def health_check():
     return {
         "status": "running",
         "service": "Moltbot Hybrid Engine",
-        "version": "6.0.0",
         "ollama": ollama,
         "hf_inference_api": {
             "available": True,
             "model": HF_MODEL,
@@ -271,7 +264,7 @@ def detailed_health():
     return {
         "status": "healthy",
         "service": "moltbot-hybrid-engine",
-        "version": "6.0.0",
         "llm_backends": {
             "ollama": {
                 "running": ollama.get("running", False),
@@ -287,7 +280,7 @@ def detailed_health():
         },
         "endpoints": ["/", "/health", "/api/generate", "/api/search",
                       "/api/analyze", "/api/extract_date", "/tools/analyze_report",
-                      "/v1/chat/completions", "/v1/models"]
     }
 @app.get("/security")
@@ -302,6 +295,57 @@ def security_info():
     }
 # Legal document exhibit reference instruction — injected into every generate/chat so edit sources always get it
 _prompts_dir = Path(__file__).resolve().parent / "prompts"
 _LEGAL_EXHIBIT_PROMPT_PATH = _prompts_dir / "legal_exhibit_instruction.txt"

 """
+Moltbot Hybrid Engine - Production v7.0.0
+Multi-service: FastAPI + Ollama (optional) + OpenClaw/Clawdbot gateway (proxied at /gateway)
 Runs on Hugging Face Spaces
+Build: 2026-02-14 — Clawdbot installed in Space; gateway on 18789, proxied at /gateway
 Endpoints:
   GET  /              - Health check
   GET  /health        - Detailed health status
+  GET  /gateway       - OpenClaw/Clawdbot Control UI (reverse proxy to gateway :18789)
+  GET  /gateway/{path} - OpenClaw proxy (path)
+  POST /api/generate  - LLM text generation
+  ...
 """
 import os
 import re
 import subprocess
 import logging
 from pathlib import Path
+from fastapi import FastAPI, HTTPException, Header, UploadFile, File, Request
+from fastapi.responses import StreamingResponse, Response
 from pydantic import BaseModel
 from typing import List, Optional, Dict, Any, Union
 # Initialize App
 app = FastAPI(
     title="Moltbot Hybrid Engine",
+    description="AI agent for legal document processing - Dual LLM + file matching + Clawdbot gateway at /gateway",
+    version="7.0.0"
 )
 # API Key for authentication
     return {
         "status": "running",
         "service": "Moltbot Hybrid Engine",
+        "version": "7.0.0",
         "ollama": ollama,
+        "clawdbot": "OpenClaw gateway proxied at /gateway (if running)",
         "hf_inference_api": {
             "available": True,
             "model": HF_MODEL,
     return {
         "status": "healthy",
         "service": "moltbot-hybrid-engine",
+        "version": "7.0.0",
         "llm_backends": {
             "ollama": {
                 "running": ollama.get("running", False),
         },
         "endpoints": ["/", "/health", "/api/generate", "/api/search",
                       "/api/analyze", "/api/extract_date", "/tools/analyze_report",
+                      "/v1/chat/completions", "/v1/models", "/gateway (Clawdbot UI)"]
     }
 @app.get("/security")
     }
+# OpenClaw/Clawdbot gateway reverse proxy (gateway runs on 18789; Space exposes single port 7860)
+OPENCLAW_GATEWAY_URL = "http://127.0.0.1:18789"
+@app.api_route("/gateway", methods=["GET", "POST", "PUT", "DELETE", "PATCH", "OPTIONS"])
+@app.api_route("/gateway/{path:path}", methods=["GET", "POST", "PUT", "DELETE", "PATCH", "OPTIONS"])
+async def proxy_openclaw_gateway(request: Request, path: str = ""):
+    """Proxy requests to OpenClaw/Clawdbot gateway so Control UI and WebChat are reachable at /gateway."""
+    try:
+        import httpx
+    except ImportError:
+        raise HTTPException(status_code=503, detail="httpx not installed; cannot proxy to Clawdbot gateway")
+    target_path = request.url.path
+    if target_path.startswith("/gateway"):
+        target_path = target_path[7:] or "/"  # strip /gateway -> / or /foo
+    target = f"{OPENCLAW_GATEWAY_URL}{target_path}"
+    if request.url.query:
+        target += "?" + request.url.query
+    headers = {k: v for k, v in request.headers.raw if k.lower() not in (b"host", b"connection")}
+    try:
+        body = await request.body()
+    except Exception:
+        body = b""
+    async with httpx.AsyncClient(timeout=30.0) as client:
+        try:
+            r = await client.request(
+                request.method,
+                target,
+                headers=headers,
+                content=body,
+            )
+        except httpx.ConnectError:
+            return Response(
+                content="Clawdbot gateway not reachable (is it running on 18789?). Start the Space and try again.",
+                status_code=503,
+                media_type="text/plain",
+            )
+        except Exception as e:
+            logger.warning(f"[GATEWAY PROXY] {e}")
+            return Response(content=str(e), status_code=502, media_type="text/plain")
+    out_headers = {}
+    for k, v in r.headers.items():
+        if k.lower() not in ("transfer-encoding", "connection"):
+            out_headers[k] = v
+    return Response(
+        content=r.content,
+        status_code=r.status_code,
+        headers=out_headers,
+        media_type=r.headers.get("content-type", "application/octet-stream"),
+    )
 # Legal document exhibit reference instruction — injected into every generate/chat so edit sources always get it
 _prompts_dir = Path(__file__).resolve().parent / "prompts"
 _LEGAL_EXHIBIT_PROMPT_PATH = _prompts_dir / "legal_exhibit_instruction.txt"

requirements.txt CHANGED Viewed

@@ -4,4 +4,5 @@ uvicorn>=0.24.0
 pydantic>=2.0.0
 python-multipart>=0.0.6
 huggingface_hub>=0.20.0
-requests>=2.31.0

 pydantic>=2.0.0
 python-multipart>=0.0.6
 huggingface_hub>=0.20.0
+requests>=2.31.0
+httpx>=0.25.0

start.sh CHANGED Viewed

@@ -79,8 +79,25 @@ echo "  💡 HF Inference API fallback is always available"
 echo "     (Uses Qwen/Qwen2.5-7B-Instruct hosted by HuggingFace)"
 echo ""
 # 4. Start FastAPI (foreground - keeps container alive)
-echo "[4/4] Starting FastAPI on port 7860..."
 echo "============================================================"
 echo ""
 python -m uvicorn app:app --host 0.0.0.0 --port 7860

 echo "     (Uses Qwen/Qwen2.5-7B-Instruct hosted by HuggingFace)"
 echo ""
+# 3b. Start OpenClaw/Clawdbot gateway in background (port 18789) if available
+echo "[3b/5] Starting OpenClaw (Clawdbot) gateway..."
+if command -v openclaw &> /dev/null; then
+    export OPENCLAW_HOME="${HOME:-/home/user}/.openclaw"
+    if [ -d "$OPENCLAW_HOME" ] || [ -f "$OPENCLAW_HOME/openclaw.json" ]; then
+        nohup openclaw gateway --port 18789 > /tmp/openclaw-gateway.log 2>&1 &
+        OPENCLAW_PID=$!
+        echo "  ✅ Clawdbot gateway started (PID $OPENCLAW_PID, port 18789)"
+        echo "  → Control UI / WebChat available via this Space at /gateway (see app)"
+    else
+        echo "  ⚠️  OpenClaw config not found at $OPENCLAW_HOME — skip gateway"
+    fi
+else
+    echo "  ⚠️  openclaw binary not found — skip Clawdbot gateway"
+fi
+echo ""
 # 4. Start FastAPI (foreground - keeps container alive)
+echo "[4/5] Starting FastAPI on port 7860..."
 echo "============================================================"
 echo ""
 python -m uvicorn app:app --host 0.0.0.0 --port 7860