2. System Components
2.1 User Interface
Streamlit-based web UI.
Features:
- Prompt input and `/HELP` commands
- Code generation via `/WRITE_PY`
- Code editing + execution (CPU or optional GPU)
- Book, story, or poem management
- Session logs and downloads
2.2 Agent Core – pygmyclaw.py
Handles all user-agent interaction:
- Receives user prompts.
- Converts LLM output into JSON tool calls.
- Dynamically loads and executes Python tools.
Speculative decoding:
- 3 drafters + 1 verifier for robust output.
Queue system:
- Redis or JSON-file queue for task scheduling.
Artifact management:
- Stores code, logs, and task outputs in workspace.
- Supports automatic dependency installation.
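The JSON tool-call dispatch described above could be sketched as follows; the `dispatch` function, the registry dict, and the call shape `{"tool": ..., "args": ...}` are illustrative assumptions, not the actual pygmyclaw.py API:

```python
import json

# Illustrative tool registry; the real agent loads tools dynamically.
TOOLS = {
    "echo": lambda text: text,
    "sys_info": lambda: {"platform": "linux"},  # stub for illustration
}

def dispatch(llm_output: str) -> dict:
    """Parse an LLM reply as a JSON tool call and execute the named tool."""
    call = json.loads(llm_output)  # e.g. {"tool": "echo", "args": {"text": "hi"}}
    tool = TOOLS.get(call["tool"])
    if tool is None:
        return {"error": f"unknown tool: {call['tool']}"}
    return {"result": tool(**call.get("args", {}))}

# Example: the LLM asks to run the echo tool.
print(dispatch('{"tool": "echo", "args": {"text": "hi"}}'))
```

Keeping the tool interface as plain JSON is what lets new tools be added at runtime without changing the dispatch loop.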
2.3 Python Multitool – pygmyclaw_multitool.py
Tool registry:
`list_tools_detailed`, `sys_info`, `log_error`, `echo`, etc.
Dynamic tool addition:
- Agents can create new tools that are callable via JSON.
Safe execution sandbox:
- Python subprocess with controlled input/output.
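A minimal sketch of the registry plus sandboxed subprocess execution; the `tool` decorator and the 30-second timeout are assumptions, not the actual multitool code:

```python
import subprocess
import sys

REGISTRY = {}

def tool(fn):
    """Register a function so it is callable by name via JSON tool calls."""
    REGISTRY[fn.__name__] = fn
    return fn

@tool
def echo(text: str) -> str:
    return text

@tool
def run_python(code: str, timeout: int = 30) -> dict:
    """Execute untrusted code in a subprocess with captured, bounded I/O."""
    proc = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True, text=True, timeout=timeout,
    )
    return {"stdout": proc.stdout, "stderr": proc.stderr, "rc": proc.returncode}

print(run_python("print(2 + 2)")["stdout"])
```

Running generated code in a child interpreter keeps a crash or infinite loop from taking down the agent process itself.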
2.4 LLM Interaction
Ollama / HF-hosted model as backend:
- Multi-instance support for parallel drafters.
- Token-based speculative decoding.
Workflow:
User prompt → Agent (LLM) → JSON tool call → Python tool executes → Result returned to LLM → Final response to user
Dynamic code generation & execution integrated:
- e.g., user asks for PyTorch demo → agent installs PyTorch → generates editable code → runs it in UI.
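The drafter/verifier arrangement could be sketched as below; the majority-vote rule and the stubbed `draft` function are assumptions (the real system runs multiple model instances against Ollama or an HF-hosted backend):

```python
from collections import Counter

def draft(prompt: str, seed: int) -> str:
    """Stub for one drafter instance; a real drafter would query the LLM backend."""
    candidates = ["print('hello')", "print('hello')", "print('helo')"]
    return candidates[seed % len(candidates)]

def verify(prompt: str, drafts: list[str]) -> str:
    """Stub verifier: accept the draft the most drafters agree on."""
    return Counter(drafts).most_common(1)[0][0]

def speculative_generate(prompt: str, n_drafters: int = 3) -> str:
    """3 drafters + 1 verifier, as in the architecture above."""
    drafts = [draft(prompt, seed) for seed in range(n_drafters)]
    return verify(prompt, drafts)

print(speculative_generate("write hello world"))
```

The point of the pattern is that independent drafts expose unreliable outputs: a verifier comparing them catches the one-off typo a single model call would have returned.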
2.5 Persistent Memory – Hugging Face
Repository:
`rahul7star/pyclaw`
Stores:
- Generated code & scripts
- Task outputs (`.out`)
- Logs & session history
- Dynamic tools and metadata
Mechanism:
- Push artifacts via the `huggingface_hub` API
- Pull existing artifacts for agent memory
Benefits: Enables long-term learning, cross-session continuity, and reproducibility.
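The push/pull mechanism might look like the following; `HfApi.upload_file` and `hf_hub_download` are real `huggingface_hub` calls, but the `kind/name` repo layout is an assumption (pushing requires a valid HF token):

```python
def repo_path(kind: str, name: str) -> str:
    """Map an artifact kind to its location in the memory repo (assumed layout)."""
    return f"{kind}/{name}"  # e.g. "outputs/task42.out", "tools/new_tool.py"

def push_artifact(local_path: str, kind: str, name: str,
                  repo_id: str = "rahul7star/pyclaw") -> None:
    """Push one artifact to the persistent memory repo."""
    from huggingface_hub import HfApi  # imported lazily; needs auth to run
    HfApi().upload_file(
        path_or_fileobj=local_path,
        path_in_repo=repo_path(kind, name),
        repo_id=repo_id,
    )

def pull_artifact(kind: str, name: str,
                  repo_id: str = "rahul7star/pyclaw") -> str:
    """Fetch an artifact back for agent memory; returns a local file path."""
    from huggingface_hub import hf_hub_download
    return hf_hub_download(repo_id=repo_id, filename=repo_path(kind, name))
```

Because the repo is just versioned files, a later session can pull the same paths and resume with the previous session's code, logs, and tools.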
2.6 Autonomous Task Management
Queue-based execution:
- Tasks added by user or agent itself.
- Background processor executes tasks in order.
Speculative execution:
- Multi-instance drafters improve code reliability.
- Verifier ensures correctness of outputs.
Dynamic tools:
- Tools can evolve or new tools can be created on-the-fly.
2.7 Safety & Resource Management
Code execution sandbox:
- Controlled Python subprocess.
- Auto cleanup of temporary files.
CPU/GPU selection:
- Default: CPU
- Optional: GPU if available and environment variable set.
Dependency management:
- Automatic package installation (e.g., PyTorch for user-requested demos).
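The device selection and auto-install logic could be sketched as follows; the `PYGMY_USE_GPU` variable name is an assumption, while `torch.cuda.is_available()` and `pip install` via the running interpreter are standard calls:

```python
import importlib
import os
import subprocess
import sys

def pick_device() -> str:
    """Default to CPU; use GPU only if explicitly requested and available."""
    if os.environ.get("PYGMY_USE_GPU") != "1":  # assumed env var name
        return "cpu"
    try:
        import torch
        return "cuda" if torch.cuda.is_available() else "cpu"
    except ImportError:
        return "cpu"

def ensure_package(name: str) -> None:
    """Install a package on demand, e.g. torch for a user-requested demo."""
    try:
        importlib.import_module(name)
    except ImportError:
        subprocess.run([sys.executable, "-m", "pip", "install", name], check=True)

print(pick_device())
```

Defaulting to CPU keeps demos runnable on the free Spaces tier; the opt-in env var makes GPU use an explicit choice rather than a silent dependency.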
3. Example User Workflow
User Prompt:
"Create a neural network demo in Python using PyTorch."
Agent Actions:
- Detects PyTorch requirement → installs on CPU.
- Generates Python code using `/WRITE_PY`.
- Saves code in Hugging Face repo for memory.
- Displays code in UI for editing.
- Runs code → outputs printed and logged.
- Updates agent memory with results and execution logs.
- Optionally creates a new dynamic tool for future NN generation.
4. Evolvable Architecture
| Feature | Current Status | Future Evolution |
|---|---|---|
| Dynamic Tool Creation | ✅ | Can auto-generate new tools from tasks |
| Long-Term Memory | ✅ via HF | Add semantic search, embeddings for context |
| Speculative Decoding | ✅ | Increase drafters, multi-agent cooperation |
| Autonomous Task Execution | Partial | Recursive task planning, multi-step project handling |
| Dependency Management | ✅ | Expand to virtual environments per project |
| Safe Execution | Partial | Containerized execution (Docker/WSL) |
5. Roadmap to Claude-Like Autonomy
- Enhance memory: Semantic embeddings + search in HF repo.
- Recursive reasoning: Agent generates subtasks autonomously.
- Multi-agent collaboration: Multiple PygmyClaws coordinate on large projects.
- Learning from outputs: Store completed tasks + feedback for continuous improvement.
- Safety & isolation: Dockerized Python execution with resource limits.
- Dynamic UI: Allow live editing, execution, and visualization of code outputs.
6. Diagram of End-to-End Flow
┌───────────────┐
│  User Prompt  │
└───────┬───────┘
        │
        ▼
┌───────────────┐
│  Agent (LLM)  │
└───────┬───────┘
        │ JSON Tool Call
        ▼
┌───────────────┐
│  Python Tool  │
│ Executes Task │
└───────┬───────┘
        │ Result
        ▼
┌───────────────┐
│  Agent (LLM)  │
│   Processes   │
└───────┬───────┘
        │
        ▼
┌───────────────┐
│   UI Output   │
│ (Code/Result) │
└───────┬───────┘
        │
        ▼
┌───────────────┐
│  Persistent   │
│  Memory Repo  │
└───────────────┘