Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / context-course /pr_7 /en /unit6 /introduction.md

HuggingFaceDocBuilder

24 days ago

preview code

download

raw

3.57 kB

	# Introduction to Nano Harness

	Mainstream agent frameworks (Codex, Claude Code, OpenCode) hide the loop that makes them work, which is great for production use, but not for learning.

	Nano Harness is a ~220-line Python agent built for reading, not production. It shows the system prompt, the step loop, tool execution, message history, error handling, and sandboxing in one file.

	> [!WARNING]
	> nano_harness is a learning tool and not intended for production use. It's just a fun way to understand how agents work under the hood. Do not use it for real work!

	## Why start from scratch?

	Learning from scratch is a great way to understand how anything works under the hood.

	Reading a minimal harness makes the design choices legible: when to retry, how to parse output, how to handle errors, and where the security boundary sits. This unit uses `zai-org/GLM-5.1` via Hugging Face Inference Providers as the single model, so the only moving parts are the loop and the tools.

	## What is Nano Harness?

	Nano Harness is a ~220-line Python agent framework that:

	1. Takes a task — "Inspect the workspace and provide a summary"
	2. Calls an LLM — Uses OpenAI-compatible API (defaults to HF router)
	3. Generates Python code — Model outputs executable Python code that will help complete the task
	4. Executes safely — In a sandboxed environment with allowed tools
	5. Observes results — stdout, stderr, exceptions, or a captured final answer
	6. Updates context — Feeds observations back to the model
	7. Repeats — Until the task is done or we hit step limit (we're assuming 50 steps in our implementation)

	## Key Features

	### Code-First Agent
	Instead of returning JSON or text, the model outputs Python code:

	```python
	# Model says:
	files = list_dir(".")
	models = read_file("models.txt")
	final_answer("Files:\n" + "\n".join(files) + "\n\nmodels.txt:\n" + models)
	```

	The agent parses and executes it.

	### Constrained Tools
	Four tools available:

	\| Tool \| What It Does \| Security \|
	\|------\|-------------\|----------\|
	\| `list_dir(path)` \| List directory contents \| Path confinement \|
	\| `read_file(path, max_chars)` \| Read file \| Path confinement, size limit \|
	\| `write_file(path, content)` \| Create/modify file \| Path confinement, disabled by default \|
	\| `exec_cmd(args)` \| Run shell command \| Allowlist (only: ls, cat, pwd, echo, head, tail, wc, rg) \|

	### Sandboxed Execution
	- All file access confined to workspace
	- Commands restricted to allowlist
	- Output size limited to prevent context overflow
	- Timeouts prevent hanging

	### Model via Hugging Face Inference Providers

	The harness defaults to the Hugging Face router, which exposes an OpenAI-compatible `/v1` surface backed by Inference Providers:

	```bash
	export NANO_MODEL="zai-org/GLM-5.1"
	export HF_TOKEN="hf_..."
	```

	## The Agentic Loop

	Nano Harness implements the core loop:

	```
	1. Call LLM with task + message history
	2. Parse model output as Python code
	3. Execute Python (with tools available)
	4. Collect stdout, stderr, exceptions
	5. Append observation to message history
	6. Repeat (max 50 steps)
	7. Done when: final_answer() called or max steps reached
	```

	## What You'll Learn

	This unit walks through the agent loop code piece by piece, explains how the tools are designed and sandboxed, extends the harness with `web_fetch` and HF Hub search, and runs it against `zai-org/GLM-5.1` through Hugging Face Inference Providers.

	## Prerequisites

	Basic Python, familiarity with HTTP APIs and sandboxing, and an HF token with access to Inference Providers.

Xet Storage Details

Size:: 3.57 kB
Xet hash:: 7d7e6d0924267ca77d5822a23c0673cfac254cc2234d2ed68114ece88c4a2e63

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.