gridworld-env / OpenEnv /docs /environment-builder.md
Abhilasha Kakoty
Initial deploy
7078f4d
# Building Your Own Environment with OpenEnv
This guide walks you through creating a custom environment using the `OpenEnv` framework and the `openenv` CLI.
The CLI handles scaffolding, builds, validation, and deployment so you can stay focused on environment logic.
## Overview
A typical workflow looks like:
1. Scaffold a new environment with `openenv init`.
2. Customize your models, environment logic, and FastAPI server.
3. Implement a typed `EnvClient` (WebSocket-based for persistent sessions).
4. Configure dependencies and the Dockerfile once.
5. Use the CLI (`openenv build`, `openenv validate`, `openenv push`) to package and share your work.
!!! note
These integrations are handled automatically by the `openenv` CLI when you run `openenv init`.
### Prerequisites
- Python 3.11+ and [`uv`](https://github.com/astral-sh/uv) for dependency locking
- Docker Desktop / Docker Engine
- The OpenEnv library installed: `pip install https://github.com/meta-pytorch/OpenEnv.git`
## Step-by-Step Guide
Let's walk through the process of building a custom environment with OpenEnv.
### 1. Scaffold with `openenv init`
```bash
# Run from anywhere – defaults to current directory
openenv init my_env
# Optionally choose an output directory
openenv init my_env --output-dir /Users/you/envs
```
The command creates a fully-typed template with `openenv.yaml`, `pyproject.toml`, `uv.lock`, Docker assets, and stub implementations. If you're working inside this repo, move the generated folder under `envs/`.
Typical layout:
```
my_env/
├── __init__.py
├── README.md
├── client.py
├── models.py
├── openenv.yaml
├── pyproject.toml
├── uv.lock
└── server/
├── __init__.py
├── app.py
├── my_environment.py
├── requirements.txt
└── Dockerfile
```
Python classes are generated for the action, observation, environment, and client. For example, you will find `MyEnvironment`, `MyAction`, `MyObservation`, and `MyEnv` (client) in the `my_env` directory based on the name you provided. The environment uses the core `State` class from `openenv.core.env_server.types`.
### 2. Define Models
Edit `models.py` to describe your action and observation using Pydantic:
```python
# models.py
from pydantic import Field
from openenv.core.env_server.types import Action, Observation
class MyAction(Action):
"""Your custom action."""
command: str = Field(..., description="Command to execute")
parameters: dict = Field(default_factory=dict, description="Command parameters")
class MyObservation(Observation):
"""Your custom observation."""
result: str = Field(..., description="Result of the action")
success: bool = Field(..., description="Whether the action succeeded")
```
### 3. Implement Environment Logic
Customize `server/my_environment.py` by extending `Environment`:
```python
# server/my_environment.py
from uuid import uuid4
from openenv.core.env_server.interfaces import Environment
from openenv.core.env_server.types import State
from models import MyAction, MyObservation
class MyEnvironment(Environment):
def __init__(self):
self._state = State(episode_id=str(uuid4()), step_count=0)
def reset(self) -> MyObservation:
self._state = State(episode_id=str(uuid4()), step_count=0)
return MyObservation(result="Ready", success=True, done=False, reward=0.0)
def step(self, action: MyAction) -> MyObservation:
# Implement your logic here
self._state.step_count += 1
result = self._execute_command(action.command)
return MyObservation(result=result, success=True, done=False, reward=1.0)
@property
def state(self) -> State:
return self._state
```
### 4. Create the FastAPI Server
`server/app.py` should expose the environment through `create_app`.
**Important:** You must pass a class or factory function (not an instance) to enable WebSocket-based concurrent sessions:
```python
# server/app.py
from openenv.core.env_server import create_app
from ..models import MyAction, MyObservation
from .my_environment import MyEnvironment
# Pass the class (factory) - each WebSocket session gets its own instance
app = create_app(MyEnvironment, MyAction, MyObservation, env_name="my_env")
```
For environments with constructor arguments, create a factory function:
```python
# server/app.py
import os
from openenv.core.env_server import create_app
from ..models import MyAction, MyObservation
from .my_environment import MyEnvironment
# Read config from environment variables
api_key = os.getenv("MY_API_KEY")
timeout = int(os.getenv("MY_TIMEOUT", "30"))
def create_my_environment():
"""Factory function that creates MyEnvironment with config."""
return MyEnvironment(api_key=api_key, timeout=timeout)
# Pass the factory function
app = create_app(create_my_environment, MyAction, MyObservation, env_name="my_env")
```
### 5. Implement the Client
`client.py` extends `EnvClient` so users can interact with your server via WebSocket for persistent sessions:
```python
# client.py
from openenv.core.env_client import EnvClient
from openenv.core.client_types import StepResult
from .models import MyAction, MyObservation, MyState
class MyEnv(EnvClient[MyAction, MyObservation, MyState]):
def _step_payload(self, action: MyAction) -> dict:
return {"command": action.command, "parameters": action.parameters}
def _parse_result(self, payload: dict) -> StepResult[MyObservation]:
obs_data = payload.get("observation", {})
obs = MyObservation(
result=obs_data.get("result", ""),
success=obs_data.get("success", False),
done=payload.get("done", False),
reward=payload.get("reward"),
)
return StepResult(
observation=obs,
reward=payload.get("reward"),
done=payload.get("done", False),
)
def _parse_state(self, payload: dict) -> State:
return State(
episode_id=payload.get("episode_id"),
step_count=payload.get("step_count", 0),
)
```
The `EnvClient` maintains a persistent WebSocket connection to the server, enabling efficient multi-step interactions with lower latency compared to HTTP. Each client instance gets its own dedicated environment session on the server.
### 6. Configure Dependencies & Dockerfile
The CLI template ships with `pyproject.toml` and `server/Dockerfile`. You should manage your python dependencies with `uv` or `pip` in the `pyproject.toml` file. Other dependencies should be installed in the Dockerfile.
Keep building from the `openenv-base` image so shared tooling stays available:
<details>
<summary>Dockerfile</summary>
```dockerfile
# Copyright (c) Meta Platforms, Inc. and affiliates.
# All rights reserved.
#
# This source code is licensed under the BSD-style license found in the
# LICENSE file in the root directory of this source tree.
# Multi-stage build using openenv-base
# This Dockerfile is flexible and works for both:
# - In-repo environments (with local src/core)
# - Standalone environments (with openenv from pip)
# The build script (openenv build) handles context detection and sets appropriate build args.
ARG BASE_IMAGE=openenv-base:latest
FROM ${BASE_IMAGE} AS builder
WORKDIR /app
# Build argument to control whether we're building standalone or in-repo
ARG BUILD_MODE=in-repo
ARG ENV_NAME=__ENV_NAME__
# Copy environment code (always at root of build context)
COPY . /app/env
# For in-repo builds, openenv is already in the pyproject.toml dependencies
# For standalone builds, openenv will be installed from pip via pyproject.toml
WORKDIR /app/env
# Install dependencies using uv sync
# If uv.lock exists, use it; otherwise resolve on the fly
RUN --mount=type=cache,target=/root/.cache/uv \
if [ -f uv.lock ]; then \
uv sync --frozen --no-install-project --no-editable; \
else \
uv sync --no-install-project --no-editable; \
fi
RUN --mount=type=cache,target=/root/.cache/uv \
if [ -f uv.lock ]; then \
uv sync --frozen --no-editable; \
else \
uv sync --no-editable; \
fi
# Final runtime stage
FROM ${BASE_IMAGE}
WORKDIR /app
# Copy the virtual environment from builder
COPY --from=builder /app/env/.venv /app/.venv
# Copy the environment code
COPY --from=builder /app/env /app/env
# Set PATH to use the virtual environment
ENV PATH="/app/.venv/bin:$PATH"
# Set PYTHONPATH so imports work correctly
ENV PYTHONPATH="/app/env:$PYTHONPATH"
# Health check
HEALTHCHECK --interval=30s --timeout=3s --start-period=5s --retries=3 \
CMD curl -f http://localhost:8000/health || exit 1
# Run the FastAPI server
# The module path is constructed to work with the /app/env structure
CMD ["sh", "-c", "cd /app/env && uvicorn server.app:app --host 0.0.0.0 --port 8000"]
```
</details>
If you introduced extra dependencies in the Dockerfile, you should install them in the Dockerfile before removing temp files.
### 7. Build & Validate with the CLI
From the environment directory:
```bash
cd envs/my_env
openenv build # Builds Docker image (auto-detects context)
openenv validate --verbose
```
`openenv build` understands both standalone environments and in-repo ones. Useful flags:
- `--tag/-t`: override the default `openenv-<env_name>` tag
- `--build-arg KEY=VALUE`: pass multiple Docker build arguments
- `--dockerfile` / `--context`: custom locations when experimenting
- `--no-cache`: force fresh dependency installs
`openenv validate` checks for required files, ensures the Dockerfile/server entrypoints function, and lists supported deployment modes. The command exits non-zero if issues are found so you can wire it into CI.
You can also validate a running environment endpoint and get criteria-level JSON:
```bash
openenv validate --url http://localhost:8000
# or
openenv validate https://username-my-env.hf.space
```
Example runtime output:
```json
{
"target": "http://localhost:8000",
"validation_type": "running_environment",
"standard_version": "1.0.0",
"passed": true,
"summary": {
"passed_count": 6,
"total_count": 6,
"failed_criteria": []
},
"criteria": [
{"id": "health_endpoint", "passed": true},
{"id": "metadata_endpoint", "passed": true},
{"id": "schema_endpoint", "passed": true}
]
}
```
### 8. Push & Share with `openenv push`
Once validation passes, the CLI can deploy directly to Hugging Face Spaces or any registry:
```bash
# Push to HF Spaces (auto enables web UI and prompts for login if needed)
openenv push
# Push to a specific repo or namespace
openenv push --repo-id my-org/my-env
# Push to Docker/ghcr (interface disabled by default)
openenv push --registry ghcr.io/my-org --tag my-env:latest
# Customize image base or visibility
openenv push --base-image ghcr.io/meta-pytorch/openenv-base:latest --private
```
Key options:
- `--directory`: path to the environment (defaults to `cwd`)
- `--repo-id`: explicit Hugging Face space name
- `--registry`: push to Docker Hub, GHCR, etc.
- `--interface/--no-interface`: toggle the optional web UI
- `--base-image`: override the Dockerfile `FROM`
- `--private`: mark the space as private
- Hidden files/directories (`.*`) are excluded by default for Hugging Face uploads
- `--exclude/-e`: optional additional ignore file (newline-separated glob patterns) merged on top of defaults
The command validates your `openenv.yaml`, injects Hugging Face frontmatter when needed, and uploads the prepared bundle.
### 9. Automate Builds (optional)
To trigger Docker builds on every push to `main`, add your environment to the matrix in `.github/workflows/docker-build.yml`:
```yaml
strategy:
matrix:
image:
- name: echo-env
dockerfile: envs/echo_env/server/Dockerfile
- name: chat-env
dockerfile: envs/chat_env/server/Dockerfile
- name: coding-env
dockerfile: envs/coding_env/server/Dockerfile
- name: my-env # Add your environment here
dockerfile: envs/my_env/server/Dockerfile
```
### Use Your Environment
For an end-to-end example of using your environment, see the [Quick Start](quickstart.md) guide. Here is a simple example of using your environment:
```python
from envs.my_env import MyAction, MyEnv
# Create environment from Docker image
client = MyEnv.from_docker_image("my-env:latest")
# Or, connect to the remote space on Hugging Face
client = MyEnv.from_hub("my-org/my-env")
# Or, connect to the local server
client = MyEnv(base_url="http://localhost:8000")
# Use context manager for automatic cleanup (recommended)
with client:
# Reset
result = client.reset()
print(result.observation.result) # "Ready"
# Execute actions
result = client.step(MyAction(command="test", parameters={}))
print(result.observation.result)
print(result.observation.success)
# Get state
state = client.state()
print(state.episode_id)
print(state.step_count)
# Or manually manage the connection
try:
client = MyEnv(base_url="http://localhost:8000")
result = client.reset()
result = client.step(MyAction(command="test", parameters={}))
finally:
client.close()
```
## Troubleshooting
### WebSocket Connection Closed During RL Training
**Symptom:** When training with high token generation or long-running operations, you may see:
```
websockets.exceptions.ConnectionClosedError: keepalive ping timeout
```
**Cause:** Uvicorn's default WebSocket ping timeout is 20 seconds. During heavy workloads (e.g., LLM token generation), the client may not respond to pings in time, causing the server to close the connection.
**Solution:** Increase the WebSocket ping interval and timeout in your Dockerfile:
```dockerfile
# Before (default - 20 second timeout)
CMD ["sh", "-c", "cd /app/env && uvicorn server.app:app --host 0.0.0.0 --port 8000"]
# After (5 minute timeout)
CMD ["sh", "-c", "cd /app/env && uvicorn server.app:app --host 0.0.0.0 --port 8000 --ws-ping-interval 300 --ws-ping-timeout 300"]
```
After modifying the Dockerfile, rebuild and redeploy:
```bash
openenv build
openenv push
```
**Available options:**
| Parameter | Description | Default | Recommended for RL |
|-----------|-------------|---------|-------------------|
| `--ws-ping-interval` | Seconds between server pings | 20 | 300 (5 min) |
| `--ws-ping-timeout` | Seconds to wait for pong response | 20 | 300 (5 min) |
!!! tip
For local development without Docker, pass the same flags directly to uvicorn:
```bash
uvicorn server.app:app --host 0.0.0.0 --port 8000 --ws-ping-interval 300 --ws-ping-timeout 300
```
---
## Nice work! You've now built and used your own OpenEnv environment.
Your next steps are to:
- [Try out the end-to-end tutorial](https://colab.research.google.com/github/meta-pytorch/OpenEnv/blob/main/examples/OpenEnv_Tutorial.ipynb)