Spaces:

seatyyy
/

skillforge

Sleeping

App Files Files Community

seatyyy commited on Mar 8

Commit

9a6cc06

1 Parent(s): ef6eadd

deploy skillforge

Browse files

Files changed (4) hide show

README.md +2 -251
models.py +5 -3
openenv.yaml +5 -0
server/Dockerfile +2 -3

README.md CHANGED Viewed

@@ -1,255 +1,6 @@
 ---
-title: Skill Forge Environment Server
-emoji: 🎧
-colorFrom: yellow
-colorTo: red
 sdk: docker
 pinned: false
-app_port: 8000
-base_path: /web
-tags:
-  - openenv
 ---
-# Skill Forge Environment
-A simple test environment that echoes back messages. Perfect for testing the env APIs as well as demonstrating environment usage patterns.
-## Quick Start
-The simplest way to use the Skill Forge environment is through the `SkillForgeEnv` class:
-```python
-from skill_forge import SkillForgeAction, SkillForgeEnv
-try:
-    # Create environment from Docker image
-    skill_forgeenv = SkillForgeEnv.from_docker_image("skill_forge-env:latest")
-    # Reset
-    result = skill_forgeenv.reset()
-    print(f"Reset: {result.observation.echoed_message}")
-    # Send multiple messages
-    messages = ["Hello, World!", "Testing echo", "Final message"]
-    for msg in messages:
-        result = skill_forgeenv.step(SkillForgeAction(message=msg))
-        print(f"Sent: '{msg}'")
-        print(f"  → Echoed: '{result.observation.echoed_message}'")
-        print(f"  → Length: {result.observation.message_length}")
-        print(f"  → Reward: {result.reward}")
-finally:
-    # Always clean up
-    skill_forgeenv.close()
-```
-That's it! The `SkillForgeEnv.from_docker_image()` method handles:
-- Starting the Docker container
-- Waiting for the server to be ready
-- Connecting to the environment
-- Container cleanup when you call `close()`
-## Building the Docker Image
-Before using the environment, you need to build the Docker image:
-```bash
-# From project root
-docker build -t skill_forge-env:latest -f server/Dockerfile .
-```
-## Deploying to Hugging Face Spaces
-You can easily deploy your OpenEnv environment to Hugging Face Spaces using the `openenv push` command:
-```bash
-# From the environment directory (where openenv.yaml is located)
-openenv push
-# Or specify options
-openenv push --namespace my-org --private
-```
-The `openenv push` command will:
-1. Validate that the directory is an OpenEnv environment (checks for `openenv.yaml`)
-2. Prepare a custom build for Hugging Face Docker space (enables web interface)
-3. Upload to Hugging Face (ensuring you're logged in)
-### Prerequisites
-- Authenticate with Hugging Face: The command will prompt for login if not already authenticated
-### Options
-- `--directory`, `-d`: Directory containing the OpenEnv environment (defaults to current directory)
-- `--repo-id`, `-r`: Repository ID in format 'username/repo-name' (defaults to 'username/env-name' from openenv.yaml)
-- `--base-image`, `-b`: Base Docker image to use (overrides Dockerfile FROM)
-- `--private`: Deploy the space as private (default: public)
-### Examples
-```bash
-# Push to your personal namespace (defaults to username/env-name from openenv.yaml)
-openenv push
-# Push to a specific repository
-openenv push --repo-id my-org/my-env
-# Push with a custom base image
-openenv push --base-image ghcr.io/meta-pytorch/openenv-base:latest
-# Push as a private space
-openenv push --private
-# Combine options
-openenv push --repo-id my-org/my-env --base-image custom-base:latest --private
-```
-After deployment, your space will be available at:
-`https://huggingface.co/spaces/<repo-id>`
-The deployed space includes:
-- **Web Interface** at `/web` - Interactive UI for exploring the environment
-- **API Documentation** at `/docs` - Full OpenAPI/Swagger interface
-- **Health Check** at `/health` - Container health monitoring
-- **WebSocket** at `/ws` - Persistent session endpoint for low-latency interactions
-## Environment Details
-### Action
-**SkillForgeAction**: Contains a single field
-- `message` (str) - The message to echo back
-### Observation
-**SkillForgeObservation**: Contains the echo response and metadata
-- `echoed_message` (str) - The message echoed back
-- `message_length` (int) - Length of the message
-- `reward` (float) - Reward based on message length (length × 0.1)
-- `done` (bool) - Always False for echo environment
-- `metadata` (dict) - Additional info like step count
-### Reward
-The reward is calculated as: `message_length × 0.1`
-- "Hi" → reward: 0.2
-- "Hello, World!" → reward: 1.3
-- Empty message → reward: 0.0
-## Advanced Usage
-### Connecting to an Existing Server
-If you already have a Skill Forge environment server running, you can connect directly:
-```python
-from skill_forge import SkillForgeEnv
-# Connect to existing server
-skill_forgeenv = SkillForgeEnv(base_url="<ENV_HTTP_URL_HERE>")
-# Use as normal
-result = skill_forgeenv.reset()
-result = skill_forgeenv.step(SkillForgeAction(message="Hello!"))
-```
-Note: When connecting to an existing server, `skill_forgeenv.close()` will NOT stop the server.
-### Using the Context Manager
-The client supports context manager usage for automatic connection management:
-```python
-from skill_forge import SkillForgeAction, SkillForgeEnv
-# Connect with context manager (auto-connects and closes)
-with SkillForgeEnv(base_url="http://localhost:8000") as env:
-    result = env.reset()
-    print(f"Reset: {result.observation.echoed_message}")
-    # Multiple steps with low latency
-    for msg in ["Hello", "World", "!"]:
-        result = env.step(SkillForgeAction(message=msg))
-        print(f"Echoed: {result.observation.echoed_message}")
-```
-The client uses WebSocket connections for:
-- **Lower latency**: No HTTP connection overhead per request
-- **Persistent session**: Server maintains your environment state
-- **Efficient for episodes**: Better for many sequential steps
-### Concurrent WebSocket Sessions
-The server supports multiple concurrent WebSocket connections. To enable this,
-modify `server/app.py` to use factory mode:
-```python
-# In server/app.py - use factory mode for concurrent sessions
-app = create_app(
-    SkillForgeEnvironment,  # Pass class, not instance
-    SkillForgeAction,
-    SkillForgeObservation,
-    max_concurrent_envs=4,  # Allow 4 concurrent sessions
-)
-```
-Then multiple clients can connect simultaneously:
-```python
-from skill_forge import SkillForgeAction, SkillForgeEnv
-from concurrent.futures import ThreadPoolExecutor
-def run_episode(client_id: int):
-    with SkillForgeEnv(base_url="http://localhost:8000") as env:
-        result = env.reset()
-        for i in range(10):
-            result = env.step(SkillForgeAction(message=f"Client {client_id}, step {i}"))
-        return client_id, result.observation.message_length
-# Run 4 episodes concurrently
-with ThreadPoolExecutor(max_workers=4) as executor:
-    results = list(executor.map(run_episode, range(4)))
-```
-## Development & Testing
-### Direct Environment Testing
-Test the environment logic directly without starting the HTTP server:
-```bash
-# From the server directory
-python3 server/skill_forge_environment.py
-```
-This verifies that:
-- Environment resets correctly
-- Step executes actions properly
-- State tracking works
-- Rewards are calculated correctly
-### Running Locally
-Run the server locally for development:
-```bash
-uvicorn server.app:app --reload
-```
-## Project Structure
-```
-skill_forge/
-├── .dockerignore         # Docker build exclusions
-├── __init__.py            # Module exports
-├── README.md              # This file
-├── openenv.yaml           # OpenEnv manifest
-├── pyproject.toml         # Project metadata and dependencies
-├── uv.lock                # Locked dependencies (generated)
-├── client.py              # SkillForgeEnv client
-├── models.py              # Action and Observation models
-└── server/
-    ├── __init__.py        # Server module exports
-    ├── skill_forge_environment.py  # Core environment logic
-    ├── app.py             # FastAPI application (HTTP + WebSocket endpoints)
-    └── Dockerfile         # Container image definition
-```

 ---
+title: SkillForge
+emoji: 🔨
 sdk: docker
 pinned: false
 ---

models.py CHANGED Viewed

@@ -19,9 +19,9 @@ class SkillForgeAction(Action):
     """Action for the Skill Forge environment"""
     action_type: Literal["create_skill", "use_skill", "raw_code"]
     content: str = Field(description="The content of the action. For create_skill, it is the template. For use_skill, it is the skill id. For raw_code, it is the code.")
-    skill_name: Optional[str] = None # only for create_skill
     reasoning: str = ""
-    params: Optional[dict] = None
 class SkillForgeObservation(Observation):
@@ -30,10 +30,12 @@ class SkillForgeObservation(Observation):
     task_description: str
     snapshot_data: str  #df.head(5).to_string()
     skill_library: dict
-    context: str
     result_correct: bool
     result_output: str
     expected_output: str
     step_count: int
     total_tokens: int

     """Action for the Skill Forge environment"""
     action_type: Literal["create_skill", "use_skill", "raw_code"]
     content: str = Field(description="The content of the action. For create_skill, it is the template. For use_skill, it is the skill id. For raw_code, it is the code.")
+    skill_name: str = "" # only for create_skill
     reasoning: str = ""
+    params: dict = Field(default_factory=dict, description="Template slot values for use_skill")
 class SkillForgeObservation(Observation):
     task_description: str
     snapshot_data: str  #df.head(5).to_string()
     skill_library: dict
+    context: str
     result_correct: bool
     result_output: str
     expected_output: str
     step_count: int
     total_tokens: int
+    reward: Optional[float] = Field(default=None, description="Reward signal from the last action")
+    done: bool = Field(default=False, description="Whether the episode has terminated")

openenv.yaml CHANGED Viewed

@@ -4,4 +4,9 @@ type: space
 runtime: fastapi
 app: server.app:app
 port: 8000

 runtime: fastapi
 app: server.app:app
 port: 8000
+hf_space:
+  sdk: docker
+  hardware: cpu-basic
+env_vars:
+  ENABLE_WEB_INTERFACE: "true"

server/Dockerfile CHANGED Viewed

@@ -10,8 +10,7 @@
 # - Standalone environments (with openenv from PyPI/Git)
 # The build script (openenv build) handles context detection and sets appropriate build args.
-ARG BASE_IMAGE=ghcr.io/meta-pytorch/openenv-base:latest
-FROM ${BASE_IMAGE} AS builder
 WORKDIR /app
@@ -55,7 +54,7 @@ RUN --mount=type=cache,target=/root/.cache/uv \
     fi
 # Final runtime stage
-FROM ${BASE_IMAGE}
 WORKDIR /app

 # - Standalone environments (with openenv from PyPI/Git)
 # The build script (openenv build) handles context detection and sets appropriate build args.
+FROM ghcr.io/meta-pytorch/openenv-base:latest AS builder
 WORKDIR /app
     fi
 # Final runtime stage
+FROM ghcr.io/meta-pytorch/openenv-base:latest
 WORKDIR /app