# Docker All Workspace

Each subdirectory is an experiment workspace mounted into a Docker container where Claude Code autonomously evolves C++ solutions for Frontier-CS problems.

## Prerequisites

- `claude-docker` image built locally
- `Competitive-Programming` judge container running on network `algorithmic_default`
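
Both prerequisites can be checked before launching anything. The sketch below is one possible way to do that, assuming the image is tagged exactly `claude-docker` and the judge container is named exactly `Competitive-Programming`. The function name `check_prereqs` is hypothetical and not part of this repository.

```shell
# Hypothetical helper: report whether each prerequisite is in place.
# Returns 0 if everything was found, 1 otherwise.
check_prereqs() {
    prereq_status=0

    # The image must exist locally.
    if docker image inspect claude-docker >/dev/null 2>&1; then
        echo "image claude-docker: found"
    else
        echo "image claude-docker: MISSING"
        prereq_status=1
    fi

    # The judge network must exist.
    if docker network inspect algorithmic_default >/dev/null 2>&1; then
        echo "network algorithmic_default: found"
    else
        echo "network algorithmic_default: MISSING"
        prereq_status=1
    fi

    # The judge container must be running (docker ps lists running containers only).
    if docker ps --filter "name=Competitive-Programming" --format '{{.Names}}' | grep -q .; then
        echo "judge container: running"
    else
        echo "judge container: NOT RUNNING"
        prereq_status=1
    fi

    return "$prereq_status"
}
```

Calling `check_prereqs` from the repository root prints one line per prerequisite and returns non-zero if any of them is missing.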

## Workspace Structure

```
docker_all_workspace/
├── ev2_skill_0409/                    # Experiment: ev2 skill evaluation
│   ├── frontier_cs_1/                 # Problem 1 workspace
│   │   ├── INSTRUCTION.md             # Agent instructions (includes ev2 skill reference)
│   │   ├── ev2_skill.md               # Evolve-evaluation skill document
│   │   ├── statement.txt              # Problem description
│   │   ├── chk.cc                     # Checker (reference only)
│   │   ├── config.yaml                # Time/memory limits
│   │   ├── examples/                  # Baseline solutions
│   │   │   ├── gpt5.cpp
│   │   │   └── gemini3pro.cpp
│   │   ├── logs/                      # Evolution history (created by agent)
│   │   └── best/                      # Best solution (created by agent)
│   ├── frontier_cs_2/
│   ├── ...
│   └── frontier_cs_10/
```
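
A quick sanity check of one problem workspace against the layout above could look like the following sketch. It assumes that `INSTRUCTION.md`, `statement.txt`, `config.yaml`, and `examples/` are the entries that must exist before a run; that choice, and the name `check_workspace`, are illustrative assumptions rather than rules from this repository.

```shell
# Hypothetical helper: list expected entries missing from one problem workspace.
# Usage: check_workspace docker_all_workspace/ev2_skill_0409/frontier_cs_1
check_workspace() {
    dir=$1
    missing=0

    # Files assumed to be required before the agent starts.
    for f in INSTRUCTION.md statement.txt config.yaml; do
        if [ ! -f "$dir/$f" ]; then
            echo "missing: $dir/$f"
            missing=1
        fi
    done

    # examples/ holds the baseline solutions; logs/ and best/ are created
    # by the agent, so they are not checked here.
    if [ ! -d "$dir/examples" ]; then
        echo "missing: $dir/examples/"
        missing=1
    fi

    return "$missing"
}
```

The function prints nothing and returns 0 when all checked entries are present.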

## How to Launch

### 1. Create and start the container

```bash
EXPERIMENT="ev2_skill_0409"

docker run -d \
  --name "${EXPERIMENT}" \
  --privileged \
  --shm-size=4g \
  -v "$(pwd)/docker_all_workspace/${EXPERIMENT}":/workspace \
  claude-docker \
  sleep infinity
```

### 2. Connect to the judge network

```bash
docker network connect algorithmic_default "${EXPERIMENT}"
```
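
If the container may already be attached to the network, the connect step can be guarded so that it is safe to rerun. This is a minimal sketch, assuming the standard `docker inspect --format` Go-template output; `connect_once` is a hypothetical name.

```shell
# Hypothetical helper: connect only if the container is not on the network yet.
# Usage: connect_once "${EXPERIMENT}" algorithmic_default
connect_once() {
    container=$1
    network=$2

    # Print the name of every network the container is attached to, one per line.
    attached=$(docker inspect \
        --format '{{range $k, $v := .NetworkSettings.Networks}}{{println $k}}{{end}}' \
        "$container")

    # -x matches whole lines only, so "bridge" does not match "bridge2".
    if printf '%s\n' "$attached" | grep -qx "$network"; then
        echo "$container already on $network"
    else
        docker network connect "$network" "$container"
    fi
}
```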

### 3. Verify judge connectivity

```bash
docker exec "${EXPERIMENT}" curl -s http://Competitive-Programming:8081/problems | head -c 100
```
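
The judge may need a moment before it answers, so the connectivity check can be wrapped in a retry loop. The sketch below reuses the `/problems` URL from the command above; the function name `wait_for_judge`, the default of 10 attempts, and the `JUDGE_RETRY_DELAY` variable are assumptions made for illustration.

```shell
# Hypothetical helper: poll the judge from inside the container until it answers.
# Usage: wait_for_judge "${EXPERIMENT}" [attempts]
wait_for_judge() {
    container=$1
    attempts=${2:-10}
    i=1
    while [ "$i" -le "$attempts" ]; do
        # -s silences progress output; -f makes curl exit non-zero on HTTP errors.
        if docker exec "$container" curl -sf http://Competitive-Programming:8081/problems >/dev/null; then
            echo "judge reachable after $i attempt(s)"
            return 0
        fi
        # Seconds to wait between attempts (defaults to 2).
        sleep "${JUDGE_RETRY_DELAY:-2}"
        i=$((i + 1))
    done
    echo "judge not reachable after $attempts attempts" >&2
    return 1
}
```

A non-zero return value usually means the container has not been connected to `algorithmic_default` or the judge is not running.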

### 4. Enter the container

```bash
docker exec -it "${EXPERIMENT}" bash
```

Inside the container, `/workspace/` contains all problem workspaces. Navigate to the problem you want to work on:

```bash
cd /workspace/frontier_cs_1
```

### 5. Start Claude Code

Launch Claude Code from the problem directory and give it the following prompt:

```
Follow INSTRUCTION.md, please use iterative refinement to improve the scores you achieve, the higher the better. You can log different generations under logs/. Keep your best solution and scores under best/. I believe you can do it. 
IMPORTANT: you can evolve your own evaluation process as well to find some insightful perspectives on how to escape local optima and to create better solutions.
```

## How to Create a New Experiment

```bash
EXPERIMENT="my_experiment_YYYYMMDD"
mkdir -p docker_all_workspace/${EXPERIMENT}

# For each problem:
for PID in $(seq 1 10); do
    DIR="docker_all_workspace/${EXPERIMENT}/frontier_cs_${PID}"
    mkdir -p "${DIR}/examples" "${DIR}/logs" "${DIR}/best"

    SRC="tasks/Frontier-CS/algorithmic/problems/${PID}"
    SOL="tasks/Frontier-CS/algorithmic/solutions/${PID}"

    cp "${SRC}/statement.txt" "${DIR}/"
    cp "${SRC}/config.yaml" "${DIR}/"

    # These files may be absent for some problems, so cp errors are silenced.
    cp "${SRC}/chk.cc" "${DIR}/" 2>/dev/null
    cp "${SOL}/gpt5.cpp" "${DIR}/examples/" 2>/dev/null
    cp "${SOL}/gemini3pro.cpp" "${DIR}/examples/" 2>/dev/null

    # Copy skill files and create INSTRUCTION.md as needed
done
```
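
Because the loop silences `cp` errors for `chk.cc` and the baseline solutions, a missing file goes unnoticed. A short report such as the sketch below can show what was actually copied; `report_missing` is a hypothetical helper, not something shipped with this repository.

```shell
# Hypothetical helper: report which silently-copied files are absent.
# Usage: report_missing "docker_all_workspace/${EXPERIMENT}"
report_missing() {
    root=$1
    # The trailing slash restricts the glob to directories.
    for dir in "$root"/frontier_cs_*/; do
        # If nothing matched, the pattern stays literal and is skipped here.
        [ -d "$dir" ] || continue
        for f in chk.cc examples/gpt5.cpp examples/gemini3pro.cpp; do
            [ -f "${dir}${f}" ] || echo "${dir}${f} not present"
        done
    done
}
```

No output means every problem workspace has its checker and both baseline solutions.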

Then launch the container following steps 1-5 above.

## Quick Reference

```bash
# List all experiment containers
docker ps --filter "ancestor=claude-docker" --format "table {{.Names}}\t{{.Status}}"

# Stop and remove
docker stop "${EXPERIMENT}" && docker rm "${EXPERIMENT}"

# Check judge
docker ps --filter "name=Competitive-Programming"
```