---
title: ArbitrAgent — OpenEnv 0.2.1 Environments
emoji: 🤝
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: "4.0"
app_file: hf_spaces_app.py
python_version: "3.10"
short_description: Diplomacy, Contractor, and Human Imitation negotiation envs (OpenEnv 0.2.1).
tags:
  - openenv
  - reinforcement-learning
  - negotiation
  - diplomacy
  - gradio
---

# ArbitrAgent — OpenEnv 0.2.1 Demo

Three **OpenEnv 0.2.1**-compatible negotiation environments in one Gradio app:

| Tab | Environment | Description |
|-----|-------------|-------------|
| **DiplomacyNegotiationEnv** | Multi-party Diplomacy game state; supply centers, phases, strategic position. |
| **ContractorNegotiationEnv** | Contractor bidding: budget, multiple contractors, urgency; natural-language negotiation. |
| **HumanImitationEnv** | Real human Diplomacy game states; reward aligned with human outcomes (Phase 2 training). |

## How to use

1. Open a tab for the environment you want.
2. Click **Reset** to start an episode (initial state appears).
3. Type a **natural-language action** in the text box.
4. Click **Submit** to step the environment; **Reward / Info** and the updated state are shown.
5. Repeat until the episode is done, then click **Reset** to start again.

## Requirements

- **OpenEnv 0.2.1** API: `reset()` → (observation, info), `step(action)` → (observation, reward, done, info), `render()` for state text.
- **HumanImitationEnv** needs `training/data/selfplay_states.json` (or `selfplay_states_test.json`) in the repo; if you deploy only the `deploy/` folder, include a data file or that tab will show an error until data is provided.

## Deploying this Space

- **Space root = this folder (`deploy/`)**: Keep `app_file: hf_spaces_app.py` and use `deploy/requirements.txt`. For HumanImitationEnv you must add `training/data/` (or a sample JSON) into the Space.
- **Space root = full repo**: Set `app_file: deploy/hf_spaces_app.py`, add `sys.path` or install the repo as a package so `envs.diplomacy_env`, `envs.contractor_env`, and `envs.human_imitation_env` are importable; include `training/data/selfplay_states.json` for HumanImitationEnv.

Part of **ArbitrAgent** for the OpenEnv Hackathon (March 7–8 2026).