| title: ArbitrAgent - OpenEnv 0.2.1 Environments | |
| emoji: π€ | |
| colorFrom: blue | |
| colorTo: indigo | |
| sdk: gradio | |
| sdk_version: "4.0" | |
| app_file: hf_spaces_app.py | |
| python_version: "3.10" | |
| short_description: Diplomacy, Contractor, and Human Imitation negotiation envs (OpenEnv 0.2.1). | |
| tags: | |
| - openenv | |
| - reinforcement-learning | |
| - negotiation | |
| - diplomacy | |
| - gradio | |
| # ArbitrAgent - OpenEnv 0.2.1 Demo | |
| Three **OpenEnv 0.2.1**-compatible negotiation environments in one Gradio app: | |
| | Environment (tab) | Description | | |
| |-------------|-------------| | |
| | **DiplomacyNegotiationEnv** | Multi-party Diplomacy game state; supply centers, phases, strategic position. | | |
| | **ContractorNegotiationEnv** | Contractor bidding: budget, multiple contractors, urgency; natural-language negotiation. | | |
| | **HumanImitationEnv** | Real human Diplomacy game states; reward aligned with human outcomes (Phase 2 training). | | |
| ## How to use | |
| 1. Open a tab for the environment you want. | |
| 2. Click **Reset** to start an episode (initial state appears). | |
| 3. Type a **natural-language action** in the text box. | |
| 4. Click **Submit** to step the environment; **Reward / Info** and the updated state are shown. | |
| 5. Repeat until the episode is done, then click **Reset** to start again. | |
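The same Reset / Submit cycle can also be driven from code. The following is a minimal sketch, assuming only the `reset()`, `step(action)`, and `render()` signatures listed under Requirements. `StubNegotiationEnv` and `run_episode` are hypothetical stand-ins written for this illustration; they are not the real environment classes, and the reward rule here is invented.

```python
from typing import Any, Dict, List, Tuple


class StubNegotiationEnv:
    """Hypothetical stand-in that mimics the reset/step/render shape."""

    def __init__(self, max_turns: int = 3) -> None:
        self.max_turns = max_turns
        self.turn = 0
        self.last_action = ""

    def reset(self) -> Tuple[str, Dict[str, Any]]:
        # Start a fresh episode and return (observation, info).
        self.turn = 0
        self.last_action = ""
        return self.render(), {"turn": self.turn}

    def step(self, action: str) -> Tuple[str, float, bool, Dict[str, Any]]:
        # Consume one natural-language action and return
        # (observation, reward, done, info).
        self.turn += 1
        self.last_action = action
        # Invented toy reward: 1.0 when the message contains "offer".
        reward = 1.0 if "offer" in action.lower() else 0.0
        done = self.turn >= self.max_turns
        return self.render(), reward, done, {"turn": self.turn}

    def render(self) -> str:
        # State text, as shown in the app after each step.
        return f"turn {self.turn}/{self.max_turns}, last action: {self.last_action!r}"


def run_episode(env: Any, actions: List[str]) -> Tuple[float, Dict[str, Any]]:
    """Reset, then step through actions until the episode is done."""
    observation, info = env.reset()
    total_reward = 0.0
    for action in actions:
        observation, reward, done, info = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward, info


total_reward, final_info = run_episode(
    StubNegotiationEnv(),
    ["I offer $500", "Can you go lower?", "I offer $450", "never reached"],
)
```

The loop stops as soon as `done` is true, which mirrors step 5: once the episode ends, call `reset()` again before sending further actions.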
| ## Requirements | |
| - **OpenEnv 0.2.1** API: `reset()` → `(observation, info)`, `step(action)` → `(observation, reward, done, info)`, and `render()` for state text. | |
| - **HumanImitationEnv** needs `training/data/selfplay_states.json` (or `selfplay_states_test.json`) in the repo. If you deploy only the `deploy/` folder, include one of these data files; otherwise that tab shows an error until data is provided. | |
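Before deploying, it can help to check that one of the HumanImitationEnv data files is present. The following is a minimal sketch; `find_states_file` is a hypothetical helper, not part of the repo, and the order in which the two file names are tried is an assumption.

```python
from pathlib import Path
from typing import Optional

# File names taken from the requirement above; trying the full file
# before the test file is an assumption made for this sketch.
CANDIDATE_NAMES = ("selfplay_states.json", "selfplay_states_test.json")


def find_states_file(root: Path) -> Optional[Path]:
    """Return the first data file found under <root>/training/data, else None."""
    data_dir = root / "training" / "data"
    for name in CANDIDATE_NAMES:
        candidate = data_dir / name
        if candidate.is_file():
            return candidate
    return None
```

For example, `find_states_file(Path("."))` run from the Space root returns `None` when neither file has been uploaded, which is the situation in which the HumanImitationEnv tab reports an error.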
| ## Deploying this Space | |
| - **Space root = this folder (`deploy/`)**: Keep `app_file: hf_spaces_app.py` and use `deploy/requirements.txt`. For HumanImitationEnv you must add `training/data/` (or a sample JSON) into the Space. | |
| - **Space root = full repo**: Set `app_file: deploy/hf_spaces_app.py`, then either add the repo root to `sys.path` or install the repo as a package so that `envs.diplomacy_env`, `envs.contractor_env`, and `envs.human_imitation_env` are importable. Include `training/data/selfplay_states.json` for HumanImitationEnv. | |
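For the full-repo layout, the `sys.path` option could look like the sketch below. It assumes the app file sits one level below the repo root, in `deploy/`; `add_repo_root_to_path` is a hypothetical helper name, not something the repo is known to define.

```python
import sys
from pathlib import Path


def add_repo_root_to_path(app_file: str) -> Path:
    """Put the parent of the app file's folder on sys.path and return it.

    Assumes the layout <repo>/deploy/<app file>, so the repo root is
    two levels above the app file itself.
    """
    repo_root = Path(app_file).resolve().parent.parent
    root_str = str(repo_root)
    if root_str not in sys.path:
        # Insert at the front so the repo's `envs` package is found first.
        sys.path.insert(0, root_str)
    return repo_root


# Intended use near the top of deploy/hf_spaces_app.py, before the env imports:
#   add_repo_root_to_path(__file__)
#   import envs.diplomacy_env
```

Installing the repo as a package avoids the path manipulation entirely and is usually the more robust choice if the repo already ships packaging metadata.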
| Part of **ArbitrAgent** for the OpenEnv Hackathon (March 7–8, 2026). | |