ArbitrAgent / deploy /README.md
AbeBhatti
Initial commit: ArbitrAgent with README, agent loop, envs, demo, training
bf0a450
---
title: ArbitrAgent β€” OpenEnv 0.2.1 Environments
emoji: 🀝
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: "4.0"
app_file: hf_spaces_app.py
python_version: "3.10"
short_description: Diplomacy, Contractor, and Human Imitation negotiation envs (OpenEnv 0.2.1).
tags:
- openenv
- reinforcement-learning
- negotiation
- diplomacy
- gradio
---
# ArbitrAgent β€” OpenEnv 0.2.1 Demo
Three **OpenEnv 0.2.1**-compatible negotiation environments in one Gradio app:
| Tab | Environment | Description |
|-----|-------------|-------------|
| **DiplomacyNegotiationEnv** | Multi-party Diplomacy game state; supply centers, phases, strategic position. |
| **ContractorNegotiationEnv** | Contractor bidding: budget, multiple contractors, urgency; natural-language negotiation. |
| **HumanImitationEnv** | Real human Diplomacy game states; reward aligned with human outcomes (Phase 2 training). |
## How to use
1. Open a tab for the environment you want.
2. Click **Reset** to start an episode (initial state appears).
3. Type a **natural-language action** in the text box.
4. Click **Submit** to step the environment; **Reward / Info** and the updated state are shown.
5. Repeat until the episode is done, then click **Reset** to start again.
## Requirements
- **OpenEnv 0.2.1** API: `reset()` β†’ (observation, info), `step(action)` β†’ (observation, reward, done, info), `render()` for state text.
- **HumanImitationEnv** needs `training/data/selfplay_states.json` (or `selfplay_states_test.json`) in the repo; if you deploy only the `deploy/` folder, include a data file or that tab will show an error until data is provided.
## Deploying this Space
- **Space root = this folder (`deploy/`)**: Keep `app_file: hf_spaces_app.py` and use `deploy/requirements.txt`. For HumanImitationEnv you must add `training/data/` (or a sample JSON) into the Space.
- **Space root = full repo**: Set `app_file: deploy/hf_spaces_app.py`, add `sys.path` or install the repo as a package so `envs.diplomacy_env`, `envs.contractor_env`, and `envs.human_imitation_env` are importable; include `training/data/selfplay_states.json` for HumanImitationEnv.
Part of **ArbitrAgent** for the OpenEnv Hackathon (March 7–8 2026).