Update README.md
README.md
CHANGED
|
@@ -1,19 +1,31 @@
|
|
| 1 |
---
|
| 2 |
-
license:
|
| 3 |
-
|
| 4 |
language:
|
| 5 |
- en
|
| 6 |
pipeline_tag: text-generation
|
| 7 |
---
|
| 8 |
|
| 9 |
-
|
| 10 |
# OpenSWE: Efficient SWE Environment Synthesis at Scale
|
| 11 |
|
| 12 |
-
<
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
|
| 18 |
<p align="center"> <img src="asset/teaser.png" style="width: 93%;" id="title-icon"> </p>
|
| 19 |
</div>
|
|
@@ -75,52 +87,6 @@ This repository contains the official implementation of the OpenSWE pipeline—a
|
|
| 75 |
|
| 76 |
Training on OpenSWE alone yields large improvements over SWE-rebench across all model sizes and scaffolds; combining with SWE-rebench further improves 72B (e.g., 68.0% SWE-Agent). Data scaling analysis shows log-linear improvement with no saturation (see paper for curves). General capability evaluation shows gains on code (e.g., HumanEval +29), math (e.g., MATH-500 +12.2 for 72B), and science benchmarks without degrading factual recall.
|
| 77 |
|
| 78 |
-
## Quick Start
|
| 79 |
-
|
| 80 |
-
### 1. Data schema
|
| 81 |
-
|
| 82 |
-
Collect your dataset in the following schema:
|
| 83 |
-
|
| 84 |
-
| Field | Type | Description |
|
| 85 |
-
|-------|------|-------------|
|
| 86 |
-
| `instance_id` | `str` | Unique identifier for the sample. |
|
| 87 |
-
| `repo` | `str` | Full GitHub repo name (e.g., `psf/requests`). |
|
| 88 |
-
| `base_commit` | `str` | SHA of the commit immediately before the PR's first change. |
|
| 89 |
-
| `end_commit` | `str` | SHA of the final commit in the PR. |
|
| 90 |
-
| `problem_statement` | `str` | Issue description or problem to solve. |
|
| 91 |
-
| `patch` | `str` | Diff of changes to functional (non-test) code. |
|
| 92 |
-
| `test_patch` | `str` | Diff of changes to the test suite. |
|
| 93 |
-
| `language` | `str` | Primary programming language of the repo. |
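
As a rough illustration of the schema in the table above, the following sketch checks that each record in a JSONL dataset carries all eight fields as strings. The helper names `schema_errors` and `validate_jsonl` are hypothetical and not part of the OpenSWE codebase; the pipeline itself may validate records differently or not at all.

```python
import json

# Field names taken from the schema table; every field is typed `str` there.
REQUIRED_FIELDS = (
    "instance_id", "repo", "base_commit", "end_commit",
    "problem_statement", "patch", "test_patch", "language",
)


def schema_errors(record):
    """Return a list of problems for one record; an empty list means it fits."""
    if not isinstance(record, dict):
        return ["record is not a JSON object"]
    errors = []
    for field in REQUIRED_FIELDS:
        if field not in record:
            errors.append(f"missing field: {field}")
        elif not isinstance(record[field], str):
            actual = type(record[field]).__name__
            errors.append(f"{field} should be str, got {actual}")
    return errors


def validate_jsonl(path):
    """Map 1-based line numbers to schema problems (hypothetical helper)."""
    problems = {}
    with open(path, encoding="utf-8") as fh:
        for lineno, line in enumerate(fh, start=1):
            if not line.strip():
                continue  # tolerate blank lines between records
            errs = schema_errors(json.loads(line))
            if errs:
                problems[lineno] = errs
    return problems
```

Running `validate_jsonl` on the file that `DATA_PATH` points to before starting a build should surface missing or mistyped fields early; an empty result means every record matched the table.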
|
| 94 |
-
|
| 95 |
-
### 2. (Recommended) Prepare system
|
| 96 |
-
|
| 97 |
-
- Download all git repositories into a _repocache_ directory.
|
| 98 |
-
- Build base Docker images with `scripts/prepare_baseimg.py`.
|
| 99 |
-
|
| 100 |
-
### 3. Apply patches for SWE-bench evaluation
|
| 101 |
-
|
| 102 |
-
Before running evaluation, apply:
|
| 103 |
-
|
| 104 |
-
- **swe-agent.patch** — for [SWE-agent/SWE-agent](https://github.com/SWE-agent/SWE-agent): adds `skip_fetch` and OpenSWE instance fields.
|
| 105 |
-
- **swe-bench-fork.patch** — for [SWE-rebench/SWE-bench-fork](https://github.com/SWE-rebench/SWE-bench-fork): adds `eval_script` support and `OPENSWE_EXIT_CODE` grading.
|
| 106 |
-
|
| 107 |
-
Replace `/path/to/openswe` with your OpenSWE repo root. On conflicts use `git apply --reject` and resolve `.rej` files. Apply each patch once per repo.
|
| 108 |
-
|
| 109 |
-
### 4. Configure and run
|
| 110 |
-
|
| 111 |
-
Edit `examples/run.sh` (set `OPENSWE_ROOT`, `DATA_PATH`, `OUTPUT_DIR`, `SETUP_DIR`, `RESULT_DIR`, API keys, and `DOCKER_REPOSITORY`), then:
|
| 112 |
-
|
| 113 |
-
```bash
|
| 114 |
-
bash examples/run.sh
|
| 115 |
-
```
|
| 116 |
-
|
| 117 |
-
For multi-machine building, see [Parallel Task Execution System](./scripts/parallel).
|
| 118 |
-
|
| 119 |
-
## Troubleshooting
|
| 120 |
-
|
| 121 |
-
- **Dataset missing**: Ensure your dataset JSONL exists at the path set in `DATA_PATH`; check schema matches the table above.
|
| 122 |
-
- **Patch conflicts**: Resolve `.rej` files after `git apply --reject` for swe-agent and swe-bench-fork.
|
| 123 |
-
|
| 124 |
## Acknowledgement
|
| 125 |
|
| 126 |
OpenSWE is inspired by [SWE-Rebench](https://arxiv.org/abs/2505.20411) and [SWE-Factory](https://arxiv.org/abs/2506.10954). We thank these teams for their open-source contributions.
|
|
@@ -134,10 +100,13 @@ This project is licensed under AGPL-3.0. See [LICENSE](./LICENSE) for details.
|
|
| 134 |
If you find OpenSWE useful, please cite:
|
| 135 |
|
| 136 |
```bibtex
|
| 137 |
-
@
|
| 138 |
-
|
| 139 |
-
|
| 140 |
-
|
| 141 |
-
|
| 142 |
}
|
| 143 |
```
|
|
|
|
| 1 |
---
|
| 2 |
+
license: other
|
| 3 |
+
license_name: qwen
|
| 4 |
+
license_link: https://huggingface.co/GAIR/OpenSWE-32B/LICENSE
|
| 5 |
language:
|
| 6 |
- en
|
| 7 |
pipeline_tag: text-generation
|
| 8 |
+
library_name: transformers
|
| 9 |
+
base_model:
|
| 10 |
+
- Qwen/Qwen2.5-32B
|
| 11 |
+
tags:
|
| 12 |
+
- software
|
| 13 |
+
- environment
|
| 14 |
+
- agent
|
| 15 |
+
- code
|
| 16 |
---
|
| 17 |
|
| 18 |
# OpenSWE: Efficient SWE Environment Synthesis at Scale
|
| 19 |
|
| 20 |
+
<div align="center">
|
| 21 |
+
|
| 22 |
+
[](https://github.com/GAIR-NLP/OpenSWE/blob/main/asset/paper.pdf)
|
| 23 |
+
[](https://arxiv.org/pdf/2603.13023)
|
| 24 |
+
[](https://github.com/GAIR-NLP/OpenSWE)
|
| 25 |
+
[](https://huggingface.co/datasets/GAIR/OpenSWE)
|
| 26 |
+
[](https://huggingface.co/GAIR/OpenSWE-72B)
|
| 27 |
+
|
| 28 |
+
</div>
|
| 29 |
|
| 30 |
<p align="center"> <img src="asset/teaser.png" style="width: 93%;" id="title-icon"> </p>
|
| 31 |
</div>
|
|
|
|
| 87 |
|
| 88 |
Training on OpenSWE alone yields large improvements over SWE-rebench across all model sizes and scaffolds; combining with SWE-rebench further improves 72B (e.g., 68.0% SWE-Agent). Data scaling analysis shows log-linear improvement with no saturation (see paper for curves). General capability evaluation shows gains on code (e.g., HumanEval +29), math (e.g., MATH-500 +12.2 for 72B), and science benchmarks without degrading factual recall.
|
| 89 |
|
| 90 |
## Acknowledgement
|
| 91 |
|
| 92 |
OpenSWE is inspired by [SWE-Rebench](https://arxiv.org/abs/2505.20411) and [SWE-Factory](https://arxiv.org/abs/2506.10954). We thank these teams for their open-source contributions.
|
|
|
|
| 100 |
If you find OpenSWE useful, please cite:
|
| 101 |
|
| 102 |
```bibtex
|
| 103 |
+
@misc{fu2026davincienvopensweenvironment,
|
| 104 |
+
title={daVinci-Env: Open SWE Environment Synthesis at Scale},
|
| 105 |
+
author={Dayuan Fu and Shenyu Wu and Yunze Wu and Zerui Peng and Yaxing Huang and Jie Sun and Ji Zeng and Mohan Jiang and Lin Zhang and Yukun Li and Jiarui Hu and Liming Liu and Jinlong Hou and Pengfei Liu},
|
| 106 |
+
year={2026},
|
| 107 |
+
eprint={2603.13023},
|
| 108 |
+
archivePrefix={arXiv},
|
| 109 |
+
primaryClass={cs.SE},
|
| 110 |
+
url={https://arxiv.org/abs/2603.13023},
|
| 111 |
}
|
| 112 |
```
|