fudayuan commited on
Commit
d5898fe
·
verified ·
1 Parent(s): 9a8c126

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -59
README.md CHANGED
@@ -1,19 +1,31 @@
1
  ---
2
- license: apache-2.0
3
- license_link: https://huggingface.co/GAIR/OpenSWE-32B/blob/main/LICENSE
 
4
  language:
5
  - en
6
  pipeline_tag: text-generation
 
 
 
 
 
 
 
 
7
  ---
8
 
9
-
10
  # OpenSWE: Efficient SWE Environment Synthesis at Scale
11
 
12
- <p align="center">
13
- <img src="asset/arxiv-logo.svg" alt="arXiv" width="16" height="16"> <a href="https://arxiv.org/abs/" target="_blank">Paper</a> &nbsp; | &nbsp;
14
- <a href="https://github.com/GAIR-NLP/OpenSWE" target="_blank">Code</a> &nbsp; | &nbsp;
15
- <a href="https://github.com/GAIR-NLP/OpenSWE" target="_blank">Environments & Scripts</a>
16
- </p>
 
 
 
 
17
 
18
  <p align="center"> <img src="asset/teaser.png" style="width: 93%;" id="title-icon"> </p>
19
  </div>
@@ -75,52 +87,6 @@ This repository contains the official implementation of the OpenSWE pipeline—a
75
 
76
  Training on OpenSWE alone yields large improvements over SWE-rebench across all model sizes and scaffolds; combining with SWE-rebench further improves 72B (e.g., 68.0% SWE-Agent). Data scaling analysis shows log-linear improvement with no saturation (see paper for curves). General capability evaluation shows gains on code (e.g., HumanEval +29), math (e.g., MATH-500 +12.2 for 72B), and science benchmarks without degrading factual recall.
77
 
78
- ## Quick Start
79
-
80
- ### 1. Data schema
81
-
82
- Collect your dataset in the following schema:
83
-
84
- | Field | Type | Description |
85
- |-------|------|-------------|
86
- | `instance_id` | `str` | Unique identifier for the sample. |
87
- | `repo` | `str` | Full GitHub repo name (e.g., `psf/requests`). |
88
- | `base_commit` | `str` | SHA of the commit immediately before the PR's first change. |
89
- | `end_commit` | `str` | SHA of the final commit in the PR. |
90
- | `problem_statement` | `str` | Issue description or problem to solve. |
91
- | `patch` | `str` | Diff of changes to functional (non-test) code. |
92
- | `test_patch` | `str` | Diff of changes to the test suite. |
93
- | `language` | `str` | Primary programming language of the repo. |
94
-
95
- ### 2. (Recommended) Prepare system
96
-
97
- - Download all git repositories into a _repocache_ directory.
98
- - Build base Docker images with `scripts/prepare_baseimg.py`.
99
-
100
- ### 3. Apply patches for SWE-bench evaluation
101
-
102
- Before running evaluation, apply:
103
-
104
- - **swe-agent.patch** — for [SWE-agent/SWE-agent](https://github.com/SWE-agent/SWE-agent): adds `skip_fetch` and OpenSWE instance fields.
105
- - **swe-bench-fork.patch** — for [SWE-rebench/SWE-bench-fork](https://github.com/SWE-rebench/SWE-bench-fork): adds `eval_script` support and `OPENSWE_EXIT_CODE` grading.
106
-
107
- Replace `/path/to/openswe` with your OpenSWE repo root. On conflicts use `git apply --reject` and resolve `.rej` files. Apply each patch once per repo.
108
-
109
- ### 4. Configure and run
110
-
111
- Edit `examples/run.sh` (set `OPENSWE_ROOT`, `DATA_PATH`, `OUTPUT_DIR`, `SETUP_DIR`, `RESULT_DIR`, `DATA_PATH`, API keys, and `DOCKER_REPOSITORY`), then:
112
-
113
- ```bash
114
- bash examples/run.sh
115
- ```
116
-
117
- For multi-machine building, see [Parallel Task Execution System](./scripts/parallel).
118
-
119
- ## Troubleshooting
120
-
121
- - **Dataset missing**: Ensure your dataset JSONL exists at the path set in `DATA_PATH`; check schema matches the table above.
122
- - **Patch conflicts**: Resolve `.rej` files after `git apply --reject` for swe-agent and swe-bench-fork.
123
-
124
  ## Acknowledgement
125
 
126
  OpenSWE is inspired by [SWE-Rebench](https://arxiv.org/abs/2505.20411) and [SWE-Factory](https://arxiv.org/abs/2506.10954). We thank these teams for their open-source contributions.
@@ -134,10 +100,13 @@ This project is licensed under AGPL-3.0. See [LICENSE](./LICENSE) for details.
134
  If you find OpenSWE useful, please cite:
135
 
136
  ```bibtex
137
- @article{openswe2026,
138
- title={daVinci-Env: Open SWE Environment Synthesis at Scale},
139
- author={Dayuan Fu and Shenyu Wu and Yunze Wu and Zerui Peng and Yaxing Huang and Jie Sun and Ji Zeng and Mohan Jiang and Lin Zhang and Yukun Li and Jiarui Hu and Liming Liu and Jinlong Hou and Pengfei Liu},
140
- journal={arXiv preprint},
141
- year={2026}
 
 
 
142
  }
143
  ```
 
1
  ---
2
+ license: other
3
+ license_name: qwen
4
+ license_link: https://huggingface.co/GAIR/OpenSWE-32B/LICENSE
5
  language:
6
  - en
7
  pipeline_tag: text-generation
8
+ library_name: transformers
9
+ base_model:
10
+ - Qwen/Qwen2.5-32B
11
+ tags:
12
+ - software
13
+ - environment
14
+ - agent
15
+ - code
16
  ---
17
 
 
18
  # OpenSWE: Efficient SWE Environment Synthesis at Scale
19
 
20
+ <div align="center">
21
+
22
+ [![Paper](https://img.shields.io/badge/Paper-PDF-1f6feb.svg)](https://github.com/GAIR-NLP/OpenSWE/blob/main/asset/paper.pdf)
23
+ [![arXiv](https://img.shields.io/badge/arXiv-2601.18418-b31b1b.svg)](https://arxiv.org/pdf/2603.13023)
24
+ [![GitHub](https://img.shields.io/badge/GitHub-Repository-green)](https://github.com/GAIR-NLP/OpenSWE)
25
+ [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Dataset-blue)](https://huggingface.co/datasets/GAIR/OpenSWE)
26
+ [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue)](https://huggingface.co/GAIR/OpenSWE-72B)
27
+
28
+ </div>
29
 
30
  <p align="center"> <img src="asset/teaser.png" style="width: 93%;" id="title-icon"> </p>
31
  </div>
 
87
 
88
  Training on OpenSWE alone yields large improvements over SWE-rebench across all model sizes and scaffolds; combining with SWE-rebench further improves 72B (e.g., 68.0% SWE-Agent). Data scaling analysis shows log-linear improvement with no saturation (see paper for curves). General capability evaluation shows gains on code (e.g., HumanEval +29), math (e.g., MATH-500 +12.2 for 72B), and science benchmarks without degrading factual recall.
89
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
90
  ## Acknowledgement
91
 
92
  OpenSWE is inspired by [SWE-Rebench](https://arxiv.org/abs/2505.20411) and [SWE-Factory](https://arxiv.org/abs/2506.10954). We thank these teams for their open-source contributions.
 
100
  If you find OpenSWE useful, please cite:
101
 
102
  ```bibtex
103
+ @misc{fu2026davincienvopensweenvironment,
104
+ title={daVinci-Env: Open SWE Environment Synthesis at Scale},
105
+ author={Dayuan Fu and Shenyu Wu and Yunze Wu and Zerui Peng and Yaxing Huang and Jie Sun and Ji Zeng and Mohan Jiang and Lin Zhang and Yukun Li and Jiarui Hu and Liming Liu and Jinlong Hou and Pengfei Liu},
106
+ year={2026},
107
+ eprint={2603.13023},
108
+ archivePrefix={arXiv},
109
+ primaryClass={cs.SE},
110
+ url={https://arxiv.org/abs/2603.13023},
111
  }
112
  ```