Update README.md
Browse files
README.md
CHANGED
|
@@ -15,3 +15,42 @@ tags:
|
|
| 15 |
- swe-bench
|
| 16 |
---
|
| 17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
- swe-bench
|
| 16 |
---
|
| 17 |
|
| 18 |
+
# OpenAgent
|
| 19 |
+
|
| 20 |
+
<p align="center">
|
| 21 |
+
<img src="assets/logo.png" alt="OpenAgent logo" width="220"/>
|
| 22 |
+
</p>
|
| 23 |
+
|
| 24 |
+
<p align="center">
|
| 25 |
+
<strong>OpenAgent</strong> is an open-source effort to curate the best datasets for training agents.
|
| 26 |
+
</p>
|
| 27 |
+
|
| 28 |
+
---
|
| 29 |
+
|
| 30 |
+
**OpenAgent** is a model trained for agentic tasks such as **Terminal-Bench 2.0** and **SWE-Bench**.
|
| 31 |
+
|
| 32 |
+
It was trained on the dataset:
|
| 33 |
+
|
| 34 |
+
- [`DedeProGames/Terminal-Reasoning-SFT`](https://huggingface.co/datasets/DedeProGames/Terminal-Reasoning-SFT)
|
| 35 |
+
|
| 36 |
+
OpenAgent is designed to improve performance on terminal-based reasoning, software engineering, and tool-using workflows.
|
| 37 |
+
|
| 38 |
+
---
|
| 39 |
+
|
| 40 |
+
## Benchmarks
|
| 41 |
+
|
| 42 |
+
| Model | Harness | Terminal-Bench 2.0 | SWE-Bench Verified |
|
| 43 |
+
| --- | --- | ---: | ---: |
|
| 44 |
+
| [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) | Terminus-2 | 0.0 | 0.7 |
|
| 45 |
+
| **[OpenAgent](https://huggingface.co/DedeProGames/OpenAgent)** | Terminus-2 | **4.9** | **15.7** |
|
| 46 |
+
| [Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) | Terminus-2 | 1.9 | 5.7 |
|
| 47 |
+
| [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) | OpenHands | 10.1 | 49.2 |
|
| 48 |
+
|
| 49 |
+
---
|
| 50 |
+
|
| 51 |
+
## Hugging Face
|
| 52 |
+
|
| 53 |
+
- **Model:** [`DedeProGames/OpenAgent`](https://huggingface.co/DedeProGames/OpenAgent)
|
| 54 |
+
- **Dataset:** [`DedeProGames/Terminal-Reasoning-SFT`](https://huggingface.co/datasets/DedeProGames/Terminal-Reasoning-SFT)
|
| 55 |
+
|
| 56 |
+
OpenAgent is an open-source effort focused on building stronger agentic models through better datasets, practical training, and real benchmark evaluation.
|