Commit bba2d5c (verified) by DedeProGames
1 parent: bdf1e43

Update README.md

Files changed (1)
  1. README.md +39 -0
README.md CHANGED
@@ -15,3 +15,42 @@ tags:
  - swe-bench
  ---
 
+ # OpenAgent
+
+ <p align="center">
+ <img src="assets/logo.png" alt="OpenAgent logo" width="220"/>
+ </p>
+
+ <p align="center">
+ <strong>OpenAgent</strong> is an open-source effort to curate the best datasets for training agents.
+ </p>
+
+ ---
+
+ **OpenAgent** is a model trained for agentic tasks and evaluated on benchmarks such as **Terminal-Bench 2.0** and **SWE-Bench Verified**.
+
+ It was trained on the following dataset:
+
+ - [`DedeProGames/Terminal-Reasoning-SFT`](https://huggingface.co/datasets/DedeProGames/Terminal-Reasoning-SFT)
+
+ OpenAgent is designed to improve performance on terminal-based reasoning, software engineering, and tool-use workflows.
+
+ ---
+
+ ## Benchmarks
+
+ | Model | Harness | Terminal-Bench 2.0 | SWE-Bench Verified |
+ | --- | --- | ---: | ---: |
+ | [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) | Terminus-2 | 0.0 | 0.7 |
+ | **[OpenAgent](https://huggingface.co/DedeProGames/OpenAgent)** | Terminus-2 | **4.9** | **15.7** |
+ | [Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) | Terminus-2 | 1.9 | 5.7 |
+ | [Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) | OpenHands | 10.1 | 49.2 |
+
+ ---
+
+ ## Hugging Face
+
+ - **Model:** [`DedeProGames/OpenAgent`](https://huggingface.co/DedeProGames/OpenAgent)
+ - **Dataset:** [`DedeProGames/Terminal-Reasoning-SFT`](https://huggingface.co/datasets/DedeProGames/Terminal-Reasoning-SFT)
+
+ OpenAgent is an open-source effort focused on building stronger agentic models through better datasets, practical training, and real benchmark evaluation.
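
The Benchmarks table added in this commit mixes two harnesses (Terminus-2 and OpenHands). The Python sketch below is not part of the commit. It copies the table values and computes OpenAgent's margin over the other rows that share its harness, on the assumption that scores from different harnesses are not directly comparable. The `Score` class and `gaps_to` helper are hypothetical names introduced here for illustration, and the sketch assumes both columns use a common scale so that differences are meaningful.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Score:
    model: str
    harness: str
    terminal_bench: float      # "Terminal-Bench 2.0" column
    swe_bench_verified: float  # "SWE-Bench Verified" column


# Values copied from the Benchmarks table in the README diff.
SCORES = [
    Score("Qwen3-8B", "Terminus-2", 0.0, 0.7),
    Score("OpenAgent", "Terminus-2", 4.9, 15.7),
    Score("Qwen3-32B", "Terminus-2", 1.9, 5.7),
    Score("Qwen3-Coder-30B-A3B-Instruct", "OpenHands", 10.1, 49.2),
]


def gaps_to(reference, scores=SCORES):
    """Return {model: (terminal_bench_gap, swe_bench_gap)} relative to
    `reference`, restricted to rows that use the same harness."""
    ref = next(s for s in scores if s.model == reference)
    return {
        s.model: (
            round(ref.terminal_bench - s.terminal_bench, 1),
            round(ref.swe_bench_verified - s.swe_bench_verified, 1),
        )
        for s in scores
        if s.model != reference and s.harness == ref.harness
    }


if __name__ == "__main__":
    for model, (tb_gap, swe_gap) in gaps_to("OpenAgent").items():
        print(f"OpenAgent vs {model}: {tb_gap:+.1f} Terminal-Bench 2.0, "
              f"{swe_gap:+.1f} SWE-Bench Verified")
```

Filtering on the harness excludes the Qwen3-Coder-30B-A3B-Instruct row from the comparison, since it is the only entry evaluated with OpenHands.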