lixiaoxi45 nielsr HF Staff commited on
Commit
e052d27
·
1 Parent(s): dcb945f

Add model card and metadata (#1)

Browse files

- Add model card and metadata (11e2109a22c15e03618b3868cf32b2d4f1c713fb)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +42 -3
README.md CHANGED
@@ -1,3 +1,42 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - reasoning
7
+ - tool-use
8
+ - agent
9
+ ---
10
+
11
+ # DeepAgent: A General Reasoning Agent with Scalable Toolsets
12
+
13
+ DeepAgent is an end-to-end deep reasoning agent that performs autonomous thinking, tool discovery, and action execution within a single, coherent reasoning process. It is designed to overcome the limitations of traditional, predefined workflows by maintaining a global perspective on tasks and dynamically discovering tools.
14
+
15
+ - **Paper:** [DeepAgent: A General Reasoning Agent with Scalable Toolsets](https://huggingface.co/papers/2510.21618)
16
+ - **Repository:** [GitHub - RUC-NLPIR/DeepAgent](https://github.com/RUC-NLPIR/DeepAgent)
17
+
18
+ ## Key Features
19
+
20
+ - **Unified Agentic Reasoning**: DeepAgent operates in a single stream of thought, autonomously reasoning about the task and discoverying necessary tools.
21
+ - **Autonomous Memory Folding**: To handle long-horizon interactions, DeepAgent introduces a mechanism that compresses past interactions into structured episodic, working, and tool memories, reducing context explosion while preserving critical information.
22
+ - **ToolPO Strategy**: An end-to-end reinforcement learning strategy tailored for general tool use, utilizing LLM-simulated APIs and fine-grained credit assignment for tool invocation.
23
+
24
+ ## Performance
25
+
26
+ Extensive experiments on eight benchmarks, including general tool-use tasks (ToolBench, API-Bank, TMDB, Spotify, ToolHop) and downstream applications (ALFWorld, WebShop, GAIA, HLE), demonstrate that DeepAgent consistently outperforms baselines across both labeled-tool and open-set tool retrieval scenarios.
27
+
28
+ ## Citation
29
+
30
+ If you find this work helpful, please cite the paper:
31
+
32
+ ```bibtex
33
+ @misc{deepagent,
34
+ title={DeepAgent: A General Reasoning Agent with Scalable Toolsets},
35
+ author={Xiaoxi Li and Wenxiang Jiao and Jiarui Jin and Guanting Dong and Jiajie Jin and Yinuo Wang and Hao Wang and Yutao Zhu and Ji-Rong Wen and Yuan Lu and Zhicheng Dou},
36
+ year={2025},
37
+ eprint={2510.21618},
38
+ archivePrefix={arXiv},
39
+ primaryClass={cs.AI},
40
+ url={https://arxiv.org/abs/2510.21618},
41
+ }
42
+ ```