xzxuan commited on
Commit
25d036c
·
verified ·
1 Parent(s): 7aff8c3
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ base_model:
6
+ - Qwen/Qwen3-4B
7
+ metrics:
8
+ - accuracy
9
+ model-index:
10
+ - name: WideSeek-R1-4B
11
+ results:
12
+ - task:
13
+ type: WideSearch
14
+ dataset:
15
+ type: WideSearch
16
+ name: WideSearch
17
+ metrics:
18
+ - type: accuracy
19
+ value: 40.0
20
+ ---
21
+
22
+
23
+ # WideSeek-R1-4B
24
+
25
+ ## Overview
26
+
27
+ ![image](fig/scaling.png)
28
+
29
+ Recent advancements in Large Language Models (LLMs) have largely focused on depth scaling, where a single agent solves long-horizon problems with multi-turn reasoning and tool use. However, as tasks grow broader, the key bottleneck shifts from individual competence to organizational capability.
30
+
31
+ In this work, we explore a complementary dimension of width scaling with multi-agent systems to address broad information seeking. Existing multi-agent systems often rely on hand-crafted workflows and turn-taking interactions that fail to parallelize work effectively. To bridge this gap, we propose WideSeek-R1, a lead-agent-subagent framework trained via multi-agent reinforcement learning (MARL) to synergize scalable orchestration and parallel execution. By utilizing a shared LLM with isolated contexts and specialized tools, WideSeek-R1 jointly optimizes the lead agent and parallel subagents on a curated dataset of 20k broad information-seeking tasks.
32
+
33
+ Extensive experiments show that WideSeek-R1-4B achieves an item F1 score of 40.0\% on the WideSearch benchmark, which is comparable to the performance of single-agent DeepSeek-R1-671B. Furthermore, WideSeek-R1-4B exhibits consistent performance gains as the number of parallel subagents increases, highlighting the effectiveness of width scaling.
34
+
35
+ For more details, see our [project page](https://thu-nics.github.io/WideSeek-R1/)
36
+
37
+ ## Citation
38
+
39
+ If you use this model in your research, please cite our paper:
40
+
41
+ ```bibtex
42
+ To Be Done
43
+ ```