Safetensors
English
qwen2
File size: 1,707 Bytes
a1c078e
 
96376ff
 
 
 
 
 
 
 
 
a1c078e
ba607ac
96376ff
 
 
 
 
 
 
 
 
 
 
 
 
0702535
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
---
license: apache-2.0
datasets:
- PeterJinGo/nq_hotpotqa_train
language:
- en
metrics:
- exact_match
- f1
base_model:
- Qwen/Qwen2.5-3B
---
# SE-Search-3B

*Self-Evolving Search Agent via Memory and Dense Reward*

## ๐Ÿ”ฅ News

- Paper available on \[[ArXiv](https://arxiv.org/pdf/2603.03293)\]

## ๐Ÿ“– Overview

SE-Search is a Self-Evolving Search agent that improves online search behavior through a **Think-Search-Memorize** strategy:

- **Memory Purification**: Retains salient evidence while filtering irrelevant content
- **Atomic Query**: Promotes shorter and more diverse queries, improving evidence acquisition
- **Dense Rewards**: Provides fine-grained feedback that speeds up training and improves performance

## ๐Ÿ™ Acknowledgements

Built upon [VeRL](https://github.com/volcengine/verl), [Search-R1](https://github.com/PeterGriffinJin/Search-R1), and [AutoRefine](https://github.com/syr-cn/AutoRefine). Thanks to the authors for their valuable work.

## ๐ŸŽ“ Citations

```latex
@misc{li2026sesearch,
      title={SE-Search: Self-Evolving Search Agent via Memory and Dense Reward}, 
      author={Jian Li and Yizhang Jin and Dongqi Liu and Hang Ding and Jiafu Wu and Dongsheng Chen and Yunhang Shen and Yulei Qin and Ying Tai and Chengjie Wang and Xiaotong Yuan and Yabiao Wang},
      year={2026},
      eprint={2603.03293},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2603.03293}, 
}

@article{li2025survey,
  title={A Survey on AI Search with Large Language Models},
  author={Li, Jian and Li, Xiaoxi and Zheng, Yan and Jin, Yizhang and Wang, Shuo and Wu, Jiafu and Wang, Yabiao and Wang, Chengjie and Yuan, Xiaotong},
  year={2025}
}

```