nielsr HF Staff committed on
Commit
378d866
·
verified ·
1 Parent(s): 41ce8e8

Add model card for SimpleDeepSearcher

This PR adds a comprehensive model card for SimpleDeepSearcher, linking it to the paper [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834).

It includes the `license`, `pipeline_tag`, and `library_name` metadata, making the model more discoverable and clearly indicating its compatibility with the Hugging Face `transformers` library. It also provides links to the official GitHub repository, the associated dataset, and other Hugging Face model checkpoints, along with a detailed overview and key contributions from the paper.
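As a quick sanity check of the metadata, here is a minimal sketch (not part of this PR) that reads the three fields back from the front matter of the card. It assumes flat `key: value` pairs only and is not a full YAML parser; the helper name `parse_front_matter` and the `CARD` sample string are hypothetical and used for illustration.

```python
def parse_front_matter(card_text: str) -> dict[str, str]:
    """Parse flat `key: value` pairs from a leading `---` ... `---` block.

    Assumption: no nesting, lists, or quoted values. A real YAML parser
    should be used for anything more complex.
    """
    lines = card_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # the card does not start with a front matter block

    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return metadata  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep and key.strip():
            metadata[key.strip()] = value.strip()
    return {}  # no closing delimiter: treat as having no front matter


# Hypothetical sample mirroring the metadata added in this PR.
CARD = """---
license: mit
pipeline_tag: text-generation
library_name: transformers
---

# SimpleDeepSearcher
"""

metadata = parse_front_matter(CARD)
print(metadata)
```

A check like this can run before merging to confirm that the fields the Hub uses for discovery are present.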

Please review and merge this PR.

Files changed (1)
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
+ ---
+ license: mit
+ pipeline_tag: text-generation
+ library_name: transformers
+ ---
+
+ # SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
+
+ This repository contains the models and code for **SimpleDeepSearcher**, a lightweight yet effective framework for enhancing Large Language Models (LLMs) in complex deep search scenarios. As described in the paper [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834), it bridges the gap in high-quality training trajectories by synthesizing realistic user interactions in live web search environments. This approach enables supervised fine-tuning (SFT) with only a small amount of curated data, achieving strong performance with significantly reduced computational cost and development complexity.
+
+ <p align="center">
+ <img src="https://github.com/RUCAIBox/SimpleDeepSearcher/raw/main/assets/simplelog.jpg" alt="Example Image" width="550"/>
+ </p>
+
+ ## Overview
+ SimpleDeepSearcher is a lightweight yet effective framework for enhancing large language models (LLMs) in deep search tasks. Unlike traditional RAG or RL-based methods, SimpleDeepSearcher strategically synthesizes high-quality reasoning trajectories in real-world web environments, enabling supervised fine-tuning (SFT) with only a small amount of curated data. This results in strong performance with significantly reduced computational cost and development complexity.
+
+ ## Key Contributions
+
+ - We propose a real web-based data synthesis framework that simulates realistic user search behaviors, generating multi-turn reasoning and search trajectories.
+ - We design a multi-criteria data curation strategy that jointly optimizes both input question selection and output response filtering through orthogonal filtering dimensions.
+ - Experimental results demonstrate that SFT on only 871 samples enables SimpleDeepSearcher to outperform strong baselines (especially RL-based baselines) on both in-domain and out-of-domain benchmarks.
+
+ ## Overall Performance
+
+ <p align="center">
+ <img src="https://github.com/RUCAIBox/SimpleDeepSearcher/raw/main/assets/overall_performance.png" alt="Overall Performance" width="800"/>
+ </p>
+
+ ## Associated Resources
+
+ * **Paper**: [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834)
+ * **Code**: [GitHub Repository](https://github.com/RUCAIBox/SimpleDeepSearcher)
+ * **Dataset**: [0.8k-data-SimpleDeepSearcher](https://huggingface.co/datasets/RUC-AIBOX/0.8k-data-SimpleDeepSearcher)
+ * **Other Models**:
+     * [SimpleDeepSearcher-Qwen-7B](https://huggingface.co/RUC-AIBOX/Qwen-7B-SimpleDeepSearcher)
+     * [SimpleDeepSearcher-Qwen-32B](https://huggingface.co/RUC-AIBOX/Qwen-32B-SimpleDeepSearcher)
+     * [SimpleDeepSearcher-Dpsk-Distilled-Qwen-32B](https://huggingface.co/RUC-AIBOX/Dpsk-Distilled-Qwen-32B-SimpleDeepSearcher)
+     * [SimpleDeepSearcher-QwQ-32B](https://huggingface.co/RUC-AIBOX/QwQ-32B-SimpleDeepSearcher)
+
+ For detailed installation, data construction, training, and inference instructions, please refer to the [GitHub repository](https://github.com/RUCAIBox/SimpleDeepSearcher).
+
+ ## Citation
+ Please cite our report if it is helpful for your research.
+
+ ```bibtex
+ @article{sun2025simpledeepsearcher,
+ title={SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis},
+ author={Sun, Shuang and Song, Huatong and Wang, Yuhao and Ren, Ruiyang and Jiang, Jinhao and Zhang, Junjie and Bai, Fei and Deng, Jia and Zhao, Wayne Xin and Liu, Zheng and others},
+ journal={arXiv preprint arXiv:2505.16834},
+ year={2025}
+ }
+ ```
+
+ ## License
+ This project is released under the [MIT License](https://github.com/RUCAIBox/SimpleDeepSearcher/blob/main/LICENSE).