Improve model card: Add description, paper/code links, relevant tags, license, and pipeline tag

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +60 -7
README.md CHANGED
@@ -2,29 +2,65 @@
2
  library_name: transformers
3
  tags:
4
  - generated_from_trainer
 
 
 
5
  model-index:
6
  - name: Online-Searcher-QwQ-32B
7
  results: []
 
 
8
  ---
9
 
10
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
- should probably proofread and complete it, then remove this comment. -->
12
 
13
- # Online-Searcher-QwQ-32B
14
 
15
- This model was trained from scratch on an unknown dataset.
16
 
17
  ## Model description
18
 
19
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
 
21
  ## Intended uses & limitations
22
 
23
- More information needed
 
 
24
 
25
  ## Training and evaluation data
26
 
27
- More information needed
 
 
 
 
28
 
29
  ## Training procedure
30
 
@@ -45,3 +81,20 @@ The following hyperparameters were used during training:
45
  - Pytorch 2.5.1+cu124
46
  - Datasets 2.19.0
47
  - Tokenizers 0.20.3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  library_name: transformers
3
  tags:
4
  - generated_from_trainer
5
+ - deep-search
6
+ - web-agent
7
+ - rag
8
  model-index:
9
  - name: Online-Searcher-QwQ-32B
10
  results: []
11
+ license: mit
12
+ pipeline_tag: text-generation
13
  ---
14
 
15
+ # Online-Searcher-QwQ-32B (SimpleDeepSearcher)
 
16
 
17
+ This model, `Online-Searcher-QwQ-32B`, is part of the `SimpleDeepSearcher` family, a lightweight yet effective framework for enhancing large language models (LLMs) in deep search tasks. It was presented in the paper [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834).
18
 
19
+ **Code:** [https://github.com/RUCAIBox/SimpleDeepSearcher](https://github.com/RUCAIBox/SimpleDeepSearcher)
20
 
21
  ## Model description
22
 
23
+ `SimpleDeepSearcher` addresses critical limitations in existing retrieval-augmented generation (RAG) systems for complex deep search scenarios. It tackles the lack of high-quality training trajectories and the distributional mismatches in simulated environments, as well as prohibitive computational costs.
24
+
25
+ This framework strategically engineers data by synthesizing high-quality training data, simulating realistic user interactions in live web search environments. This is coupled with a multi-criteria curation strategy that optimizes the diversity and quality of both input and output. Experiments on five benchmarks demonstrate that supervised fine-tuning (SFT) on only 871 curated samples yields significant improvements over RL-based baselines.
26
+
27
+ `Online-Searcher-QwQ-32B` is a 32B model, likely based on a Qwen2 backbone as indicated in its `config.json`, fine-tuned within this `SimpleDeepSearcher` framework. Our work establishes SFT as a viable pathway by systematically addressing the data-scarce bottleneck, offering practical insights for efficient deep search systems.
28
+
29
+ <p align="center">
30
+ <img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/simplelog.jpg" alt="SimpleDeepSearcher Logo" width="550"/>
31
+ </p>
32
+
33
+ ### Key Contributions
34
+
35
+ - A real web-based data synthesis framework that simulates realistic user search behaviors, generating multi-turn reasoning and search trajectories.
36
+ - A multi-criteria data curation strategy that jointly optimizes both input question selection and output response filtering through orthogonal filtering dimensions.
37
+ - Experimental results demonstrate that SFT on only 871 samples enables SimpleDeepSearcher to outperform strong baselines (especially RL-based baselines) on both in-domain and out-of-domain benchmarks.
38
+
39
+ ### Overall Performance
40
+
41
+ <p align="center">
42
+ <img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/overall_performance.png" alt="Overall Performance" width="800"/>
43
+ </p>
44
+
45
+ ### Framework Overview
46
+
47
+ <p align="center">
48
+ <img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/pipeline.png" alt="Framework Overview" width="800"/>
49
+ </p>
50
 
51
  ## Intended uses & limitations
52
 
53
+ This model is primarily intended for research and development in areas related to deep information seeking, web-powered reasoning, retrieval-augmented generation (RAG) systems, and multi-step complex reasoning tasks. It is designed to be a lightweight yet effective solution for scenarios requiring iterative information retrieval from the web.
54
+
55
+ **Limitations**: While `SimpleDeepSearcher` demonstrates strong performance with high data efficiency using a small curated dataset, its effectiveness in highly dynamic or adversarial web environments may require further evaluation. The model's performance relies on the quality and diversity of its synthesized training trajectories.
56
 
57
  ## Training and evaluation data
58
 
59
+ The model was trained on a high-quality dataset of 871 curated samples. This training data was synthesized by simulating realistic user interactions within live web search environments. A multi-criteria curation strategy was applied to optimize both input question selection and output response filtering, ensuring data diversity and quality across various domains.
60
+
61
+ ## Sample Usage
62
+
63
+ For detailed instructions on how to use `SimpleDeepSearcher` for inference or training, please refer to the [Quick Start section in the official GitHub repository](https://github.com/RUCAIBox/SimpleDeepSearcher#%EF%B8%8F-quick-start). The repository provides scripts for environment setup, data construction, and inference generation.
64
 
65
  ## Training procedure
66
 
 
81
  - Pytorch 2.5.1+cu124
82
  - Datasets 2.19.0
83
  - Tokenizers 0.20.3
84
+
85
+ ## Citation
86
+
87
+ If you find our work useful, please cite our paper:
88
+
89
+ ```bibtex
90
+ @article{sun2025simpledeepsearcher,
91
+ title={SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis},
92
+ author={Sun, Shuang and Song, Huatong and Wang, Yuhao and Ren, Ruiyang and Jiang, Jinhao and Zhang, Junjie and Bai, Fei and Deng, Jia and Zhao, Wayne Xin and Liu, Zheng and others},
93
+ journal={arXiv preprint arXiv:2505.16834},
94
+ year={2025}
95
+ }
96
+ ```
97
+
98
+ ## License
99
+
100
+ This project is released under the [MIT License](https://github.com/RUCAIBox/SimpleDeepSearcher/blob/main/LICENSE).