Improve model card: Add description, paper/code links, relevant tags, license, and pipeline tag
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -2,29 +2,65 @@
|
|
| 2 |
library_name: transformers
|
| 3 |
tags:
|
| 4 |
- generated_from_trainer
|
|
|
|
|
|
|
|
|
|
| 5 |
model-index:
|
| 6 |
- name: Online-Searcher-QwQ-32B
|
| 7 |
results: []
|
|
|
|
|
|
|
| 8 |
---
|
| 9 |
|
| 10 |
-
|
| 11 |
-
should probably proofread and complete it, then remove this comment. -->
|
| 12 |
|
| 13 |
-
|
| 14 |
|
| 15 |
-
|
| 16 |
|
| 17 |
## Model description
|
| 18 |
|
| 19 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
|
| 21 |
## Intended uses & limitations
|
| 22 |
|
| 23 |
-
|
|
|
|
|
|
|
| 24 |
|
| 25 |
## Training and evaluation data
|
| 26 |
|
| 27 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 28 |
|
| 29 |
## Training procedure
|
| 30 |
|
|
@@ -45,3 +81,20 @@ The following hyperparameters were used during training:
|
|
| 45 |
- Pytorch 2.5.1+cu124
|
| 46 |
- Datasets 2.19.0
|
| 47 |
- Tokenizers 0.20.3
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
library_name: transformers
|
| 3 |
tags:
|
| 4 |
- generated_from_trainer
|
| 5 |
+
- deep-search
|
| 6 |
+
- web-agent
|
| 7 |
+
- rag
|
| 8 |
model-index:
|
| 9 |
- name: Online-Searcher-QwQ-32B
|
| 10 |
results: []
|
| 11 |
+
license: mit
|
| 12 |
+
pipeline_tag: text-generation
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# Online-Searcher-QwQ-32B (SimpleDeepSearcher)
|
|
|
|
| 16 |
|
| 17 |
+
This model, `Online-Searcher-QwQ-32B`, is part of the `SimpleDeepSearcher` family, a lightweight yet effective framework for enhancing large language models (LLMs) in deep search tasks. It was presented in the paper [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834).
|
| 18 |
|
| 19 |
+
**Code:** [https://github.com/RUCAIBox/SimpleDeepSearcher](https://github.com/RUCAIBox/SimpleDeepSearcher)
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
| 23 |
+
`SimpleDeepSearcher` addresses critical limitations in existing retrieval-augmented generation (RAG) systems for complex deep search scenarios. It tackles the lack of high-quality training trajectories and the distributional mismatches in simulated environments, as well as prohibitive computational costs.
|
| 24 |
+
|
| 25 |
+
This framework strategically engineers data by synthesizing high-quality training data, simulating realistic user interactions in live web search environments. This is coupled with a multi-criteria curation strategy that optimizes the diversity and quality of both input and output. Experiments on five benchmarks demonstrate that supervised fine-tuning (SFT) on only 871 curated samples yields significant improvements over RL-based baselines.
|
| 26 |
+
|
| 27 |
+
`Online-Searcher-QwQ-32B` is a 32B model, likely based on a Qwen2 backbone as indicated in its `config.json`, fine-tuned within this `SimpleDeepSearcher` framework. Our work establishes SFT as a viable pathway by systematically addressing the data-scarce bottleneck, offering practical insights for efficient deep search systems.
|
| 28 |
+
|
| 29 |
+
<p align="center">
|
| 30 |
+
<img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/simplelog.jpg" alt="SimpleDeepSearcher Logo" width="550"/>
|
| 31 |
+
</p>
|
| 32 |
+
|
| 33 |
+
### Key Contributions
|
| 34 |
+
|
| 35 |
+
- A real web-based data synthesis framework that simulates realistic user search behaviors, generating multi-turn reasoning and search trajectories.
|
| 36 |
+
- A multi-criteria data curation strategy that jointly optimizes both input question selection and output response filtering through orthogonal filtering dimensions.
|
| 37 |
+
- Experimental results demonstrate that SFT on only 871 samples enables SimpleDeepSearcher to outperform strong baselines (especially RL-based baselines) on both in-domain and out-of-domain benchmarks.
|
| 38 |
+
|
| 39 |
+
### Overall Performance
|
| 40 |
+
|
| 41 |
+
<p align="center">
|
| 42 |
+
<img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/overall_performance.png" alt="Overall Performance" width="800"/>
|
| 43 |
+
</p>
|
| 44 |
+
|
| 45 |
+
### Framework Overview
|
| 46 |
+
|
| 47 |
+
<p align="center">
|
| 48 |
+
<img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/pipeline.png" alt="Framework Overview" width="800"/>
|
| 49 |
+
</p>
|
| 50 |
|
| 51 |
## Intended uses & limitations
|
| 52 |
|
| 53 |
+
This model is primarily intended for research and development in areas related to deep information seeking, web-powered reasoning, retrieval-augmented generation (RAG) systems, and multi-step complex reasoning tasks. It is designed to be a lightweight yet effective solution for scenarios requiring iterative information retrieval from the web.
|
| 54 |
+
|
| 55 |
+
**Limitations**: While `SimpleDeepSearcher` demonstrates strong performance with high data efficiency using a small curated dataset, its effectiveness in highly dynamic or adversarial web environments may require further evaluation. The model's performance relies on the quality and diversity of its synthesized training trajectories.
|
| 56 |
|
| 57 |
## Training and evaluation data
|
| 58 |
|
| 59 |
+
The model was trained on a high-quality dataset of 871 curated samples. This training data was synthesized by simulating realistic user interactions within live web search environments. A multi-criteria curation strategy was applied to optimize both input question selection and output response filtering, ensuring data diversity and quality across various domains.
|
| 60 |
+
|
| 61 |
+
## Sample Usage
|
| 62 |
+
|
| 63 |
+
For detailed instructions on how to use `SimpleDeepSearcher` for inference or training, please refer to the [Quick Start section in the official GitHub repository](https://github.com/RUCAIBox/SimpleDeepSearcher#%EF%B8%8F-quick-start). The repository provides scripts for environment setup, data construction, and inference generation.
|
| 64 |
|
| 65 |
## Training procedure
|
| 66 |
|
|
|
|
| 81 |
- Pytorch 2.5.1+cu124
|
| 82 |
- Datasets 2.19.0
|
| 83 |
- Tokenizers 0.20.3
|
| 84 |
+
|
| 85 |
+
## Citation
|
| 86 |
+
|
| 87 |
+
If you find our work useful, please cite our paper:
|
| 88 |
+
|
| 89 |
+
```bibtex
|
| 90 |
+
@article{sun2025simpledeepsearcher,
|
| 91 |
+
title={SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis},
|
| 92 |
+
author={Sun, Shuang and Song, Huatong and Wang, Yuhao and Ren, Ruiyang and Jiang, Jinhao and Zhang, Junjie and Bai, Fei and Deng, Jia and Zhao, Wayne Xin and Liu, Zheng and others},
|
| 93 |
+
journal={arXiv preprint arXiv:2505.16834},
|
| 94 |
+
year={2025}
|
| 95 |
+
}
|
| 96 |
+
```
|
| 97 |
+
|
| 98 |
+
## License
|
| 99 |
+
|
| 100 |
+
This project is released under the [MIT License](https://github.com/RUCAIBox/SimpleDeepSearcher/blob/main/LICENSE).
|