Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
conversational
text-generation-inference
Instructions to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="RUC-AIBOX/QwQ-32B-SimpleDeepSearcher")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
```

```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("RUC-AIBOX/QwQ-32B-SimpleDeepSearcher")
model = AutoModelForCausalLM.from_pretrained("RUC-AIBOX/QwQ-32B-SimpleDeepSearcher")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with vLLM:
Install from pip and serve model
```bash
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
Use Docker
docker model run hf.co/RUC-AIBOX/QwQ-32B-SimpleDeepSearcher
- SGLang
How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with SGLang:
Install from pip and serve model
```bash
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
Use Docker images
```bash
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
- Docker Model Runner
How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with Docker Model Runner:
docker model run hf.co/RUC-AIBOX/QwQ-32B-SimpleDeepSearcher
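The curl calls above can also be issued from Python. The following is a minimal sketch using only the standard library, assuming a server is already running locally and exposes the OpenAI-compatible `/v1/chat/completions` route shown above. The helper names `build_chat_request` and `ask`, the default port, and the timeout are illustrative assumptions, not part of the model repository.

```python
import json
import urllib.request

MODEL_ID = "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher"


def build_chat_request(question, base_url="http://localhost:8000", model=MODEL_ID):
    """Build a POST request for the OpenAI-compatible chat completions route.

    base_url is an assumption: port 8000 matches the vLLM example above,
    while the SGLang example listens on port 30000.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": question}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(question, base_url="http://localhost:8000", timeout=600):
    """Send the question to a running server and return the reply text."""
    request = build_chat_request(question, base_url=base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # OpenAI-style responses carry the reply under choices[0].message.content.
    return body["choices"][0]["message"]["content"]
```

With a server started by one of the commands above, `print(ask("What is the capital of France?"))` would send the same request as the curl example. For the SGLang server, pass `base_url="http://localhost:30000"`.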
Improve model card: Add description, paper/code links, relevant tags, license, and pipeline tag
#1
by nielsr HF Staff - opened
README.md
CHANGED
```diff
@@ -2,29 +2,65 @@
 library_name: transformers
 tags:
 - generated_from_trainer
+- deep-search
+- web-agent
+- rag
 model-index:
 - name: Online-Searcher-QwQ-32B
   results: []
+license: mit
+pipeline_tag: text-generation
 ---

-
-should probably proofread and complete it, then remove this comment. -->
+# Online-Searcher-QwQ-32B (SimpleDeepSearcher)

-
+This model, `Online-Searcher-QwQ-32B`, is part of the `SimpleDeepSearcher` family, a lightweight yet effective framework for enhancing large language models (LLMs) in deep search tasks. It was presented in the paper [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834).

-
+**Code:** [https://github.com/RUCAIBox/SimpleDeepSearcher](https://github.com/RUCAIBox/SimpleDeepSearcher)

 ## Model description

-
+`SimpleDeepSearcher` addresses critical limitations in existing retrieval-augmented generation (RAG) systems for complex deep search scenarios. It tackles the lack of high-quality training trajectories and the distributional mismatches in simulated environments, as well as prohibitive computational costs.
+
+This framework strategically engineers data by synthesizing high-quality training data, simulating realistic user interactions in live web search environments. This is coupled with a multi-criteria curation strategy that optimizes the diversity and quality of both input and output. Experiments on five benchmarks demonstrate that supervised fine-tuning (SFT) on only 871 curated samples yields significant improvements over RL-based baselines.
+
+`Online-Searcher-QwQ-32B` is a 32B model, likely based on a Qwen2 backbone as indicated in its `config.json`, fine-tuned within this `SimpleDeepSearcher` framework. Our work establishes SFT as a viable pathway by systematically addressing the data-scarce bottleneck, offering practical insights for efficient deep search systems.
+
+<p align="center">
+<img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/simplelog.jpg" alt="SimpleDeepSearcher Logo" width="550"/>
+</p>
+
+### Key Contributions
+
+- A real web-based data synthesis framework that simulates realistic user search behaviors, generating multi-turn reasoning and search trajectories.
+- A multi-criteria data curation strategy that jointly optimizes both input question selection and output response filtering through orthogonal filtering dimensions.
+- Experimental results demonstrate that SFT on only 871 samples enables SimpleDeepSearcher to outperform strong baselines (especially RL-based baselines) on both in-domain and out-of-domain benchmarks.
+
+### Overall Performance
+
+<p align="center">
+<img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/overall_performance.png" alt="Overall Performance" width="800"/>
+</p>
+
+### Framework Overview
+
+<p align="center">
+<img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/pipeline.png" alt="Framework Overview" width="800"/>
+</p>

 ## Intended uses & limitations

-
+This model is primarily intended for research and development in areas related to deep information seeking, web-powered reasoning, retrieval-augmented generation (RAG) systems, and multi-step complex reasoning tasks. It is designed to be a lightweight yet effective solution for scenarios requiring iterative information retrieval from the web.
+
+**Limitations**: While `SimpleDeepSearcher` demonstrates strong performance with high data efficiency using a small curated dataset, its effectiveness in highly dynamic or adversarial web environments may require further evaluation. The model's performance relies on the quality and diversity of its synthesized training trajectories.

 ## Training and evaluation data

-
+The model was trained on a high-quality dataset of 871 curated samples. This training data was synthesized by simulating realistic user interactions within live web search environments. A multi-criteria curation strategy was applied to optimize both input question selection and output response filtering, ensuring data diversity and quality across various domains.
+
+## Sample Usage
+
+For detailed instructions on how to use `SimpleDeepSearcher` for inference or training, please refer to the [Quick Start section in the official GitHub repository](https://github.com/RUCAIBox/SimpleDeepSearcher#%EF%B8%8F-quick-start). The repository provides scripts for environment setup, data construction, and inference generation.

 ## Training procedure

@@ -45,3 +81,20 @@ The following hyperparameters were used during training:
 - Pytorch 2.5.1+cu124
 - Datasets 2.19.0
 - Tokenizers 0.20.3
+
+## Citation
+
+If you find our work useful, please cite our paper:
+
+```bibtex
+@article{sun2025simpledeepsearcher,
+title={SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis},
+author={Sun, Shuang and Song, Huatong and Wang, Yuhao and Ren, Ruiyang and Jiang, Jinhao and Zhang, Junjie and Bai, Fei and Deng, Jia and Zhao, Wayne Xin and Liu, Zheng and others},
+journal={arXiv preprint arXiv:2505.16834},
+year={2025}
+}
+```
+
+## License
+
+This project is released under the [MIT License](https://github.com/RUCAIBox/SimpleDeepSearcher/blob/main/LICENSE).
```