Instructions to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="RUC-AIBOX/QwQ-32B-SimpleDeepSearcher")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("RUC-AIBOX/QwQ-32B-SimpleDeepSearcher")
model = AutoModelForCausalLM.from_pretrained("RUC-AIBOX/QwQ-32B-SimpleDeepSearcher")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/RUC-AIBOX/QwQ-32B-SimpleDeepSearcher

SGLang

How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "RUC-AIBOX/QwQ-32B-SimpleDeepSearcher",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use RUC-AIBOX/QwQ-32B-SimpleDeepSearcher with Docker Model Runner:
```
docker model run hf.co/RUC-AIBOX/QwQ-32B-SimpleDeepSearcher
```

Improve model card: Add description, paper/code links, relevant tags, license, and pipeline tag

by nielsr HF Staff - opened Oct 9, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+60

-7

Files changed (1) hide show

README.md +60 -7

README.md CHANGED Viewed

@@ -2,29 +2,65 @@
 library_name: transformers
 tags:
 - generated_from_trainer
 model-index:
 - name: Online-Searcher-QwQ-32B
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# Online-Searcher-QwQ-32B
-This model was trained from scratch on an unknown dataset.
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
@@ -45,3 +81,20 @@ The following hyperparameters were used during training:
 - Pytorch 2.5.1+cu124
 - Datasets 2.19.0
 - Tokenizers 0.20.3

 library_name: transformers
 tags:
 - generated_from_trainer
+- deep-search
+- web-agent
+- rag
 model-index:
 - name: Online-Searcher-QwQ-32B
   results: []
+license: mit
+pipeline_tag: text-generation
 ---
+# Online-Searcher-QwQ-32B (SimpleDeepSearcher)
+This model, `Online-Searcher-QwQ-32B`, is part of the `SimpleDeepSearcher` family, a lightweight yet effective framework for enhancing large language models (LLMs) in deep search tasks. It was presented in the paper [SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis](https://huggingface.co/papers/2505.16834).
+**Code:** [https://github.com/RUCAIBox/SimpleDeepSearcher](https://github.com/RUCAIBox/SimpleDeepSearcher)
 ## Model description
+`SimpleDeepSearcher` addresses critical limitations in existing retrieval-augmented generation (RAG) systems for complex deep search scenarios. It tackles the lack of high-quality training trajectories and the distributional mismatches in simulated environments, as well as prohibitive computational costs.
+This framework strategically engineers data by synthesizing high-quality training data, simulating realistic user interactions in live web search environments. This is coupled with a multi-criteria curation strategy that optimizes the diversity and quality of both input and output. Experiments on five benchmarks demonstrate that supervised fine-tuning (SFT) on only 871 curated samples yields significant improvements over RL-based baselines.
+`Online-Searcher-QwQ-32B` is a 32B model, likely based on a Qwen2 backbone as indicated in its `config.json`, fine-tuned within this `SimpleDeepSearcher` framework. Our work establishes SFT as a viable pathway by systematically addressing the data-scarce bottleneck, offering practical insights for efficient deep search systems.
+<p align="center">
+  <img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/simplelog.jpg" alt="SimpleDeepSearcher Logo" width="550"/>
+</p>
+### Key Contributions
+-   A real web-based data synthesis framework that simulates realistic user search behaviors, generating multi-turn reasoning and search trajectories.
+-   A multi-criteria data curation strategy that jointly optimizes both input question selection and output response filtering through orthogonal filtering dimensions.
+-   Experimental results demonstrate that SFT on only 871 samples enables SimpleDeepSearcher to outperform strong baselines (especially RL-based baselines) on both in-domain and out-of-domain benchmarks.
+### Overall Performance
+<p align="center">
+  <img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/overall_performance.png" alt="Overall Performance" width="800"/>
+</p>
+### Framework Overview
+<p align="center">
+  <img src="https://raw.githubusercontent.com/RUCAIBox/SimpleDeepSearcher/main/assets/pipeline.png" alt="Framework Overview" width="800"/>
+</p>
 ## Intended uses & limitations
+This model is primarily intended for research and development in areas related to deep information seeking, web-powered reasoning, retrieval-augmented generation (RAG) systems, and multi-step complex reasoning tasks. It is designed to be a lightweight yet effective solution for scenarios requiring iterative information retrieval from the web.
+**Limitations**: While `SimpleDeepSearcher` demonstrates strong performance with high data efficiency using a small curated dataset, its effectiveness in highly dynamic or adversarial web environments may require further evaluation. The model's performance relies on the quality and diversity of its synthesized training trajectories.
 ## Training and evaluation data
+The model was trained on a high-quality dataset of 871 curated samples. This training data was synthesized by simulating realistic user interactions within live web search environments. A multi-criteria curation strategy was applied to optimize both input question selection and output response filtering, ensuring data diversity and quality across various domains.
+## Sample Usage
+For detailed instructions on how to use `SimpleDeepSearcher` for inference or training, please refer to the [Quick Start section in the official GitHub repository](https://github.com/RUCAIBox/SimpleDeepSearcher#%EF%B8%8F-quick-start). The repository provides scripts for environment setup, data construction, and inference generation.
 ## Training procedure
 - Pytorch 2.5.1+cu124
 - Datasets 2.19.0
 - Tokenizers 0.20.3
+## Citation
+If you find our work useful, please cite our paper:
+```bibtex
+@article{sun2025simpledeepsearcher,
+  title={SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis},
+  author={Sun, Shuang and Song, Huatong and Wang, Yuhao and Ren, Ruiyang and Jiang, Jinhao and Zhang, Junjie and Bai, Fei and Deng, Jia and Zhao, Wayne Xin and Liu, Zheng and others},
+  journal={arXiv preprint arXiv:2505.16834},
+  year={2025}
+}
+```
+## License
+This project is released under the [MIT License](https://github.com/RUCAIBox/SimpleDeepSearcher/blob/main/LICENSE).