inclusionAI
/

ASearcher-Web-QwQ-V2

Model card Files Files and versions

ASearcher-Web-QwQ-V2 / README.md

samjia2000's picture

Update README.md

6af204d verified 6 months ago

|

2.71 kB

	# ASearcher-Web-QwQ-V2

	## Overview

	ASearcher-Web-QwQ-V2 is a 32B-scale search agent trained using large-scale reinforcement learning. This model represents an improved version of the ASearcher framework, achieving cutting-edge performance on challenging web search benchmarks through advanced agentic RL training techniques.

	## Key Features

	- 🏆 Cutting-Edge Performance: Achieves Avg@4 scores of 58.7, 51.1, and 74.5 on GAIA, xBench-DeepSearch, and Frames benchmarks respectively
	- ⚡ Fully Asynchronous RL Training: Enables efficient long-horizon search capabilities with tool calls exceedind 100 rounds
	- 🔁 Advanced Data Synthesis: Trained on autonomously generated QA pairs with rigorous multi-stage validation
	- 🌐 Real Web Search Capabilities: Designed to interact with live web search tools for up-to-date information retrieval

	## Performance Highlights

	\| Benchmark \| Avg@4 Score \| Pass@4 Score \|
	\|-----------\|------------\|-------------\|
	\| GAIA \| 58.7 \| 74.7 \|
	\| xBench-DeepSearch \| 51.1 \| 75.0 \|
	\| Frames \| 74.5 \| 85.5 \|

	Substantial RL Improvements: Reinforcement learning training brings significant gains:
	- +15.0 improvement on GAIA
	- +22.4 improvement on xBench-DeepSearch
	- +14.6 improvement on Frames

	## Quick Start


	### Evaluation

	To reproduce the benchmark results:

	```bash
	cd evaluation/
	python search_eval_async.py \
	--model_name_or_path inclusionAI/ASearcher-Web-QwQ-V2 \
	--data_names GAIA,xbench-deepsearch,Frames \
	--agent-type asearcher-reasoning \
	--search-client-type async-web-search-access
	```

	## Training Details

	This model was trained using:
	- Architecture: QwQ-32B
	- Training Method: Fully asynchronous reinforcement learning
	- Data: Synthesized QA pairs with multi-stage validation
	- Framework: AReaL

	## Applications

	- Complex web search and information retrieval
	- Multi-step problem solving with tool usage
	- Real-time information gathering and synthesis
	- Long-horizon reasoning tasks


	## Citation

	If you use this model, please cite:

	```bibtex
	@misc{gao2025turnsunlockinglonghorizonagentic,
	title={Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL},
	author={Jiaxuan Gao and Wei Fu and Minyang Xie and Shusheng Xu and Chuyi He and Zhiyu Mei and Banghua Zhu and Yi Wu},
	year={2025},
	eprint={2508.07976},
	archivePrefix={arXiv},
	primaryClass={cs.CL},
	url={https://arxiv.org/abs/2508.07976},
	}
	```

	## License

	---
	license: apache-2.0
	---


	## Contact

	For questions and support, please refer to the [ASearcher GitHub repository](https://github.com/inclusionAI/ASearcher) or open an issue on the project page.