---
license: apache-2.0
datasets:
- hotpotqa/hotpot_qa
base_model:
- Qwen/Qwen2.5-7B-Instruct
---
## Model Card for RAG-R1
### Model Details
* **Model Name:** RAG-R1-sq-7b
* **Version:** 1.0
* **Model Type:** RAG
* **Developers:** Zhiwen Tan, Jiaming Huang, Qintong Wu, Hongxuan Zhang, Chenyi Zhuang, Jinjie Gu
[](https://arxiv.org/abs/2507.02962) [](https://github.com/inclusionAI/AWorld-RL/tree/main/RAG-R1)
### Overview
RAG-R1 is a deepsearch training framework designed to enable LLMs to adaptively leverage internal and external knowledge during the reasoning process.
We further expand the generation and retrieval processes within the framework from single-query mode to multi-query parallelism, aimed at reducing inference time and enhancing the model's capabilities.
Extensive experiments on seven question-answering benchmarks demonstrate that our method outperforms the strongest baseline by up to 13.2% and decreases inference time by 11.1%.
### Framework