Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,50 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
datasets:
|
| 4 |
+
- hotpotqa/hotpot_qa
|
| 5 |
+
base_model:
|
| 6 |
+
- Qwen/Qwen2.5-7B-Instruct
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
|
| 10 |
+
## Model Card for RAG-R1
|
| 11 |
+
|
| 12 |
+
### Model Details
|
| 13 |
+
|
| 14 |
+
* **Model Name:** RAG-R1-sq-7b
|
| 15 |
+
* **Version:** 1.0
|
| 16 |
+
* **Model Type:** RAG
|
| 17 |
+
* **Developers:** Zhiwen Tan, Jiaming Huang, Qintong Wu, Hongxuan Zhang, Chenyi Zhuang, Jinjie Gu
|
| 18 |
+
|
| 19 |
+
[](https://arxiv.org/abs/2507.02962) [](https://github.com/inclusionAI/AWorld-RL/tree/main/RAG-R1)
|
| 20 |
+
|
| 21 |
+
### Overview
|
| 22 |
+
|
| 23 |
+
RAG-R1 is a deepsearch training framework designed to enable LLMs to adaptively leverage internal and external knowledge during the reasoning process.
|
| 24 |
+
We further expand the generation and retrieval processes within the framework from single-query mode to multi-query parallelism, aimed at reducing inference time and enhancing the model's capabilities.
|
| 25 |
+
Extensive experiments on seven question-answering benchmarks demonstrate that our method outperforms the strongest baseline by up to 13.2% and decreases inference time by 11.1%.
|
| 26 |
+
|
| 27 |
+
### Framework
|
| 28 |
+
|
| 29 |
+
<img src="RAG-R1.png" style="width:100%;">
|
| 30 |
+
<h5 align="center"> Overall framework of RAG-R1.</h5>
|
| 31 |
+
|
| 32 |
+
### Performance
|
| 33 |
+
|
| 34 |
+
<img src="RAG-R1-result.png" style="width:100%;">
|
| 35 |
+
<h5 align="left">Performance comparisons on QA benchmarks under the EM metric. The best and second
|
| 36 |
+
best results are bold and underlined, respectively.</h5>
|
| 37 |
+
|
| 38 |
+
### Acknowledgements
|
| 39 |
+
RAG-R1 is inspired by [Deepseek-R1](https://github.com/deepseek-ai/DeepSeek-R1) with its implementation based on [veRL](https://github.com/volcengine/verl) and [Search-r1](https://github.com/PeterGriffinJin/Search-R1). We deeply appreciate the contributions of these teams to open-source research and development.
|
| 40 |
+
|
| 41 |
+
### Citation
|
| 42 |
+
Please cite our repo if our works are helpful for your research.
|
| 43 |
+
```
|
| 44 |
+
@article{RAG-R1,
|
| 45 |
+
title={RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism},
|
| 46 |
+
author={Zhiwen Tan and Jiaming Huang and Qintong Wu and Hongxuan Zhang and Chenyi Zhuang and Jinjie Gu},
|
| 47 |
+
journal={arXiv preprint arXiv:2507.02962},
|
| 48 |
+
year={2025}
|
| 49 |
+
}
|
| 50 |
+
```
|