Update README.md
Browse files
README.md
CHANGED
|
@@ -24,6 +24,12 @@ In this work, we propose a GRPO based training method for such a system that all
|
|
| 24 |
<img src="assets/multiround_grpo.png" alt="Alt Text" width="750">
|
| 25 |
</p>
|
| 26 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 27 |
|
| 28 |
## Usage
|
| 29 |
```python
|
|
@@ -39,12 +45,6 @@ SYSTEM_PROMPT = You are a helpful assistant. When the user asks a question, you
|
|
| 39 |
The User provides this detailed progress as additional context in the next round. You then respond again with further thinking and further progress. When the User says that the current round is the final (third) round, you provide an answer inside the answer tags. You also enclose a final answer in third round in the box: \boxed{}. Only this boxed final answer is used for evaluation.
|
| 40 |
```
|
| 41 |
|
| 42 |
-
## Results
|
| 43 |
-
Our results are shown below:
|
| 44 |
-
<p align="center">
|
| 45 |
-
<img src="assets/motif_results.png" alt="Alt Text" width="750">
|
| 46 |
-
</p>
|
| 47 |
-
|
| 48 |
## Citation
|
| 49 |
If you find our work useful, consider citing it as:
|
| 50 |
```bibtex
|
|
|
|
| 24 |
<img src="assets/multiround_grpo.png" alt="Alt Text" width="750">
|
| 25 |
</p>
|
| 26 |
|
| 27 |
+
## Results
|
| 28 |
+
Our results are shown below:
|
| 29 |
+
<p align="center">
|
| 30 |
+
<img src="assets/motif_results.png" alt="Alt Text" width="750">
|
| 31 |
+
</p>
|
| 32 |
+
|
| 33 |
|
| 34 |
## Usage
|
| 35 |
```python
|
|
|
|
| 45 |
The User provides this detailed progress as additional context in the next round. You then respond again with further thinking and further progress. When the User says that the current round is the final (third) round, you provide an answer inside the answer tags. You also enclose a final answer in third round in the box: \boxed{}. Only this boxed final answer is used for evaluation.
|
| 46 |
```
|
| 47 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
## Citation
|
| 49 |
If you find our work useful, consider citing it as:
|
| 50 |
```bibtex
|