Commit 013d0d0 (verified) by purbeshmitra · 1 parent: 9666493

Update README.md

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -24,6 +24,12 @@ In this work, we propose a GRPO based training method for such a system that all
  <img src="assets/multiround_grpo.png" alt="Alt Text" width="750">
  </p>


  ## Usage
  ```python
@@ -39,12 +45,6 @@ SYSTEM_PROMPT = You are a helpful assistant. When the user asks a question, you
  The User provides this detailed progress as additional context in the next round. You then respond again with further thinking and further progress. When the User says that the current round is the final (third) round, you provide an answer inside the answer tags. You also enclose a final answer in third round in the box: \boxed{}. Only this boxed final answer is used for evaluation.
  ```

- ## Results
- Our results are shown below:
- <p align="center">
- <img src="assets/motif_results.png" alt="Alt Text" width="750">
- </p>
-
  ## Citation
  If you find our work useful, consider citing it as:
  ```bibtex
 
  <img src="assets/multiround_grpo.png" alt="Alt Text" width="750">
  </p>

+ ## Results
+ Our results are shown below:
+ <p align="center">
+ <img src="assets/motif_results.png" alt="Alt Text" width="750">
+ </p>
+

  ## Usage
  ```python

  The User provides this detailed progress as additional context in the next round. You then respond again with further thinking and further progress. When the User says that the current round is the final (third) round, you provide an answer inside the answer tags. You also enclose a final answer in third round in the box: \boxed{}. Only this boxed final answer is used for evaluation.
  ```
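Because the system prompt states that only the boxed final answer is used for evaluation, a scoring script has to pull the contents of `\boxed{}` out of the third-round response. The helper below is a minimal sketch of one way to do that, not the authors' evaluation code: the name `extract_boxed_answer` is hypothetical, and it assumes that the last `\boxed{...}` in the text is the one that counts and that any braces inside it are balanced.

```python
from typing import Optional


def extract_boxed_answer(text: str) -> Optional[str]:
    """Return the contents of the last \\boxed{...} in `text`, or None.

    Hypothetical helper for illustration only; the matching rule (last box,
    balanced braces) is an assumption, not taken from the repository.
    """
    marker = "\\boxed{"
    start = text.rfind(marker)  # position of the last box, -1 if absent
    if start == -1:
        return None

    depth = 1  # the opening brace of \boxed{ is already consumed
    chars = []
    for ch in text[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # Matching close brace found: everything collected is the answer.
                return "".join(chars).strip()
        chars.append(ch)

    return None  # the box was never closed
```

For example, `extract_boxed_answer(r"so the result is \boxed{\frac{1}{2}}")` returns the string `\frac{1}{2}`. A response with no box returns `None`, which an evaluation script could count as incorrect.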

  ## Citation
  If you find our work useful, consider citing it as:
  ```bibtex