dipta007 commited on
Commit
70ca118
·
verified ·
1 Parent(s): 54b21c8
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -85,6 +85,12 @@ GRPO is supervised with a sum of seven rewards, grouped into three families:
85
 
86
  ## Quickstart
87
 
 
 
 
 
 
 
88
  DecomposeRL expects a specific verification prompt around your `claim` + `evidence_doc`. The `build_prompt` helper below wraps them for you so you don't have to construct the full instruction block every time.
89
 
90
  ```python
 
85
 
86
  ## Quickstart
87
 
88
+ A complete runnable script is included in the repo as [`example.py`](./example.py) (download it [here](https://huggingface.co/dipta007/decomposeRL-7b/resolve/main/example.py)):
89
+
90
+ ```bash
91
+ python example.py
92
+ ```
93
+
94
  DecomposeRL expects a specific verification prompt around your `claim` + `evidence_doc`. The `build_prompt` helper below wraps them for you so you don't have to construct the full instruction block every time.
95
 
96
  ```python