Update README.md
README.md
CHANGED
|
@@ -114,11 +114,12 @@ outputs = pipeline(
|
|
| 114 |
print(outputs[0]["generated_text"][-1]['content'])
|
| 115 |
````
|
| 116 |
|
| 117 |
To use the model with GenSelect inference, we recommend following our
|
| 118 |
-
[reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py).
|
| 119 |
-
|
| 120 |
-
[prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems.
|
| 121 |
-
We will add the prompt we used for the coding problems and a reference implementation soon!
|
| 122 |
|
| 123 |
You can learn more about GenSelect in these papers:
|
| 124 |
* [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
|
|
@@ -130,7 +131,7 @@ You can learn more about GenSelect in these papers:
|
|
| 130 |
If you find the data useful, please cite:
|
| 131 |
```
|
| 132 |
@article{ahmad2025opencodereasoning,
|
| 133 |
-
title={OpenCodeReasoning: Advancing Data Distillation for Competitive Coding},
|
| 134 |
author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
|
| 135 |
year={2025},
|
| 136 |
eprint={2504.01943},
|
|
@@ -142,7 +143,7 @@ If you find the data useful, please cite:
|
|
| 142 |
|
| 143 |
```
|
| 144 |
@misc{ahmad2025opencodereasoningiisimpletesttime,
|
| 145 |
-
title={OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique},
|
| 146 |
author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
|
| 147 |
year={2025},
|
| 148 |
eprint={2507.09075},
|
|
@@ -154,7 +155,7 @@ If you find the data useful, please cite:
|
|
| 154 |
|
| 155 |
```
|
| 156 |
@misc{moshkov2025aimo2winningsolutionbuilding,
|
| 157 |
-
title={AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset},
|
| 158 |
author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
|
| 159 |
year={2025},
|
| 160 |
eprint={2504.16891},
|
|
@@ -164,6 +165,16 @@ If you find the data useful, please cite:
|
|
| 164 |
}
|
| 165 |
```
|
| 166 |
|
| 167 |
## Additional Information:
|
| 168 |
|
| 169 |
### Deployment Geography:
|
|
|
|
| 114 |
print(outputs[0]["generated_text"][-1]['content'])
|
| 115 |
````
|
| 116 |
|
| 117 |
+
We have added [a simple Transformers-based script](https://huggingface.co/nvidia/OpenReasoning-Nemotron-7B/blob/main/genselect_hf.py) in this repo to illustrate GenSelect.
|
| 118 |
+
To learn how to use the models in GenSelect mode with NeMo-Skills, see our [documentation](https://nvidia.github.io/NeMo-Skills/releases/openreasoning/evaluation/).
|
| 119 |
+
|
| 120 |
To use the model with GenSelect inference, we recommend following our
|
| 121 |
+
[reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py). Alternatively, you can manually extract the summary from all solutions and use this
|
| 122 |
+
[prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems. We will add the prompt we used for the coding problems and a reference implementation soon!
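The manual route described above (collect a summary from each candidate solution, then ask the model to choose among them) might look roughly like the sketch below. It is only an illustration under stated assumptions: the prompt wording, the `Judgment: <index>` answer format, and the helper names `build_genselect_prompt` and `parse_judgment` are hypothetical and are not taken from the linked NeMo-Skills prompt or from `genselect_hf.py`. For real use, substitute the text of the linked `genselect.yaml` prompt.

```python
import re
from typing import List, Optional


def build_genselect_prompt(problem: str, summaries: List[str]) -> str:
    """Assemble one comparison prompt that lists every candidate summary.

    The wording here is a placeholder, not the official GenSelect prompt.
    """
    blocks = [
        f"Solution {i}:\n{summary.strip()}"
        for i, summary in enumerate(summaries)
    ]
    last_index = len(summaries) - 1
    return (
        f"Problem:\n{problem.strip()}\n\n"
        + "\n\n".join(blocks)
        + "\n\nCompare the solutions and pick the best one. "
        + f"End with 'Judgment: <index>' using an index from 0 to {last_index}."
    )


def parse_judgment(response: str, num_candidates: int) -> Optional[int]:
    """Return the last in-range index the model reported, or None."""
    matches = re.findall(r"Judgment:\s*(\d+)", response)
    if not matches:
        return None  # the model never stated a verdict
    index = int(matches[-1])
    # Reject indices that do not correspond to any candidate.
    return index if 0 <= index < num_candidates else None


# Hypothetical usage: the summaries would come from N sampled generations.
candidates = ["The answer is 4.", "The answer is 5."]
prompt = build_genselect_prompt("What is 2 + 2?", candidates)
chosen = parse_judgment("Solution 0 is correct. Judgment: 0", len(candidates))
```

The string in `prompt` would be sent to the model as a user message, and `chosen` indexes back into the candidate list. Returning `None` on a missing or out-of-range verdict lets the caller fall back to another strategy, such as majority voting.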
|
| 123 |
|
| 124 |
You can learn more about GenSelect in these papers:
|
| 125 |
* [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
|
|
|
|
| 131 |
If you find the data useful, please cite:
|
| 132 |
```
|
| 133 |
@article{ahmad2025opencodereasoning,
|
| 134 |
+
title={{OpenCodeReasoning: Advancing Data Distillation for Competitive Coding}},
|
| 135 |
author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
|
| 136 |
year={2025},
|
| 137 |
eprint={2504.01943},
|
|
|
|
| 143 |
|
| 144 |
```
|
| 145 |
@misc{ahmad2025opencodereasoningiisimpletesttime,
|
| 146 |
+
title={{OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique}},
|
| 147 |
author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
|
| 148 |
year={2025},
|
| 149 |
eprint={2507.09075},
|
|
|
|
| 155 |
|
| 156 |
```
|
| 157 |
@misc{moshkov2025aimo2winningsolutionbuilding,
|
| 158 |
+
title={{AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset}},
|
| 159 |
author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
|
| 160 |
year={2025},
|
| 161 |
eprint={2504.16891},
|
|
|
|
| 165 |
}
|
| 166 |
```
|
| 167 |
|
| 168 |
+
```
|
| 169 |
+
@inproceedings{toshniwal2025genselect,
|
| 170 |
+
title={{GenSelect: A Generative Approach to Best-of-N}},
|
| 171 |
+
author={Shubham Toshniwal and Ivan Sorokin and Aleksander Ficek and Ivan Moshkov and Igor Gitman},
|
| 172 |
+
booktitle={2nd AI for Math Workshop @ ICML 2025},
|
| 173 |
+
year={2025},
|
| 174 |
+
url={https://openreview.net/forum?id=8LhnmNmUDb}
|
| 175 |
+
}
|
| 176 |
+
```
|
| 177 |
+
|
| 178 |
## Additional Information:
|
| 179 |
|
| 180 |
### Deployment Geography:
|