Update README.md
Browse files
README.md
CHANGED
|
@@ -35,7 +35,7 @@ The model is trained from [meta-llama/Llama-3.1-8B-Instruct](https://huggingface
|
|
| 35 |
|
| 36 |
## Usage
|
| 37 |
|
| 38 |
-
See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math for detailed examples.
|
| 39 |
|
| 40 |
## Citation
|
| 41 |
|
|
|
|
| 35 |
|
| 36 |
## Usage
|
| 37 |
|
| 38 |
+
See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math-rm for detailed examples.
|
| 39 |
|
| 40 |
## Citation
|
| 41 |
|