RLHFlow/Llama3.1-8B-PRM-Mistral-Data


weqweasdas committed on Nov 9, 2024

Commit 0a4c93a · verified · 1 Parent(s): b3abd8c

Update README.md

Files changed (1)

README.md +1 -1


@@ -35,7 +35,7 @@ The model is trained from [meta-llama/Llama-3.1-8B-Instruct](https://huggingface
 ## Usage
-See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math for detailed examples.
+See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math-rm for detailed examples.
 ## Citation