Update README.md
Browse files
README.md
CHANGED
|
@@ -16,16 +16,17 @@ tags:
|
|
| 16 |
# ActPRM
|
| 17 |
|
| 18 |
## Links
|
| 19 |
-
- [GitHub](https://github.com/sail-sg/ActivePRM)
|
| 20 |
-
- [ActivePRM Collection](https://huggingface.co/collections/sail/active-prm-67fccbc10c94c41a6d6b26d9)
|
| 21 |
|
| 22 |
## Introduction
|
| 23 |
|
| 24 |
-
We release our process reward model effectively trained with active learning based on `Qwen/Qwen2.5-Math-7B-Instruct`.
|
| 25 |
|
| 26 |
-
|
| 27 |
-
<img src="https://raw.githubusercontent.com/sail-sg/ActivePRM/refs/heads/
|
| 28 |
-
|
|
|
|
| 29 |
|
| 30 |
### Usage
|
| 31 |
We provide a code snippet to show the usage of our PRM.
|
|
|
|
| 16 |
# ActPRM
|
| 17 |
|
| 18 |
## Links
|
| 19 |
+
- 💻 [GitHub](https://github.com/sail-sg/ActivePRM)
|
| 20 |
+
- 🤗 [ActivePRM Collection](https://huggingface.co/collections/sail/active-prm-67fccbc10c94c41a6d6b26d9)
|
| 21 |
|
| 22 |
## Introduction
|
| 23 |
|
| 24 |
+
We release our process reward model effectively trained with active learning based on `Qwen/Qwen2.5-Math-7B-Instruct`. The model achieved 75.0% on [ProcessBench](https://github.com/QwenLM/ProcessBench) and 65.5% on [PRMBench](https://github.com/ssmisya/PRMBench).
|
| 25 |
|
| 26 |
+
<p align="center">
|
| 27 |
+
<img src="https://raw.githubusercontent.com/sail-sg/ActivePRM/refs/heads/master/assets/processbench.png" width=70%/>
|
| 28 |
+
<img src="https://raw.githubusercontent.com/sail-sg/ActivePRM/refs/heads/master/assets/prmbench.png" width=70%/>
|
| 29 |
+
</p>
|
| 30 |
|
| 31 |
### Usage
|
| 32 |
We provide a code snippet to show the usage of our PRM.
|