vermouthdky commited on
Commit
a8319bc
·
verified ·
1 Parent(s): 36186c1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -6
README.md CHANGED
@@ -16,16 +16,20 @@ tags:
16
  # ActPRM
17
 
18
  ## Links
19
- - [GitHub](https://github.com/sail-sg/ActivePRM)
20
- - [ActivePRM Collection](https://huggingface.co/collections/sail/active-prm-67fccbc10c94c41a6d6b26d9)
21
 
22
  ## Introduction
23
 
24
- We release our process reward model effectively trained with active learning based on `Qwen/Qwen2.5-Math-7B-Instruct`.
 
 
 
 
 
 
 
25
 
26
- <!-- <p align="center">
27
- <img src="https://raw.githubusercontent.com/sail-sg/ActivePRM/refs/heads/main/assets/figure1.png" width=70%/>
28
- </p> -->
29
 
30
  ### Usage
31
  We provide a code snippet to show the usage of our PRM.
 
16
  # ActPRM
17
 
18
  ## Links
19
+ - 💻 [GitHub](https://github.com/sail-sg/ActivePRM)
20
+ - 🤗 [ActivePRM Collection](https://huggingface.co/collections/sail/active-prm-67fccbc10c94c41a6d6b26d9)
21
 
22
  ## Introduction
23
 
24
+ We release our process reward model effectively trained with active learning based on `Qwen/Qwen2.5-Math-PRM-7B`.
25
+ The model achieved 76.0% on [ProcessBench](https://github.com/QwenLM/ProcessBench) and 66.7% on [PRMBench](https://github.com/ssmisya/PRMBench).
26
+ Both achieve new state-of-the-art (SOTA) performance among 7B parameter models (as of 15/04/2025).
27
+
28
+ <p align="center">
29
+ <img src="https://raw.githubusercontent.com/sail-sg/ActivePRM/refs/heads/master/assets/processbench.png" width=80%/>
30
+ <img src="https://raw.githubusercontent.com/sail-sg/ActivePRM/refs/heads/master/assets/prmbench.png" width=80%/>
31
+ </p>
32
 
 
 
 
33
 
34
  ### Usage
35
  We provide a code snippet to show the usage of our PRM.