Safetensors
English
qwen2
Files changed (1) hide show
  1. README.md +26 -14
README.md CHANGED
@@ -1,14 +1,26 @@
1
- ---
2
- license: apache-2.0
3
- datasets:
4
- - SJTULean/LeanStatement_SFT
5
- - SJTULean/LeanStatement_RL
6
- language:
7
- - en
8
- metrics:
9
- - accuracy
10
- base_model:
11
- - Qwen/Qwen2.5-7B-Instruct
12
- ---
13
- Through PPO using a reward model trained on merely 5% of the LeanStatement_RL dataset, our model achieved enhanced performance, reaching a **pass@1 compilation success rate** of **95.3%** (465/488) on MiniF2F and **76.0%** (284/374) on ProofNet benchmarks.
14
- Combined with our SFT results, these findings establish a new **state-of-the-art** in mathematics formalization as of December 2024.
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - SJTULean/LeanStatement_SFT
5
+ - SJTULean/LeanStatement_RL
6
+ language:
7
+ - zho
8
+ - eng
9
+ - fra
10
+ - spa
11
+ - por
12
+ - deu
13
+ - ita
14
+ - rus
15
+ - jpn
16
+ - kor
17
+ - vie
18
+ - tha
19
+ - ara
20
+ metrics:
21
+ - accuracy
22
+ base_model:
23
+ - Qwen/Qwen2.5-7B-Instruct
24
+ ---
25
+ Through PPO using a reward model trained on merely 5% of the LeanStatement_RL dataset, our model achieved enhanced performance, reaching a **pass@1 compilation success rate** of **95.3%** (465/488) on MiniF2F and **76.0%** (284/374) on ProofNet benchmarks.
26
+ Combined with our SFT results, these findings establish a new **state-of-the-art** in mathematics formalization as of December 2024.