Hush-cd nielsr HF Staff committed on
Commit d58b5ed · verified · 1 Parent(s): cf02a7a

Add pipeline_tag, library_name and link to paper (#1)

- Add pipeline_tag, library_name and link to paper (cb936155815809e4214605481ba465ddee1436dc)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +70 -60
README.md CHANGED
@@ -1,61 +1,71 @@
- ---
- inference: false
- language:
- - en
- - zh
- tags:
- - instruction-finetuning
- task_categories:
- - text-generation
- base_model:
- - Qwen/Qwen2.5-32B-Instruct
- license: cc-by-nc-nd-4.0
- ---
- <h1 align="center">
- 🔍 xVerify-32B-I
- </h1>
-
- <p align="center">
- <div style="display: flex; justify-content: center; gap: 10px;">
- <a href="https://github.com/IAAR-Shanghai/xVerify">
- <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
- </a>
- <a href="https://huggingface.co/IAAR-Shanghai/xVerify-32B-I">
- <img src="https://img.shields.io/badge/🤗%20Hugging%20Face-xVerify--32B--I-yellow" alt="Hugging Face"/>
- </a>
- </div>
- </p>
- xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
-
- ---
-
- ## ✨ Key Features
-
- ### 📊 Broad Applicability
- Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
-
- ### ⛓️ Handles Long Reasoning Chains
- Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
-
- ### 🌐 Multilingual Support
- Primarily handles Chinese and English responses while remaining compatible with other languages.
-
- ### 🔄 Powerful Equivalence Judgment
- - ✓ Recognizes basic transformations like letter case changes and Greek letter conversions
- - ✓ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
- - ✓ Determines semantic equivalence in natural language answers
- - ✓ Matches multiple-choice responses by content rather than just option identifiers
-
- ---
-
-
- ## 📚 Citation
-
- ```bibtex
- @article{xVerify,
- title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
- author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
- journal={arXiv preprint arXiv:2504.10481},
- year={2025},
- }
+ ---
+ base_model:
+ - Qwen/Qwen2.5-32B-Instruct
+ language:
+ - en
+ - zh
+ license: cc-by-nc-nd-4.0
+ tags:
+ - instruction-finetuning
+ - evaluation
+ - reasoning
+ inference: false
+ pipeline_tag: text-generation
+ library_name: transformers
+ arxiv: 2504.10481
+ ---
+
+ <h1 align="center">
+ 🔍 xVerify-32B-I
+ </h1>
+
+ <p align="center">
+ <div style="display: flex; justify-content: center; gap: 10px;">
+ <a href="https://github.com/IAAR-Shanghai/xVerify">
+ <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
+ </a>
+ <a href="https://huggingface.co/IAAR-Shanghai/xVerify-32B-I">
+ <img src="https://img.shields.io/badge/🤗%20Hugging%20Face-xVerify--32B--I-yellow" alt="Hugging Face"/>
+ </a>
+ <a href="https://huggingface.co/papers/2504.10481">
+ <img src="https://img.shields.io/badge/Paper-arXiv-red?logo=arxiv" alt="Paper"/>
+ </a>
+ </div>
+ </p>
+
+ xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
+
+ This model is presented in the paper [xVerify: Efficient Answer Verifier for Reasoning Model Evaluations](https://huggingface.co/papers/2504.10481).
+
+ ---
+
+ ## ✨ Key Features
+
+ ### 📊 Broad Applicability
+ Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
+
+ ### ⛓️ Handles Long Reasoning Chains
+ Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
+
+ ### 🌐 Multilingual Support
+ Primarily handles Chinese and English responses while remaining compatible with other languages.
+
+ ### 🔄 Powerful Equivalence Judgment
+ - ✓ Recognizes basic transformations like letter case changes and Greek letter conversions
+ - ✓ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
+ - ✓ Determines semantic equivalence in natural language answers
+ - ✓ Matches multiple-choice responses by content rather than just option identifiers
+
+ ---
+
+
+ ## 📚 Citation
+
+ ```bibtex
+ @article{xVerify,
+ title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
+ author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
+ journal={arXiv preprint arXiv:2504.10481},
+ year={2025},
+ }
  ```